Object-based modelling for representing and processing speech corpora

dc.contributor: Aalto-yliopisto [fi]
dc.contributor: Aalto University [en]
dc.contributor.author: Altosaar, Toomas
dc.contributor.department: Department of Electrical and Communications Engineering [en]
dc.contributor.department: Sähkö- ja tietoliikennetekniikan osasto [fi]
dc.contributor.lab: Laboratory of Acoustics and Audio Signal Processing [en]
dc.contributor.lab: Akustiikan ja äänenkäsittelytekniikan laboratorio [fi]
dc.date.accessioned: 2012-02-13T12:26:54Z
dc.date.available: 2012-02-13T12:26:54Z
dc.date.issued: 2001-09-28
dc.description.abstract: This thesis deals with modelling data existing in large speech corpora using an object-oriented paradigm which captures important linguistic structures. Information from corpora is transformed into objects, which are assigned properties regarding their behaviour. These objects, called speech units, are placed onto a multi-dimensional framework and have their relationships to other units explicitly defined through the use of links. Frameworks that model temporal utterances or atemporal information like speaker characteristics and recording conditions can be searched efficiently for contextual matches. Speech units that match desired contexts are the result of successful linguistically motivated queries and can be used in further speech processing tasks in the same computational environment. This allows empirical studies of speech and its relation to linguistic structures to be carried out, and supports the training and testing of applications like speech recognition and synthesis. Information residing in typical speech corpora is discussed first, followed by an overview of object-orientation which sets the tone for this thesis. The representation framework is then introduced; it is generated by a compiler and linker that rely on a set of domain-specific resources to transform corpus data into speech units. Operations on this framework are then presented along with a comparison between a relational and an object-oriented model of identical speech data. The models described in this work are directly applicable to existing large speech corpora, and the methods developed here are tested against relational database methods. The object-oriented methods outperform the relational methods for typical linguistically relevant queries by about three orders of magnitude as measured by database search times.
This improvement in simplicity of representation and search speed is crucial for the utilisation of large multi-lingual corpora in basic research on the detailed properties of speech, especially in relation to contextual variation. [en]
dc.description.version: reviewed [en]
dc.format.extent: 92
dc.format.mimetype: application/pdf
dc.identifier.isbn: 951-22-5623-1
dc.identifier.issn: 1456-6303
dc.identifier.uri: https://aaltodoc.aalto.fi/handle/123456789/2349
dc.identifier.urn: urn:nbn:fi:tkk-002940
dc.language.iso: en [en]
dc.publisher: Helsinki University of Technology [en]
dc.publisher: Teknillinen korkeakoulu [fi]
dc.relation.ispartofseries: Report / Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing [en]
dc.relation.ispartofseries: Raportti / Teknillinen korkeakoulu, akustiikan ja äänenkäsittelytekniikan laboratorio [fi]
dc.relation.ispartofseries: 63 [en]
dc.subject.keyword: speech corpora [en]
dc.subject.keyword: speech database [en]
dc.subject.keyword: object-oriented model [en]
dc.subject.keyword: database access [en]
dc.subject.keyword: speech processing [en]
dc.subject.other: Electrical engineering [en]
dc.title: Object-based modelling for representing and processing speech corpora [en]
dc.type: G4 Monografiaväitöskirja [fi]
dc.type.dcmitype: text [en]
dc.type.ontasot: Väitöskirja (monografia) [fi]
dc.type.ontasot: Doctoral dissertation (monograph) [en]
local.aalto.digiauth: ask
local.aalto.digifolder: Aalto_67713

Files

Original bundle

Name: isbn9512256231.pdf
Size: 16.38 MB
Format: Adobe Portable Document Format