Object-based modelling for representing and processing speech corpora

 |  Login

Show simple item record

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.author Altosaar, Toomas
dc.date.accessioned 2012-02-13T12:26:54Z
dc.date.available 2012-02-13T12:26:54Z
dc.date.issued 2001-09-28
dc.identifier.isbn 951-22-5623-1
dc.identifier.issn 1456-6303
dc.identifier.uri https://aaltodoc.aalto.fi/handle/123456789/2349
dc.description.abstract This thesis deals with modelling data existing in large speech corpora using an object-oriented paradigm which captures important linguistic structures. Information from corpora is transformed into objects and are assigned properties regarding their behaviour. These objects, called speech units, are placed onto a multi-dimensional framework and have their relationships to other units explicitly defined through the use of links. Frameworks that model temporal utterances or atemporal information like speaker characteristics and recording conditions can be searched efficiently for contextual matches. Speech units that match desired contexts are the result of successful linguistically motivated queries and can be used in further speech processing tasks in the same computational environment. This allows for empirical studies of speech and its relation to linguistic structures to be carried out, and for the training and testing of applications like speech recognition and synthesis. Information residing in typical speech corpora is discussed first, followed by an overview of object-orientation which sets the tone for this thesis. Then the representation framework is introduced which is generated by a compiler and linker that rely on a set of domain-specific resources that transform corpus data into speech units. Operations on this framework are then presented along with a comparison between a relational and object-oriented model of identical speech data. The models described in this work are directly applicable to existing large speech corpora, and the methods developed here are tested against relational database methods. The object-oriented methods outperform the relational methods for typical linguistically relevant queries by about three orders of magnitude as measured by database search times. This improvement in simplicity of representation and search speed is crucial for the utilisation of large multi-lingual corpora in basic research on the detailed properties of speech, especially in relation to contextual variation. en
dc.format.extent 92
dc.format.mimetype application/pdf
dc.language.iso en en
dc.publisher Helsinki University of Technology en
dc.publisher Teknillinen korkeakoulu fi
dc.relation.ispartofseries Report / Helsinki University of Technology Laboratory of Acoustics and Audio Signal Processing en
dc.relation.ispartofseries Raportti / Teknillinen korkeakoulu, akustiikan ja äänenkäsittelytekniikan laboratorio fi
dc.relation.ispartofseries 63 en
dc.subject.other Electrical engineering en
dc.title Object-based modelling for representing and processing speech corpora en
dc.type G4 Monografiaväitöskirja fi
dc.description.version reviewed en
dc.contributor.department Department of Electrical and Communications Engineering en
dc.contributor.department Sähkö- ja tietoliikennetekniikan osasto fi
dc.subject.keyword speech corpora en
dc.subject.keyword speech database en
dc.subject.keyword object-oriented model en
dc.subject.keyword database access en
dc.subject.keyword speech processing en
dc.identifier.urn urn:nbn:fi:tkk-002940
dc.type.dcmitype text en
dc.type.ontasot Väitöskirja (monografia) fi
dc.type.ontasot Doctoral dissertation (monograph) en
dc.contributor.lab Laboratory of Acoustics and Audio Signal Processing en
dc.contributor.lab Akustiikan ja äänenkäsittelytekniikan laboratorio fi


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search archive


Advanced Search

article-iconSubmit a publication

Browse

My Account