Donate Speech: Collecting and Sharing a Large-Scale Speech Database for Social Sciences, Humanities and Artificial Intelligence Research and Innovation

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorLindén, Krister
dc.contributor.authorJauhiainen, Tommi
dc.contributor.authorLennes, Mietta
dc.contributor.authorKurimo, Mikko
dc.contributor.authorRossi, Aleksi
dc.contributor.authorKurki, Tommi
dc.contributor.authorPitkänen, Olli
dc.contributor.departmentUniversity of Helsinki
dc.contributor.departmentSpeech Recognition
dc.contributor.departmentYleisradio
dc.contributor.departmentUniversity of Turku
dc.contributor.department1001 Lakes Oy
dc.date.accessioned2023-02-01T09:11:11Z
dc.date.available2023-02-01T09:11:11Z
dc.date.issued2022-10
dc.description.abstractThe Donate Speech campaign aimed to collect 10 000 hours of ordinary, casual Finnish speech to be used for studying language as well as for developing technology and services that can be readily used in the languages spoken in Finland. In this project, particular attention has been paid to allowing for both academic and commercial use of the material. Even though the ambitious target currently seems to evade us, the Donate Speech campaign has managed to collect an extensive resource of more than 3500 h of Finnish colloquial speech with more than 200 000 speech recordings by roughly 50 000 speakers from all over Finland in just a few months.en
dc.description.versionPeer revieweden
dc.format.extent30
dc.format.mimetypeapplication/pdf
dc.identifier.citationLindén , K , Jauhiainen , T , Lennes , M , Kurimo , M , Rossi , A , Kurki , T & Pitkänen , O 2022 , Donate Speech: Collecting and Sharing a Large-Scale Speech Database for Social Sciences, Humanities and Artificial Intelligence Research and Innovation . in CLARIN : the infrastructure for language resources . Digital Linguistics , vol. 1 , De Gruyter . https://doi.org/10.1515/9783110767377-019en
dc.identifier.doi10.1515/9783110767377-019
dc.identifier.isbn978-3-11-076734-6
dc.identifier.isbn978-3-11-076737-7
dc.identifier.otherPURE UUID: 642eaa89-1930-445e-9b87-ff09575f1360
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/642eaa89-1930-445e-9b87-ff09575f1360
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/99001714/10.1515_9783110767377_019.pdf
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/119541
dc.identifier.urnURN:NBN:fi:aalto-202302011891
dc.language.isoenen
dc.relation.ispartofseriesCLARIN : the infrastructure for language resourcesen
dc.rightsopenAccessen
dc.subject.keywordspeech resources
dc.subject.keywordcolloquial speech
dc.subject.keywordlarge-scale data collection
dc.subject.keywordacademic and commercial use
dc.titleDonate Speech: Collecting and Sharing a Large-Scale Speech Database for Social Sciences, Humanities and Artificial Intelligence Research and Innovationen
dc.typeA3 Kirjan tai muun kokoomateoksen osafi
dc.type.versionpublishedVersion
Files