Super-Wideband Spectral Envelope Modeling for Speech Coding

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.authorFuchs, Guillaumeen_US
dc.contributor.authorAshour, Chamranen_US
dc.contributor.authorBäckström, Tomen_US
dc.contributor.departmentDepartment of Signal Processing and Acousticsen
dc.contributor.groupauthorSpeech Communication Technologyen
dc.contributor.groupauthorSpeech Interaction Technologyen
dc.contributor.organizationFraunhofer Institute for Integrated Circuitsen_US
dc.date.accessioned2019-10-01T12:08:19Z
dc.date.available2019-10-01T12:08:19Z
dc.date.issued2019-09en_US
dc.description.abstractSignificant improvements in the quality of speech coders have been achieved by widening the coded frequency range from narrowband to wideband. However, existing speech coders still employ a limited band source-filter model extended by parametric coding of the higher band. In the present work, a superwideband source-filter model running at 32 kHz is considered and especially its spectral magnitude envelope modeling. To match super-wideband operating mode, we adapted and compared two methods; Linear Predictive Coding (LPC) and Distribution Quantization (DQ). LPC uses autoregressive modeling, while DQ quantifies the energy ratios between different parts of the spectrum. Parameters of both methods were quantized with a multi-stage vector quantization. Objective and subjective evaluations indicate that both methods used in a super-wideband source-filter coding scheme offer the same quality range, making them an attractive alternative to conventional speech coders that require additional bandwidth extension.en
dc.description.versionPeer revieweden
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationFuchs, G, Ashour, C & Bäckström, T 2019, Super-Wideband Spectral Envelope Modeling for Speech Coding. in Proceedings of Interspeech. Interspeech - Annual Conference of the International Speech Communication Association, International Speech Communication Association (ISCA), pp. 3411-3415, Interspeech, Graz, Austria, 15/09/2019. https://doi.org/10.21437/Interspeech.2019-1620en
dc.identifier.doi10.21437/Interspeech.2019-1620en_US
dc.identifier.issn2308-457X
dc.identifier.otherPURE UUID: e8fd46cf-eef4-4f01-b4f4-bbfac9da5d4aen_US
dc.identifier.otherPURE ITEMURL: https://research.aalto.fi/en/publications/e8fd46cf-eef4-4f01-b4f4-bbfac9da5d4aen_US
dc.identifier.otherPURE LINK: https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1620.pdfen_US
dc.identifier.otherPURE FILEURL: https://research.aalto.fi/files/37129476/ELEC_Fuchs_Super_wideband_Interspeech2019.pdfen_US
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/40549
dc.identifier.urnURN:NBN:fi:aalto-201910015569
dc.language.isoenen
dc.relation.ispartofInterspeechen
dc.relation.ispartofseriesProceedings of Interspeechen
dc.relation.ispartofseriespp. 3411-3415en
dc.relation.ispartofseriesInterspeech - Annual Conference of the International Speech Communication Associationen
dc.rightsopenAccessen
dc.subject.keywordLPCen_US
dc.subject.keywordSpectral envelope modelingen_US
dc.subject.keywordSpeech Codingen_US
dc.titleSuper-Wideband Spectral Envelope Modeling for Speech Codingen
dc.typeA4 Artikkeli konferenssijulkaisussafi
dc.type.versionpublishedVersion

Files