Super-Wideband Spectral Envelope Modeling for Speech Coding
dc.contributor | Aalto-yliopisto | fi |
dc.contributor | Aalto University | en |
dc.contributor.author | Fuchs, Guillaume | en_US |
dc.contributor.author | Ashour, Chamran | en_US |
dc.contributor.author | Bäckström, Tom | en_US |
dc.contributor.department | Department of Signal Processing and Acoustics | en |
dc.contributor.groupauthor | Speech Communication Technology | en |
dc.contributor.groupauthor | Speech Interaction Technology | en |
dc.contributor.organization | Fraunhofer Institute for Integrated Circuits | en_US |
dc.date.accessioned | 2019-10-01T12:08:19Z | |
dc.date.available | 2019-10-01T12:08:19Z | |
dc.date.issued | 2019-09 | en_US |
dc.description.abstract | Significant improvements in the quality of speech coders have been achieved by widening the coded frequency range from narrowband to wideband. However, existing speech coders still employ a limited band source-filter model extended by parametric coding of the higher band. In the present work, a superwideband source-filter model running at 32 kHz is considered and especially its spectral magnitude envelope modeling. To match super-wideband operating mode, we adapted and compared two methods; Linear Predictive Coding (LPC) and Distribution Quantization (DQ). LPC uses autoregressive modeling, while DQ quantifies the energy ratios between different parts of the spectrum. Parameters of both methods were quantized with a multi-stage vector quantization. Objective and subjective evaluations indicate that both methods used in a super-wideband source-filter coding scheme offer the same quality range, making them an attractive alternative to conventional speech coders that require additional bandwidth extension. | en |
dc.description.version | Peer reviewed | en |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.citation | Fuchs, G, Ashour, C & Bäckström, T 2019, Super-Wideband Spectral Envelope Modeling for Speech Coding. in Proceedings of Interspeech. Interspeech - Annual Conference of the International Speech Communication Association, International Speech Communication Association (ISCA), pp. 3411-3415, Interspeech, Graz, Austria, 15/09/2019. https://doi.org/10.21437/Interspeech.2019-1620 | en |
dc.identifier.doi | 10.21437/Interspeech.2019-1620 | en_US |
dc.identifier.issn | 2308-457X | |
dc.identifier.other | PURE UUID: e8fd46cf-eef4-4f01-b4f4-bbfac9da5d4a | en_US |
dc.identifier.other | PURE ITEMURL: https://research.aalto.fi/en/publications/e8fd46cf-eef4-4f01-b4f4-bbfac9da5d4a | en_US |
dc.identifier.other | PURE LINK: https://www.isca-speech.org/archive/Interspeech_2019/pdfs/1620.pdf | en_US |
dc.identifier.other | PURE FILEURL: https://research.aalto.fi/files/37129476/ELEC_Fuchs_Super_wideband_Interspeech2019.pdf | en_US |
dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/40549 | |
dc.identifier.urn | URN:NBN:fi:aalto-201910015569 | |
dc.language.iso | en | en |
dc.relation.ispartof | Interspeech | en |
dc.relation.ispartofseries | Proceedings of Interspeech | en |
dc.relation.ispartofseries | pp. 3411-3415 | en |
dc.relation.ispartofseries | Interspeech - Annual Conference of the International Speech Communication Association | en |
dc.rights | openAccess | en |
dc.subject.keyword | LPC | en_US |
dc.subject.keyword | Spectral envelope modeling | en_US |
dc.subject.keyword | Speech Coding | en_US |
dc.title | Super-Wideband Spectral Envelope Modeling for Speech Coding | en |
dc.type | A4 Artikkeli konferenssijulkaisussa | fi |
dc.type.version | publishedVersion |