Analytic Filter Bank for Speech Analysis, Feature Extraction and Perceptual Studies
Access rights
openAccess
© 2017 ISCA. This article was originally published in the Proceedings of Interspeech 2017: Laine, U.K. (2017) Analytic Filter Bank for Speech Analysis, Feature Extraction and Perceptual Studies. Proc. Interspeech 2017, 449-453, DOI: 10.21437/Interspeech.2017-1232.
publishedVersion
A4 Article in conference proceedings
This publication is imported from Aalto University research portal.
Authors
Laine, U.K.
Date
2017-08
Language
en
Pages
5
Series
Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Volume 2017-August, pp. 449-453, Interspeech: Annual Conference of the International Speech Communication Association
Abstract
A speech signal consists of events in time and frequency, so its analysis often calls for high-resolution time-frequency tools. An analytic filter bank provides a simple, fast, and flexible method for constructing time-frequency representations of signals. Its parameters can easily be adapted to different situations, from a uniform frequency scale to any auditory frequency scale, or even to a focused resolution. Since the Hilbert magnitude values of the channels are obtained at every sample, it is a practical tool for high-resolution time-frequency analysis. The present study describes the basic theory of analytic filters and tests their main properties. Applications of the analytic filter bank to different speech analysis tasks are demonstrated, including pitch period estimation and pitch-synchronous analysis of formant frequencies and bandwidths. In addition, a new feature vector called the group delay vector is introduced. It is shown that this representation provides results comparable to, or even better than, those obtained with spectral magnitude feature vectors in the analysis and classification of vowels. The implications of this observation are also discussed from the point of view of speech perception.
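As a rough illustration of the idea summarized in the abstract (not the paper's implementation), the sketch below builds each channel as a complex FIR filter: a real lowpass window multiplied by a complex exponential at the channel's center frequency. The magnitude of the complex output then gives an envelope value at every sample. The Gaussian window, the bandwidth rule, the factor of 2, and the names `analytic_filter_bank`, `centers_hz`, and `bandwidth_hz` are all assumptions made for this example.

```python
import numpy as np

def analytic_filter_bank(x, fs, centers_hz, bandwidth_hz):
    """Per-sample envelope magnitudes, shape (len(centers_hz), len(x)).

    Illustrative sketch only: Gaussian-windowed complex exponentials,
    which are approximately analytic when bandwidth_hz << center frequency.
    """
    x = np.asarray(x, dtype=float)
    # Assumed rule: Gaussian with frequency-domain std equal to bandwidth_hz.
    sigma_samples = fs / (2.0 * np.pi * bandwidth_hz)
    half = int(np.ceil(4.0 * sigma_samples))       # truncate at +/- 4 sigma
    n = np.arange(-half, half + 1)
    window = np.exp(-0.5 * (n / sigma_samples) ** 2)
    window /= window.sum()                         # unit gain at the center frequency

    envelopes = np.empty((len(centers_hz), len(x)))
    for k, fc in enumerate(centers_hz):
        # Complex modulation shifts the lowpass window to +fc only.
        h = 2.0 * window * np.exp(2j * np.pi * fc * n / fs)
        y = np.convolve(x, h, mode="same")         # complex band signal
        envelopes[k] = np.abs(y)                   # envelope at every sample
    return envelopes
```

Called with a real signal and a list of center frequencies, the function returns one envelope track per channel. The center frequencies can follow any spacing, uniform or auditory, since each channel is designed independently. The factor of 2 compensates for discarding the negative-frequency half of a real input, so a steady sinusoid at a channel's center frequency yields an envelope close to its amplitude away from the signal edges.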
Keywords
speech analysis, pitch-synchronous analysis, time-frequency methods
Citation
Laine, U 2017, Analytic Filter Bank for Speech Analysis, Feature Extraction and Perceptual Studies. in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. vol. 2017-August, Interspeech: Annual Conference of the International Speech Communication Association, International Speech Communication Association (ISCA), pp. 449-453, Interspeech, Stockholm, Sweden, 20/08/2017. https://doi.org/10.21437/Interspeech.2017-1232