Frequency Domain Methods for Coding the Linear Predictive Residual of Speech Signals

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.advisor Markovic, Goran
dc.contributor.author Perez Zarazaga, Pablo
dc.date.accessioned 2017-09-04T10:34:04Z
dc.date.available 2017-09-04T10:34:04Z
dc.date.issued 2017-08-28
dc.identifier.uri https://aaltodoc.aalto.fi/handle/123456789/27924
dc.description.abstract The most frequently used speech coding paradigm is ACELP, which is popular because it encodes speech with high quality while consuming little bandwidth. ACELP applies linear prediction filtering to remove the effect of the spectral envelope from the signal. The noise-like excitation is then encoded using algebraic codebooks. The search of this codebook, however, cannot be performed optimally with conventional encoders because the samples are correlated. As a result, more complex algorithms are required to maintain quality. Four transformation algorithms (DCT, DFT, eigenvalue decomposition and Vandermonde decomposition) have been implemented to decorrelate the samples of the innovative excitation in ACELP. These transformations have been integrated into the ACELP of the EVS codec. The transformed innovative excitation is coded using the envelope-based arithmetic coder. Objective and subjective tests have been carried out to evaluate the quality of the encoding, the degree of decorrelation achieved by the transformations, and the computational complexity of the algorithms. en
dc.format.extent (6) + 72
dc.format.mimetype application/pdf en
dc.language.iso en en
dc.title Frequency Domain Methods for Coding the Linear Predictive Residual of Speech Signals en
dc.type G2 Pro gradu, diplomityö fi
dc.contributor.school Sähkötekniikan korkeakoulu fi
dc.subject.keyword speech coding en
dc.subject.keyword transform coding en
dc.subject.keyword Vandermonde decomposition en
dc.subject.keyword EVS en
dc.subject.keyword ACELP en
dc.identifier.urn URN:NBN:fi:aalto-201709046823
dc.programme.major Acoustics and Audio Technology fi
dc.programme.mcode ELEC3030 fi
dc.type.ontasot Master's thesis en
dc.type.ontasot Diplomityö fi
dc.contributor.supervisor Bäckström, Tom
dc.programme CCIS - Master’s Programme in Computer, Communication and Information Sciences (TS2013) fi
dc.ethesisid Aalto 9553
dc.location P1 fi
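
The abstract describes using transforms such as the DCT and eigenvalue decomposition to decorrelate samples. The sketch below illustrates that general idea on a toy problem and is not the thesis's implementation: the AR(1)-style covariance with `rho = 0.9`, the block length `n = 16`, and the helper names `dct_matrix` and `offdiag_fraction` are all assumptions chosen for illustration. It measures how much of the covariance energy lies off the diagonal before and after each transform.

```python
import numpy as np


def dct_matrix(n):
    """Return the orthonormal DCT-II matrix of size n x n (hypothetical helper)."""
    k = np.arange(n)[:, None]   # frequency index (rows)
    m = np.arange(n)[None, :]   # sample index (columns)
    d = np.sqrt(2.0 / n) * np.cos(np.pi * (m + 0.5) * k / n)
    d[0, :] /= np.sqrt(2.0)     # scale the DC row so that d @ d.T == I
    return d


def offdiag_fraction(c):
    """Fraction of squared covariance energy that lies off the diagonal."""
    off = c - np.diag(np.diag(c))
    return np.sum(off ** 2) / np.sum(c ** 2)


# Assumed toy model: covariance of a strongly correlated AR(1)-like signal.
n, rho = 16, 0.9
idx = np.arange(n)
cov = rho ** np.abs(idx[:, None] - idx[None, :])

# Covariance after the DCT: D C D^T.
d = dct_matrix(n)
cov_dct = d @ cov @ d.T

# Covariance after eigenvalue decomposition: V^T C V is diagonal by construction.
_, v = np.linalg.eigh(cov)
cov_evd = v.T @ cov @ v

before = offdiag_fraction(cov)
after_dct = offdiag_fraction(cov_dct)
after_evd = offdiag_fraction(cov_evd)

print(f"off-diagonal energy fraction, original: {before:.3f}")
print(f"off-diagonal energy fraction, DCT:      {after_dct:.3f}")
print(f"off-diagonal energy fraction, EVD:      {after_evd:.3e}")
```

In this toy setting the eigenvalue decomposition diagonalizes the covariance exactly but depends on the signal statistics, whereas the DCT is a fixed transform that only approximates that result. The DFT and Vandermonde decomposition named in the abstract are not covered by this sketch.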