Frequency domain methods for coding the linear predictive residual of speech signals

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorMarkovic, Goran
dc.contributor.authorPerez Zarazaga, Pablo
dc.contributor.schoolSähkötekniikan korkeakoulufi
dc.contributor.supervisorBäckström, Tom
dc.date.accessioned2017-09-04T10:34:04Z
dc.date.available2017-09-04T10:34:04Z
dc.date.issued2017-08-28
dc.description.abstractThe most frequently used speech coding paradigm is ACELP, famous because it encodes speech with high quality, while consuming a small bandwidth. ACELP performs linear prediction filtering in order to eliminate the effect of the spectral envelope from the signal. The noise-like excitation is then encoded using algebraic codebooks. The search of this codebook, however, can not be performed optimally with conventional encoders due to the correlation between their samples. Because of this, more complex algorithms are required in order to maintain the quality. Four different transformation algorithms have been implemented (DCT, DFT, Eigenvalue decomposition and Vandermonde decomposition) in order to decorrelate the samples of the innovative excitation in ACELP. These transformations have been integrated in the ACELP of the EVS codec. The transformed innovative excitation is coded using the envelope based arithmetic coder. Objective and subjective tests have been carried out to evaluate the quality of the encoding, the degree of decorrelation achieved by the transformations and the computational complexity of the algorithms.en
dc.ethesisidAalto 9553
dc.format.extent(6) + 72
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/27924
dc.identifier.urnURN:NBN:fi:aalto-201709046823
dc.language.isoenen
dc.locationP1fi
dc.programmeCCIS - Master's Programme in Computer, Communication and Information Sciences (TS2013)fi
dc.programme.majorAcoustics and Audio Technologyfi
dc.programme.mcodeELEC3030fi
dc.subject.keywordspeech codingen
dc.subject.keywordtransform codingen
dc.subject.keywordvandermonde decompositionen
dc.subject.keywordEVSen
dc.subject.keywordACELPen
dc.titleFrequency domain methods for coding the linear predictive residual of speech signalsen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Perez_Zarazaga_Pablo_2017.pdf
Size:
1.43 MB
Format:
Adobe Portable Document Format