Frequency domain methods for coding the linear predictive residual of speech signals
| dc.contributor | Aalto-yliopisto | fi |
| dc.contributor | Aalto University | en |
| dc.contributor.advisor | Markovic, Goran | |
| dc.contributor.author | Perez Zarazaga, Pablo | |
| dc.contributor.school | Sähkötekniikan korkeakoulu | fi |
| dc.contributor.supervisor | Bäckström, Tom | |
| dc.date.accessioned | 2017-09-04T10:34:04Z | |
| dc.date.available | 2017-09-04T10:34:04Z | |
| dc.date.issued | 2017-08-28 | |
| dc.description.abstract | The most frequently used speech coding paradigm is ACELP, known for encoding speech at high quality while consuming little bandwidth. ACELP applies linear prediction filtering to remove the effect of the spectral envelope from the signal. The resulting noise-like excitation is then encoded using algebraic codebooks. The search of these codebooks, however, cannot be performed optimally with conventional encoders because of the correlation between the excitation samples. As a result, more complex algorithms are required to maintain the quality. Four transformation algorithms (DCT, DFT, eigenvalue decomposition and Vandermonde decomposition) have been implemented to decorrelate the samples of the innovative excitation in ACELP. These transformations have been integrated into the ACELP of the EVS codec. The transformed innovative excitation is coded using the envelope-based arithmetic coder. Objective and subjective tests have been carried out to evaluate the quality of the encoding, the degree of decorrelation achieved by the transformations and the computational complexity of the algorithms. | en |
| dc.ethesisid | Aalto 9553 | |
| dc.format.extent | (6) + 72 | |
| dc.format.mimetype | application/pdf | en |
| dc.identifier.uri | https://aaltodoc.aalto.fi/handle/123456789/27924 | |
| dc.identifier.urn | URN:NBN:fi:aalto-201709046823 | |
| dc.language.iso | en | en |
| dc.location | P1 | fi |
| dc.programme | CCIS - Master's Programme in Computer, Communication and Information Sciences (TS2013) | fi |
| dc.programme.major | Acoustics and Audio Technology | fi |
| dc.programme.mcode | ELEC3030 | fi |
| dc.subject.keyword | speech coding | en |
| dc.subject.keyword | transform coding | en |
| dc.subject.keyword | vandermonde decomposition | en |
| dc.subject.keyword | EVS | en |
| dc.subject.keyword | ACELP | en |
| dc.title | Frequency domain methods for coding the linear predictive residual of speech signals | en |
| dc.type | G2 Pro gradu, diplomityö | fi |
| dc.type.ontasot | Master's thesis | en |
| dc.type.ontasot | Diplomityö | fi |
Files
Original bundle
- Name: master_Perez_Zarazaga_Pablo_2017.pdf
- Size: 1.43 MB
- Format: Adobe Portable Document Format