Machine-learning-based estimation of room acoustic parameters

 |  Login

Show simple item record

dc.contributor Aalto-yliopisto fi
dc.contributor Aalto University en
dc.contributor.advisor McCormack, Leo
dc.contributor.author Falcon Perez, Ricardo
dc.date.accessioned 2018-12-14T16:12:38Z
dc.date.available 2018-12-14T16:12:38Z
dc.date.issued 2018-12-10
dc.identifier.uri https://aaltodoc.aalto.fi/handle/123456789/35553
dc.description.abstract Traditional methods to study sound propagation inside rooms can be divided in two approaches: geometrical models and wave-based models. In the former, sound is analyzed as rays, giving a valid approximation for high frequencies while failing to model certain wave effects such as diffraction or inference. The latter, finds solutions for the wave equation, providing better accuracy at the cost of much higher computational complexity. This thesis presents a proof of concept for a novel machine learning method to estimate a set of typical room acoustics parameters using only geometrical information as input features. First, a room acoustics dataset composed of real world acoustical measurements is analyzed and processed using microphone array encoding techniques to extract room impulse responses and acoustical absorption area for multiple directions. The dataset is explored to identify correlation between features and general properties, including a low dimensionality representation for visualization. The proposed method uses geometrical features as input for a neural network model that estimates room acoustics parameters, such as reverberation time (T60), and early decay time (EDT). For reverberation time, this model is evaluated against the Sabine method and the results show much higher accuracy, especially at low frequencies. The method is then expanded to include input features for the locations of the source and microphone, where the results also achieve high performance. Furthermore, an hyperparameter optimization procedure using random search reveals three main findings. First, that a large range of neural networks architectures, even with very few trainable parameters, achieve high performance. Second, the depth of the models has little influence on the results. Third, the benefit of increasing the amount of training data examples for a single loudspeaker saturates after around 100 examples. en
dc.format.extent 72+6
dc.format.mimetype application/pdf en
dc.language.iso en en
dc.title Machine-learning-based estimation of room acoustic parameters en
dc.type G2 Pro gradu, diplomityö fi
dc.contributor.school Sähkötekniikan korkeakoulu fi
dc.subject.keyword room acoustics en
dc.subject.keyword room impulse response en
dc.subject.keyword machine learning en
dc.subject.keyword neural networks en
dc.subject.keyword microhpone array en
dc.subject.keyword data analysis en
dc.identifier.urn URN:NBN:fi:aalto-201812146569
dc.programme.major Acoustics and Audio Technology fi
dc.programme.mcode ELEC3030 fi
dc.type.ontasot Master's thesis en
dc.type.ontasot Diplomityö fi
dc.contributor.supervisor Pulkki, Ville
dc.programme CCIS - Master’s Programme in Computer, Communication and Information Sciences (TS2013) fi
dc.location P1 fi
local.aalto.electroniconly yes
local.aalto.openaccess yes


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search archive


Advanced Search

article-iconSubmit a publication

Browse