Optimizing spherical loudspeaker array for voice directivity using the spherical cap model

dc.contributorAalto-yliopistofi
dc.contributorAalto Universityen
dc.contributor.advisorMeyer-Kahlen, Nils
dc.contributor.authorPalakkal Raghunath, Midhun
dc.contributor.schoolSähkötekniikan korkeakoulufi
dc.contributor.schoolSchool of Electrical Engineeringen
dc.contributor.supervisorArend, Johannes M.
dc.date.accessioned2026-01-19T18:06:40Z
dc.date.available2026-01-19T18:06:40Z
dc.date.issued2025-12-31
dc.description.abstractConventional mouth simulators are limited by fixed radiation patterns, low power and high self-noise, failing to capture the complex spatial characteristics of real speech, such as the downward tilt of the main radiation lobe. To overcome these limitations, a simulation-driven approach is developed using the analytical spherical cap model, which was implemented and validated against known reference from the work of Aarts and Jansen [ 1 ]. This thesis investigates the optimization of a spherical loudspeaker array to accurately reproduce the dynamic and articulation-dependent directivity patterns of the human voice. A grid-search optimization method is applied to evaluate physical parameters, including array radius , driver radius , for different layouts (1, 4, 9, and 16 drivers), using a magnitude-weighted energy error metric. The single-driver configuration served as baseline proving the existence of optimal points, whereas they were found to be acoustically insufficient for replicating the asymmetric and time-varying nature of human speech. In contrast, by utilizing a frequency-dependent regularized least-squares control strategy (Tikhonov regularization) the multi-driver arrays successfully reproduce the higher-order spatial modes required to match measured phoneme patterns. The results indicate that a 16 driver configuration provides a critical threshold of perceptual authenticity, effectively reproducing the downward-tilted radiation lobes of vowels and the unique spatial signatures of fricatives and nasals. This work establishes a robust framework for developing physical prototypes.en
dc.format.extent44
dc.format.mimetypeapplication/pdfen
dc.identifier.urihttps://aaltodoc.aalto.fi/handle/123456789/142082
dc.identifier.urnURN:NBN:fi:aalto-202601191458
dc.language.isoenen
dc.locationP1fi
dc.programmeMaster's Programme in Computer, Communication and Information Sciencesen
dc.programmeTieto-, tietoliikenne- ja informaatiotekniikan maisteriohjelmafi
dc.programmeMagisterprogrammet i data-, informations- och kommunikationstekniksv
dc.programme.majorAcoustics and Audio Technologyen
dc.subject.keywordspherical loudspeaker arrayen
dc.subject.keywordvoice directivityen
dc.subject.keywordspherical cap modelen
dc.subject.keywordregularizationen
dc.subject.keywordphoneme-dependent radiationen
dc.subject.keywordambisonicsen
dc.titleOptimizing spherical loudspeaker array for voice directivity using the spherical cap modelen
dc.typeG2 Pro gradu, diplomityöfi
dc.type.ontasotMaster's thesisen
dc.type.ontasotDiplomityöfi
local.aalto.electroniconlyyes
local.aalto.openaccessyes

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
master_Palakkal_Raghunath_Midhun_2026.pdf
Size:
5.85 MB
Format:
Adobe Portable Document Format