Cancellation of Local Competing Speaker with Near-field Localization for Distributed Ad-Hoc Sensor Network

dc.contributor: Aalto-yliopisto [fi]
dc.contributor: Aalto University [en]
dc.contributor.author: Perez Zarazaga, Pablo [en_US]
dc.contributor.author: Bouafif, Mariem [en_US]
dc.contributor.author: Bäckström, Tom [en_US]
dc.contributor.author: Lachiri, Zied [en_US]
dc.contributor.department: Department of Signal Processing and Acoustics [en]
dc.contributor.groupauthor: Speech Communication Technology [en]
dc.contributor.groupauthor: Speech Interaction Technology [en]
dc.contributor.organization: Université de Tunis El Manar [en_US]
dc.date.accessioned: 2021-09-29T09:59:07Z
dc.date.available: 2021-09-29T09:59:07Z
dc.date.issued: 2021-08-31 [en_US]
dc.description.abstract: In scenarios such as remote work, open offices and call centers, multiple people may simultaneously have independent spoken interactions with their devices in the same room. However, the speech of competing speakers will be picked up by all microphones, which both reduces audio quality and exposes speakers to privacy breaches. We propose a cooperative cross-talk cancellation solution that breaks the single-active-speaker assumption employed by most telecommunication systems. The proposed method applies source separation to the microphone signals of independent devices to extract the dominant speaker at each device. It is realized using a localization estimator based on a deep neural network, followed by a time-frequency mask that separates the target speech from the interfering speech at each time-frequency unit according to its estimated orientation. In an experimental evaluation, we confirm that the proposed method effectively reduces cross-talk and outperforms the baseline expectation-maximization method by 10 dB in terms of interference rejection. This performance makes the proposed method a viable solution for cross-talk cancellation in near-field conditions, thus protecting the privacy of external speakers in the same acoustic space. [en]
dc.description.version: Peer reviewed [en]
dc.format.extent: 5
dc.format.mimetype: application/pdf [en_US]
dc.identifier.citation: Perez Zarazaga, P, Bouafif, M, Bäckström, T & Lachiri, Z 2021, Cancellation of Local Competing Speaker with Near-field Localization for Distributed Ad-Hoc Sensor Network. in 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021. Annual Conference of the International Speech Communication Association, International Speech Communication Association (ISCA), pp. 1176-1180, Interspeech, Brno, Czech Republic, 30/08/2021. https://doi.org/10.21437/Interspeech.2021-1329 [en]
dc.identifier.doi: 10.21437/Interspeech.2021-1329 [en_US]
dc.identifier.isbn: 9781713836902
dc.identifier.issn: 1990-9772
dc.identifier.issn: 2308-457X
dc.identifier.other: PURE UUID: 80db0346-29fd-46bb-9613-12e77ff6d929 [en_US]
dc.identifier.other: PURE ITEMURL: https://research.aalto.fi/en/publications/80db0346-29fd-46bb-9613-12e77ff6d929 [en_US]
dc.identifier.other: PURE LINK: http://www.scopus.com/inward/record.url?scp=85119173934&partnerID=8YFLogxK
dc.identifier.other: PURE FILEURL: https://research.aalto.fi/files/67780606/Cancellation_of_Local_Competing_Speaker_with_Near_Field_Localization_for_Distributed_ad_hoc_Sensor_Network.pdf [en_US]
dc.identifier.uri: https://aaltodoc.aalto.fi/handle/123456789/110172
dc.identifier.urn: URN:NBN:fi:aalto-202109299372
dc.language.iso: en [en]
dc.relation.ispartof: Interspeech [en]
dc.relation.ispartofseries: 22nd Annual Conference of the International Speech Communication Association, INTERSPEECH 2021 [en]
dc.relation.ispartofseries: pp. 1176-1180 [en]
dc.relation.ispartofseries: Annual Conference of the International Speech Communication Association [en]
dc.rights: openAccess [en]
dc.title: Cancellation of Local Competing Speaker with Near-field Localization for Distributed Ad-Hoc Sensor Network [en]
dc.type: A4 Artikkeli konferenssijulkaisussa (A4 Article in conference proceedings) [fi]
dc.type.version: publishedVersion
