[Journal article][research article]


Speech Envelope Dynamics for Noise-Robust Auditory Scene Analysis in Robotics

Authors:
Francesco Rea; Austin Kothig; Lukas Grasse; Matthew Tata

Year of publication: 2020

Pages: 664-676
Publisher: World Scientific Publishing Company


Abstract:

Humans make extensive use of auditory cues to interact with other humans, especially in challenging real-world acoustic environments. Multiple distinct acoustic events usually mix together in a complex auditory scene. The ability to separate and localize mixed sounds in complex auditory scenes remains a demanding skill for binaural robots. In fact, binaural robots are required to disambiguate and interpret the environmental scene with only two sensors. At the same time, robots that interact with humans should be able to gain insights about the speakers in the environment, such as how many speakers are present and where they are located. For this reason, the speech signal is distinctly important among the auditory stimuli commonly found in human-centered acoustic environments. In this paper, we propose a Bayesian method of selectively processing acoustic data that exploits the characteristic amplitude envelope dynamics of human speech to infer the location of speakers in the complex auditory scene. The goal is to demonstrate the effectiveness of this speech-specific temporal dynamics approach. Further, we measure how effective this method is in comparison with more traditional methods based on amplitude detection only.
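
The contrast between envelope-dynamics weighting and amplitude-only detection can be illustrated with a minimal sketch. This is not the authors' implementation, which the abstract does not detail. It assumes a hypothetical setup with one pre-separated signal per candidate direction, an assumed 2-8 Hz syllable-rate modulation band, and a simple rectify-and-smooth envelope. All function names and parameters below are illustrative.

```python
import math

FS = 1000  # sample rate in Hz (assumed for this toy example)

def envelope(signal, fs, smooth_ms=20.0):
    """Amplitude envelope: full-wave rectify, then moving-average smooth."""
    win = max(1, int(fs * smooth_ms / 1000.0))
    rect = [abs(x) for x in signal]
    out, acc = [], 0.0
    for i, v in enumerate(rect):
        acc += v
        if i >= win:
            acc -= rect[i - win]
        out.append(acc / min(i + 1, win))
    return out

def modulation_power(env, fs, freq_hz):
    """Power of the mean-removed envelope at one modulation frequency."""
    mean = sum(env) / len(env)
    re = im = 0.0
    for n, v in enumerate(env):
        phase = 2.0 * math.pi * freq_hz * n / fs
        re += (v - mean) * math.cos(phase)
        im -= (v - mean) * math.sin(phase)
    return (re * re + im * im) / (len(env) ** 2)

def speech_likeness(signal, fs, band=(2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0)):
    """Envelope modulation power in the assumed band, relative to mean level."""
    env = envelope(signal, fs)
    mean = sum(env) / len(env)
    if mean == 0.0:
        return 0.0
    return sum(modulation_power(env, fs, f) for f in band) / (mean * mean)

def bayes_update(prior, likelihoods):
    """Posterior over candidate directions: prior times likelihood, normalized."""
    post = [p * l for p, l in zip(prior, likelihoods)]
    total = sum(post)
    return [p / total for p in post]

# Toy scene: direction 0 carries a loud steady tone (noise-like source),
# direction 1 a quieter tone whose amplitude fluctuates at 4 Hz (speech-like).
t = [n / FS for n in range(2 * FS)]
steady = [math.sin(2 * math.pi * 200 * x) for x in t]
modulated = [0.5 * (1 + 0.8 * math.sin(2 * math.pi * 4 * x))
             * math.sin(2 * math.pi * 200 * x) for x in t]

# Amplitude-only baseline: mean envelope level per direction.
amplitude_scores = [sum(envelope(s, FS)) / len(s) for s in (steady, modulated)]

# Envelope-dynamics likelihood, with a small floor so no direction is ruled out.
likelihoods = [1e-3 + speech_likeness(s, FS) for s in (steady, modulated)]
posterior = bayes_update([0.5, 0.5], likelihoods)
```

In this toy scene the amplitude-only scores favor the louder steady tone, while the posterior favors the direction whose envelope fluctuates at a speech-like rate. A real system would first have to estimate per-direction signals from the two microphones, a step this sketch leaves out.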



Keywords:

Auditory robotics; speech; sound localization; auditory scene analysis


Journal:
International Journal of Humanoid Robotics
ISSN: 0219-8436
Source: World Scientific Publishing Company