Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition

Loading...
Thumbnail Image
File version

Version of Record (VoR)

Author(s)
Himawan, Ivan
Motlicek, Petr
Sridharan, Sridha
Dean, David
Tjondronegoro, Dian
Griffith University Author(s)
Primary Supervisor
Other Supervisors
Editor(s)
Date
2015
Size
File type(s)
Location

Dresden, Germany

License
Abstract

Automatic speech recognition from multiple distant microphones poses significant challenges because of noise and reverberations. The quality of speech acquisition may vary between microphones because of movements of speakers and channel distortions. This paper proposes a channel selection approach for selecting reliable channels based on selection criterion operating in the short-term modulation spectrum domain. The proposed approach quantifies the relative strength of speech from each microphone and speech obtained from beamforming modulations. The new technique is compared experimentally in the real reverb conditions in terms of perceptual evaluation of speech quality (PESQ) measures and word error rate (WER). Overall improvement in recognition rate is observed using delay-sum and superdirective beamformers compared to the case when the channel is selected randomly using circular microphone arrays.

Journal Title
Conference Title

16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015)

Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
DOI
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© 2015 ISCA and the Author(s). The attached file is reproduced here in accordance with the copyright policy of the publisher. For information about this conference please refer to the conference’s website or contact the author(s).

Item Access Status
Note
Access the data
Related item(s)
Subject

Acoustics and noise control (excl. architectural acoustics)

Science & Technology

Acoustics

Computer Science, Interdisciplinary Applications

Persistent link to this record
Citation

Himawan, I; Motlicek, P; Sridharan, S; Dean, D; Tjondronegoro, D, Channel Selection in the Short-time Modulation Domain for Distant Speech Recognition, 16th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2015), pp. 741-745