Cepstral distance based channel selection for distant speech recognition. (January 2018)

Record Type:: Journal Article
Title:: Cepstral distance based channel selection for distant speech recognition. (January 2018)
Main Title:: Cepstral distance based channel selection for distant speech recognition
Authors:: Guerrero Flores, Cristina
Tryfou, Georgina
Omologo, Maurizio
Abstract:: Highlights: This work concerns distant speech recognition (DSR) with microphones sparse in space. We introduce a methodology to conduct studies on channel selection (CS) for DSR. A CS method is proposed and relies on cepstral distance as measure of signal quality. Experiments are conducted on both simulated and real multi-microphone data sets. Results demonstrate the effectiveness of the proposed methodology and techniques. Abstract: Shifting from a single to a multi-microphone setting, distant speech recognition can be benefited from the multiple instances of the same utterance in many ways. An effective approach, especially when microphones are not organized in an array fashion, is given by channel selection (CS), which assumes that for each utterance there is at least one channel that can improve the recognition results when compared to the decoding of the remaining channels. In order to identify this most favourable channel, a possible approach is to estimate the degree of distortion that characterizes each microphone signal. In a reverberant environment, this distortion can vary significantly across microphones, for instance due to the orientation of the speaker's head. In this work, we investigate on the application of cepstral distance as a distortion measure that turns out to be closely related to properties of the room acoustics, such as reverberation time and direct-to-reverberant ratio. From this measure, a blind CS method is derived, which relies on a reference … (more)
Is Part Of:: Computer speech & language. Volume 47(2018)
Journal:: Computer speech & language
Issue:: Volume 47(2018)
Issue Display:: Volume 47, Issue 2018 (2018)
Year:: 2018
Volume:: 47
Issue:: 2018
Issue Sort Value:: 2018-0047-2018-0000
Page Start:: 314
Page End:: 332
Publication Date:: 2018-01
Subjects:: Distant speech recognition -- Channel selection -- Cepstral distance -- Reverberation -- Direct to reverberant ratio -- T60
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
Journal URLs:: http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.csl.2017.08.003 ↗
Languages:: English
ISSNs:: 0885-2308
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 20832.xml