MISC

査読有り
2016年

Robust front-end for speech recognition by human and machine in noisy reverberant environments: the effect of phase information

2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP)
  • Yang Liu
  • ,
  • Naushin Nower
  • ,
  • Shota Morita
  • ,
  • Masashi Unoki

記述言語
英語
掲載種別
記事・総説・解説・論説等(国際会議プロシーディングズ)
出版者・発行元
IEEE

This paper proposes a robust front-end for speech applications based on restoration scheme of instantaneous amplitude and phase. Typical applications such as hearing aids and automatic speech recognition systems still have challenging issues with regard to robustness against noise and reverberation. The proposed front-end employed a combination of our previously proposed method for restoring instantaneous amplitude and phase on a Gammatone filterbank and cepstral mean normalization (CMN). The first method can remove late reverberated and additive noise components from the observed speech, while the second method can remove the early reflection. In this paper, we comparatively evaluated the proposed method with other typical methods as robust front-end for speech recognition by human and machine in noisy reverberant environments. Modified Rhyme tests and word recognition tests were carried out as speech recognition by human and machine. The results of both evaluations revealed that the proposed front-end could effectively improve correctness of speech intelligibility and word recognition rate in noisy reverberant environments. In addition, effect of phase information was found to greatly improve the quality and intelligibility of speech.

リンク情報
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000405610900026&DestApp=WOS_CPL
ID情報
  • Web of Science ID : WOS:000405610900026

エクスポート
BibTeX RIS