2016
Robust front-end for speech recognition by human and machine in noisy reverberant environments: the effect of phase information
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP)
- Language
- English
- Publication type
- Article, review, commentary, editorial, etc. (international conference proceedings)
- Publisher
- IEEE
This paper proposes a robust front-end for speech applications based on a scheme for restoring instantaneous amplitude and phase. Typical applications such as hearing aids and automatic speech recognition systems still face challenges in robustness against noise and reverberation. The proposed front-end combines our previously proposed method for restoring instantaneous amplitude and phase on a Gammatone filterbank with cepstral mean normalization (CMN). The first method can remove late reverberation and additive noise components from the observed speech, while the second can remove early reflections. We compared the proposed method with other typical methods as a robust front-end for speech recognition by humans and machines in noisy reverberant environments. Modified rhyme tests and word recognition tests were carried out to evaluate recognition by humans and by machines, respectively. The results of both evaluations showed that the proposed front-end effectively improved speech intelligibility and word recognition rate in noisy reverberant environments. In addition, the use of phase information was found to greatly improve the quality and intelligibility of speech.
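As a rough illustration of the CMN step named in the abstract, the following is a minimal, generic sketch and not the authors' implementation. It assumes cepstral features are given as a list of frames, each a list of coefficients; the function name `cepstral_mean_normalization` and the toy data are hypothetical. A short, time-invariant channel (such as early reflections) appears approximately as a constant offset in the cepstral domain, so subtracting the per-coefficient mean over time cancels it.

```python
from typing import List


def cepstral_mean_normalization(frames: List[List[float]]) -> List[List[float]]:
    """Subtract the per-coefficient mean over time from each cepstral frame."""
    if not frames:
        return []
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    # Mean of each cepstral coefficient across all frames of the utterance.
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


# Toy data (hypothetical): three frames with two cepstral coefficients each.
clean = [[1.0, 2.0], [3.0, -2.0], [2.0, 0.0]]
# A fixed channel modeled as a constant additive offset in the cepstral domain.
channel = [0.5, -1.0]
observed = [[c + h for c, h in zip(frame, channel)] for frame in clean]

normalized_observed = cepstral_mean_normalization(observed)
normalized_clean = cepstral_mean_normalization(clean)
```

In this toy case the normalized observed features match the normalized clean features, because the constant offset is removed along with the mean. Late reverberation and additive noise do not reduce to a constant cepstral offset, which is consistent with the abstract assigning those components to the amplitude and phase restoration stage.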
- Links
- ID information
- Web of Science ID : WOS:000405610900026