2016年
SMT-Based Lexicon Expansion for Broadcast Transcription
2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA)
- ,
- ,
- ,
- ,
- ,
- 記述言語
- 英語
- 掲載種別
- 研究論文(国際会議プロシーディングス)
- DOI
- 10.1109/APSIPA.2016.7820682
- 出版者・発行元
- IEEE
We describe a method of lexicon expansion to tackle variations of spontaneous speech. The variations of utterances are found widely in the programs such as conversations talk shows and are typically observed as unintelligible utterances with a high speech-rate. Unlike read speech in news programs, these variations often severely degrade automatic speech recognition (ASR) performance. Then, these variations are considered as new versions of original entries in the ASR lexicon. The new entries are generated based on the SMT approach, in which translation models are trained from corpus translating phoneme sequence in a lexicon into the sequence obtained by phoneme recognition. We introduce a new method in which unreliable entries are removed from the lexicon. Our SMT-based approach achieved a 0.1 % WER reduction for a variety of broadcasting programs.
- リンク情報
- ID情報
-
- DOI : 10.1109/APSIPA.2016.7820682
- Web of Science ID : WOS:000393591800010