SMT-Based Lexicon Expansion for Broadcast Transcription

2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA)

Manon Ichiki
Aiko Hagiwara
Hitoshi Ito
Kazuo Onoe
Shoei Sato
Akio Kobayashi

記述言語: 英語
掲載種別: 研究論文（国際会議プロシーディングス）
DOI: 10.1109/APSIPA.2016.7820682
出版者・発行元: IEEE

We describe a method of lexicon expansion to tackle variations of spontaneous speech. The variations of utterances are found widely in the programs such as conversations talk shows and are typically observed as unintelligible utterances with a high speech-rate. Unlike read speech in news programs, these variations often severely degrade automatic speech recognition (ASR) performance. Then, these variations are considered as new versions of original entries in the ASR lexicon. The new entries are generated based on the SMT approach, in which translation models are trained from corpus translating phoneme sequence in a lexicon into the sequence obtained by phoneme recognition. We introduce a new method in which unreliable entries are removed from the lexicon. Our SMT-based approach achieved a 0.1 % WER reduction for a variety of broadcasting programs.

リンク情報

DOI: https://doi.org/10.1109/APSIPA.2016.7820682
Web of Science: https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000393591800010&DestApp=WOS_CPL

ID情報

DOI : 10.1109/APSIPA.2016.7820682
Web of Science ID : WOS:000393591800010

エクスポート: BibTeX RIS

小林彰夫

論文

SMT-Based Lexicon Expansion for Broadcast Transcription

メニュー

共著者の一覧