論文

査読有り
2016年

SMT-Based Lexicon Expansion for Broadcast Transcription

2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA)
  • Manon Ichiki
  • ,
  • Aiko Hagiwara
  • ,
  • Hitoshi Ito
  • ,
  • Kazuo Onoe
  • ,
  • Shoei Sato
  • ,
  • Akio Kobayashi

記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.1109/APSIPA.2016.7820682
出版者・発行元
IEEE

We describe a method of lexicon expansion to tackle variations of spontaneous speech. The variations of utterances are found widely in the programs such as conversations talk shows and are typically observed as unintelligible utterances with a high speech-rate. Unlike read speech in news programs, these variations often severely degrade automatic speech recognition (ASR) performance. Then, these variations are considered as new versions of original entries in the ASR lexicon. The new entries are generated based on the SMT approach, in which translation models are trained from corpus translating phoneme sequence in a lexicon into the sequence obtained by phoneme recognition. We introduce a new method in which unreliable entries are removed from the lexicon. Our SMT-based approach achieved a 0.1 % WER reduction for a variety of broadcasting programs.

リンク情報
DOI
https://doi.org/10.1109/APSIPA.2016.7820682
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000393591800010&DestApp=WOS_CPL
ID情報
  • DOI : 10.1109/APSIPA.2016.7820682
  • Web of Science ID : WOS:000393591800010

エクスポート
BibTeX RIS