MISC

査読有り
2010年

IMPROVED STATISTICAL MODELS FOR SMT-BASED SPEAKING STYLE TRANSFORMATION

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING
  • Graham Neubig
  • ,
  • Yuya Akita
  • ,
  • Shinsuke Mori
  • ,
  • Tatsuya Kawahara

開始ページ
5206
終了ページ
5209
記述言語
英語
掲載種別
DOI
10.1109/ICASSP.2010.5494997
出版者・発行元
IEEE

Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We take the approach of statistical machine translation (SMT) to "translate" from ASR results into transcript-style text. We introduce two novel modeling techniques in this framework: a context-dependent translation model, which allows for usage of context to accurately model translation probabilities, and log-linear interpolation of conditional and joint probabilities, which allows for frequently observed translation patterns to be given higher priority. The system is implemented using weighted finite state transducers (WFST). On an evaluation using ASR results and manual transcripts of meetings of the Japanese Diet (national congress), the proposed methods showed a significant increase in accuracy over traditional modeling techniques.

リンク情報
DOI
https://doi.org/10.1109/ICASSP.2010.5494997
J-GLOBAL
https://jglobal.jst.go.jp/detail?JGLOBAL_ID=201002255197707616
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000287096005030&DestApp=WOS_CPL
ID情報
  • DOI : 10.1109/ICASSP.2010.5494997
  • ISSN : 1520-6149
  • J-Global ID : 201002255197707616
  • Web of Science ID : WOS:000287096005030

エクスポート
BibTeX RIS