IMPROVED STATISTICAL MODELS FOR SMT-BASED SPEAKING STYLE TRANSFORMATION

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING

Graham Neubig
Yuya Akita
Shinsuke Mori
Tatsuya Kawahara

開始ページ: 5206
終了ページ: 5209
記述言語: 英語
掲載種別
DOI: 10.1109/ICASSP.2010.5494997
出版者・発行元: IEEE

Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We take the approach of statistical machine translation (SMT) to "translate" from ASR results into transcript-style text. We introduce two novel modeling techniques in this framework: a context-dependent translation model, which allows for usage of context to accurately model translation probabilities, and log-linear interpolation of conditional and joint probabilities, which allows for frequently observed translation patterns to be given higher priority. The system is implemented using weighted finite state transducers (WFST). On an evaluation using ASR results and manual transcripts of meetings of the Japanese Diet (national congress), the proposed methods showed a significant increase in accuracy over traditional modeling techniques.

リンク情報

DOI: https://doi.org/10.1109/ICASSP.2010.5494997
J-GLOBAL: https://jglobal.jst.go.jp/detail?JGLOBAL_ID=201002255197707616
Web of Science: https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000287096005030&DestApp=WOS_CPL

ID情報

DOI : 10.1109/ICASSP.2010.5494997
ISSN : 1520-6149
J-Global ID : 201002255197707616
Web of Science ID : WOS:000287096005030

エクスポート: BibTeX RIS

森信介

MISC

IMPROVED STATISTICAL MODELS FOR SMT-BASED SPEAKING STYLE TRANSFORMATION

メニュー

共著者の一覧