2010年
IMPROVED STATISTICAL MODELS FOR SMT-BASED SPEAKING STYLE TRANSFORMATION
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING
- ,
- ,
- ,
- 開始ページ
- 5206
- 終了ページ
- 5209
- 記述言語
- 英語
- 掲載種別
- DOI
- 10.1109/ICASSP.2010.5494997
- 出版者・発行元
- IEEE
Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We take the approach of statistical machine translation (SMT) to "translate" from ASR results into transcript-style text. We introduce two novel modeling techniques in this framework: a context-dependent translation model, which allows for usage of context to accurately model translation probabilities, and log-linear interpolation of conditional and joint probabilities, which allows for frequently observed translation patterns to be given higher priority. The system is implemented using weighted finite state transducers (WFST). On an evaluation using ASR results and manual transcripts of meetings of the Japanese Diet (national congress), the proposed methods showed a significant increase in accuracy over traditional modeling techniques.
- リンク情報
- ID情報
-
- DOI : 10.1109/ICASSP.2010.5494997
- ISSN : 1520-6149
- J-Global ID : 201002255197707616
- Web of Science ID : WOS:000287096005030