Comparison of tied- mixture and state-clustered HMMs with respect to recognition performance and training method

Journal of Information Technology Research

Hiroyuki Segi
Kazuo Onoe
Shoei Sato
Akio Kobayashi
Akio Ando

巻: 7
号: 3
開始ページ: 15
終了ページ: 31
記述言語: 英語
掲載種別: 研究論文（学術雑誌）
DOI: 10.4018/jitr.2014070102
出版者・発行元: IGI Global

Tied-mixture HMMs have been proposed as the acoustic model for large-vocabulary continuous speech recognition and have yielded promising results. They share base-distribution and provide more flexibility in choosing the degree of tying than state-clustered HMMs. However, it is unclear which acoustic models to superior to the other under the same training data. Moreover, LBG algorithm and EM algorithm, which are the usual training methods for HMMs, have not been compared. Therefore in this paper, the recognition performance of the respective HMMs and the respective training methods are compared under the same condition. It was found that the number of parameters and the word error rate for both HMMs are equivalent when the number of codebooks is sufficiently large. It was also found that training method using the LBG algorithm achieves a 90% reduction in training time compared to training method using the EM algorithm, without degradation of recognition accuracy.

リンク情報

DOI: https://doi.org/10.4018/jitr.2014070102
DBLP: https://dblp.uni-trier.de/rec/journals/jitr/SegiOSKA14
URL: http://dblp.uni-trier.de/db/journals/jitr/jitr7.html#journals/jitr/SegiOSKA14

ID情報

DOI : 10.4018/jitr.2014070102
ISSN : 1938-7865
ISSN : 1938-7857
DBLP ID : journals/jitr/SegiOSKA14
SCOPUS ID : 84919483339

エクスポート: BibTeX RIS

世木寛之

論文

Comparison of tied- mixture and state-clustered HMMs with respect to recognition performance and training method

メニュー