Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition

Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)

福田芽衣子
西崎博光
Yurie Iribe
西村良太
北岡教英

開催年月日: 2020年5月13日 - 2020年5月15日
記述言語: 英語
会議種別: 口頭発表（一般）

In an aging society like Japan, a highly accurate speech recognition system is needed for use in electronic devices for the elderly, but this level of accuracy cannot be obtained using conventional speech recognition systems due to the unique features of the speech of elderly people. S-JNAS, a corpus of elderly Japanese speech, is widely used for acoustic modeling in Japan, but the average age of its speakers is 67.6 years old. Since average life expectancy in Japan is now 84.2 years, we are constructing a new speech corpus, which currently consists of the utterances of 221 speakers with an average age of 79.2, collected from four regions of Japan. In addition, we expand on our previous study (Fukuda, 2019) by further investigating the construction of acoustic models suitable for elderly speech. We create new acoustic models and train them using a combination of existing Japanese speech corpora (JNAS, S-JNAS, CSJ), with and without our super-elderly speech data, and conduct speech recognition experiments. Our new acoustic models achieve word error rates (WER) as low as 13.38%, exceeding the results of our previous study in which we used the CSJ acoustic model adapted for elderly speech (17.4% WER).

リンク情報

URL: https://web.db.tokushima-u.ac.jp/cgi-bin/edb_browse?EID=373073
URL: https://www.scopus.com/record/display.url?eid=2-s2.0-85096619800&origin=inward

西村良太

講演・口頭発表等

Improving Speech Recognition for the Elderly: A New Corpus of Elderly Japanese Speech and Investigation of Acoustic Modeling for Speech Recognition

メニュー

共著者の一覧

フォロー一覧