Presentations

Oct 22, 2020

Construction of EARS, a Speech Corpus for the Elderly, and Preliminary Study of Its Application to Speech Recognition

IPSJ SIG Technical Report
  • Fukuda Meiko
  • Yurie Iribe
  • Hiromitsu Nishizaki
  • Kazumasa Yamamoto
  • Nishimura Ryota
  • Kitaoka Norihide

Language
Japanese
Presentation type

Because the speech of elderly people differs in several respects from that of the general population, speech recognition accuracy for elderly speakers is currently inadequate. Improving it requires a large amount of elderly speech data. S-JNAS, whose speakers have an average age of 67.6 years, is widely used as a large-scale corpus of elderly speech. However, because this average age falls well below the average life expectancy in Japan, we have started constructing a speech corpus for the very elderly (EARS: Elderly Adults Read Speech). The corpus design is based on S-JNAS, and to date we have collected the speech of 121 people (average age: 83.4 years) and compiled it into a database. In this paper, we describe the specifications of the corpus and report a preliminary study of an acoustic model for elderly speech using this corpus.

Link information
URL
https://web.db.tokushima-u.ac.jp/cgi-bin/edb_browse?EID=373078