論文

2017年

Speaker Dependent Approach for Enhancing a Glossectomy Patient's Speech via GMM-based Voice Conversion

18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6
  • Kei Tanaka
  • ,
  • Sunao Hara
  • ,
  • Masanobu Abe
  • ,
  • Masaaki Sato
  • ,
  • Shogo Minagi

開始ページ
3384
終了ページ
3388
記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.21437/Interspeech.2017-841
出版者・発行元
ISCA-INT SPEECH COMMUNICATION ASSOC

In this paper, using GMM-based voice conversion algorithm, we propose to generate speaker-dependent mapping functions to improve the intelligibility of speech uttered by patients with a wide glossectomy. The speaker-dependent approach enables to generate the mapping functions that reconstruct missing spectrum features of speech uttered by a patient without having influences of a speaker's factor. The proposed idea is simple, i.e., to collect speech uttered by a patient before and after the glossectomy, but in practice it is hard to ask patients to utter speech just for developing algorithms. To confirm the performance of the proposed approach, in this paper, in order to simulate glossectomy patients, we fabricated an intraoral appliance which covers lower dental arch and tongue surface to restrain tongue movements. In terms of the Mel-frequency cepstrum (MFC) distance, by applying the voice conversion, the distances were reduced by 25% and 42% for speaker dependent case and speaker-independent case, respectively. In terms of phoneme intelligibility, dictation tests revealed that speech reconstructed by speaker-dependent approach almost always showed better performance than the original speech uttered by simulated patients, while speaker-independent approach did not.

リンク情報
DOI
https://doi.org/10.21437/Interspeech.2017-841
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000457505000698&DestApp=WOS_CPL
ID情報
  • DOI : 10.21437/Interspeech.2017-841
  • ISSN : 2308-457X
  • Web of Science ID : WOS:000457505000698

エクスポート
BibTeX RIS