Papers

Peer-reviewed Last author International journal
Sep, 2018

Naturalness Improvement Algorithm for Reconstructed Glossectomy Patient's Speech Using Spectral Differential Modification in Voice Conversion.

Interspeech 2018, 19th Annual Conference of the International Speech Communication Association, Hyderabad, India, 2-6 September 2018.
  • Hiroki Murakami
  • ,
  • Sunao Hara
  • ,
  • Masanobu Abe
  • ,
  • Masaaki Sato
  • ,
  • Shogo Minagi

Volume
2018-September
Number
First page
2464
Last page
2468
Language
English
Publishing type
Research paper (international conference proceedings)
DOI
10.21437/Interspeech.2018-1239
Publisher
ISCA

In this paper, we propose an algorithm to improve the naturalness of the reconstructed glossectomy patient's speech that is generated by voice conversion to enhance the intelligibility of speech uttered by patients with a wide glossectomy. While existing VC algorithms make it possible to improve intelligibility and naturalness, the result is still not satisfying. To solve the continuing problems, we propose to directly modify the speech waveforms using a spectrum differential. The motivation is that glossectomy patients mainly have problems in their vocal tract, not in their vocal cords. The proposed algorithm requires no source parameter extractions for speech synthesis, so there are no errors in source parameter extractions and we are able to make the best use of the original source characteristics. In terms of spectrum conversion, we evaluate with both GMM and DNN. Subjective evaluations show that our algorithm can synthesize more natural speech than the vocoder-based method. Judging from observations of the spectrogram, power in high-frequency bands of fricatives and stops is reconstructed to be similar to that of natural speech.

Link information
DOI
https://doi.org/10.21437/Interspeech.2018-1239
DBLP
https://dblp.uni-trier.de/rec/conf/interspeech/MurakamiHASM18
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000465363900518&DestApp=WOS_CPL
URL
http://dblp.uni-trier.de/db/conf/interspeech/interspeech2018.html#conf/interspeech/MurakamiHASM18
Scopus
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85054996045&origin=inward
Scopus Citedby
https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85054996045&origin=inward
ID information
  • DOI : 10.21437/Interspeech.2018-1239
  • ISSN : 2308-457X
  • eISSN : 1990-9772
  • DBLP ID : conf/interspeech/MurakamiHASM18
  • SCOPUS ID : 85054996045
  • Web of Science ID : WOS:000465363900518

Export
BibTeX RIS