A voice conversion based on phoneme segment mapping

Journal of the Acoustical Society of Japan (E)

Masanobu Abe
Shigeki Sagayama

Volume: 13
Number: 3
First page: 131
Last page: 139
Language: English
Publishing type: Research paper (scientific journal)
DOI: 10.1250/ast.13.131

Voice conversion is a technique to change speaker individuality
i.e., speech uttered by a speaker is changed to sound as if another speaker had uttered it. In this paper, we propose a voice conversion algorithm that uses speech segments as conversion units. Input speech is decomposed into speech segments by a speech recognition module and the segments are replaced by speech segments uttered by another speaker. This algorithm makes it possible to convert not only the static characteristics but also speaker individuality contained in phoneme segments. The proposed voice conversion algorithm was performed between two male speakers. Spectrum distortion between target speech and the converted speech was reduced to one-third the natural spectrum difference between the two speakers. A litening experiment showed that, in terms of speaker identification accuracy, the speech converted by segment-sized units gave a score 20% higher than the speech converted frame-by-frame. © 1992, Acoustical Society of Japan. All rights reserved.

Link information

DOI: https://doi.org/10.1250/ast.13.131

ID information

DOI : 10.1250/ast.13.131
ISSN : 0388-2861
SCOPUS ID : 85007732138

Export: BibTeX RIS

Masanobu Abe

Papers

A voice conversion based on phoneme segment mapping

Menu

Coauthors