論文

査読有り
2013年

Controlling linguistic information and filtered sound identity for a new cross-synthesis vocoder

Acoustical Science and Technology
  • Taiki Nishi
  • ,
  • Ryuichi Nisimura
  • ,
  • Toshio Irino
  • ,
  • Hideki Kawahara

34
4
開始ページ
287
終了ページ
288
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1250/ast.34.287

A study was conducted to propose a new cross-synthesis framework based on an interference-free representation of a power spectrum combined with normalization and modulation transfer function design for spectral envelope preprocessing of speech sounds. The proposed cross-synthesis enabled control of the linguistic information and the timbre identity. The spectral envelope of speech was extracted in the proposed method using a F0-adaptive procedure called TANDEM-STRAIGHT. It was demonstrated that the procedure effectively removed interference caused by periodic excitation from the spectrogram of the speech and yielded a smooth representation. A two-staged procedure was also introduced to remove the timbre-modifying components from the speech spectral envelope. The primary procedure involved the approximation of the global spectral shape and the secondary one was the filtering of temporal modulations.

リンク情報
DOI
https://doi.org/10.1250/ast.34.287
ID情報
  • DOI : 10.1250/ast.34.287
  • ISSN : 1346-3969
  • ISSN : 1347-5177
  • SCOPUS ID : 84880653803

エクスポート
BibTeX RIS