2013年
Controlling linguistic information and filtered sound identity for a new cross-synthesis vocoder
Acoustical Science and Technology
- ,
- ,
- ,
- 巻
- 34
- 号
- 4
- 開始ページ
- 287
- 終了ページ
- 288
- 記述言語
- 英語
- 掲載種別
- 研究論文(学術雑誌)
- DOI
- 10.1250/ast.34.287
A study was conducted to propose a new cross-synthesis framework based on an interference-free representation of a power spectrum combined with normalization and modulation transfer function design for spectral envelope preprocessing of speech sounds. The proposed cross-synthesis enabled control of the linguistic information and the timbre identity. The spectral envelope of speech was extracted in the proposed method using a F0-adaptive procedure called TANDEM-STRAIGHT. It was demonstrated that the procedure effectively removed interference caused by periodic excitation from the spectrogram of the speech and yielded a smooth representation. A two-staged procedure was also introduced to remove the timbre-modifying components from the speech spectral envelope. The primary procedure involved the approximation of the global spectral shape and the secondary one was the filtering of temporal modulations.
- リンク情報
- ID情報
-
- DOI : 10.1250/ast.34.287
- ISSN : 1346-3969
- ISSN : 1347-5177
- SCOPUS ID : 84880653803