NAISTAR
Advanced Search
Japanese | English

naistar (NAIST Academic Repository) >
学術リポジトリ naistar / NAIST Academic Repository naistar >
国際会議発表論文 / Proceedings >
情報科学研究科 / Graduate School of Information Science >

Please use this identifier to cite or link to this item: http://hdl.handle.net/10061/8269

Title: An Excitation Model for HMM-Based Speech Synthesis Based on Residual Modeling
Authors: Ranniery Maia
Tomoki Toda
Heiga Zen
Yoshihiko Nankaku
Keiichi Tokuda
Issue Date: Aug-2007
Start page: 131
End page: 136
Abstract: This paper describes a trainable excitation approach to eliminate the unnaturalness of HMM-based speech synthesizers. During the waveform generation part, mixed excitation is constructed by state-dependent filtering of pulse trains and white noise sequences. In the training part, filters and pulse trains are jointly optimized through a procedure which resembles analysis-bysynthesis speech coding algorithms, where likelihood maximization of residual signals (derived from the same database which is used to train the HMM-based synthesizer) is pursued. Preliminary results show that the novel excitation model in question eliminates the unnaturalness of synthesized speech, being comparable in quality to the the best approaches thus far reported to eradicate the buzziness of HMM-based synthesizers.
Description: SSW6: 6th ISCA Speech Synthesis Workshop, August 22-24, 2007, Bonn, Germany.
URI: http://hdl.handle.net/10061/8269
Text Version: Publisher
Appears in Collections:情報科学研究科 / Graduate School of Information Science

Files in This Item:

File Description SizeFormat
SSW_2007_131.pdf399.89 kBAdobe PDFView/Open
presentation.pdf1.17 MBAdobe PDFView/Open
sound_examples_natural.wav134.12 kBWAVView/Open
sound_examples_proposed1.wav130.51 kBWAVView/Open
sound_examples_proposed2.wav130.51 kBWAVView/Open
sound_examples_simple1.wav130.51 kBWAVView/Open
sound_examples_simple2.wav130.51 kBWAVView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 

Copyright (c) 2007-2012 Nara Institute of Science and Technology All Rights Reserved.
DSpace Software Copyright © 2002-2010  Duraspace - Feedback