Advanced Search
Japanese | English

naistar (NAIST Academic Repository) >
学術リポジトリ naistar / NAIST Academic Repository naistar >
国際会議発表論文 / Proceedings >
情報科学研究科 / Graduate School of Information Science >

Please use this identifier to cite or link to this item:

Title: Communicative Speech Synthesis with XIMERA: A First Step
Authors: Sinsuke Sakai
Jinfu Ni
Ranniery Maia
Keiichi Tokuda
Minoru Tsuzaki
Tomoki Toda
Hisashi Kawai
Satoshi Nakamura
Issue Date: Aug-2007
Start page: 28
End page: 33
Abstract: This paper presents a corpus-based approach to communicative speech synthesis. We chose "good news" style and "bad news" style for our initial attempt to synthesize speech that has appropriate expressiveness desired in human-human or human-machine dialog. We utilized 10-hour "neutral" style speech corpus as well as smaller corpora with good news and bad news styles, each consisting of two to three hours of speech from the same speaker. We trained target HMM models with each style and synthesized speech with unit databases containing speech with the relevant style as well as neutral speech. From the listening tests, we found out that intended communicative styles were comprehended by listeners and that considerably high mean opinion score on naturalness was achieved with rather small, style-specific corpora.
Description: SSW6: 6th ISCA Speech Synthesis Workshop, August 22-24, 2007, Bonn, Germany.
Text Version: Publisher
Appears in Collections:情報科学研究科 / Graduate School of Information Science

Files in This Item:

File Description SizeFormat
SSW_2007_28.pdf353.37 kBAdobe PDFView/Open
presentation.pdf670.72 kBAdobe PDFView/Open

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.


Copyright (c) 2007-2012 Nara Institute of Science and Technology All Rights Reserved.
DSpace Software Copyright © 2002-2010  Duraspace - Feedback