|
naistar (NAIST Academic Repository) >
学術リポジトリ naistar / NAIST Academic Repository naistar >
国際会議発表論文 / Proceedings >
情報科学研究科 / Graduate School of Information Science >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10061/7923
|
| Title: | ATRECSS: ATR English Speech Corpus for Speech Synthesis |
| Authors: | Jinfu Ni Toshio Hirai Hisashi Kawai Tomoki Toda Keiichi Tokuda Minoru Tsuzaki Sinsuke Sakai Ranniery Maia Satoshi Nakamura |
| Issue Date: | Aug-2007 |
| Start page: | 1 |
| End page: | 4 |
| Article Number: | 002 |
| Abstract: | This paper introduces a large-scale phonetically-balanced English speech corpus developed at ATR for corpus-based speech synthesis. This corpus includes a 16-hour American English speech data spoken by a professional male narrator in “reading style.” The contents of prompt sentences concern basically news articles, travel conversations, and novels. The prompt sentences were selected from huge collections of texts using a greedy algorithm to maximize the coverage of linguistic units, such as diphones and triphones. A few measures were taken to control undesirable recording variations in voice quality in the short term (daily) and long term (monthly) while recording the prompt sentences. Statistical figures of the corpus developed as well as those of subsets provided for Blizzard Challenge 2006 and 2007 are presented. |
| Description: | Blizzard Challenge 2007 Workshop, August 25, 2007, Bonn, Germany. |
| URI: | http://hdl.handle.net/10061/7923 |
| Text Version: | Publisher |
| Appears in Collections: | 情報科学研究科 / Graduate School of Information Science
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|