Please use this identifier to cite or link to this item:
http://hdl.handle.net/10061/7941
| Title: | Common Platform of Japanese Large Vocabulary Continuous Speech Recognizer Assessment -- Proposal and Initial Results -- |
| Authors: | Tatsuya Kawahara; Akinobu Lee; Tetsunori Kobayashi; Kazuya Takeda; Nobuaki Minematsu; Katsunobu Itou; Akinori Ito; Mikio Yamamoto; Atsushi Yamada; Takehito Utsuro; Kiyohiro Shikano |
| Issue Date: | May-1998 |
| Start page: | 117 |
| End page: | 122 |
| Abstract: | In this paper we present the first public Japanese speech corpus for large vocabulary continuous speech recognition (LVCSR) technology, which we have titled JNAS (Japanese Newspaper Article Sentences). We designed it to be comparable to the corpora used in the American and European LVCSR projects. The corpus contains speech recordings (60 hours) and their orthographic transcriptions for 306 speakers (153 males and 153 females) reading excerpts from newspaper articles and phonetically balanced (PB) sentences. In total, the corpus contains utterances of about 45,000 sentences, with each speaker reading about 150 sentences. JNAS is being distributed on 16 CD-ROMs. |
| Description: | EALREW98: First International Workshop on East-Asian Language Resource and Evaluation (Oriental COCOSDA Workshop), May 1998. |
| URI: | http://hdl.handle.net/10061/7941 |
| Text Version: | Publisher |
| Appears in Collections: | Graduate School of Information Science |