|
naistar (NAIST Academic Repository) >
学術リポジトリ naistar / NAIST Academic Repository naistar >
国際会議発表論文 / Proceedings >
情報科学研究科 / Graduate School of Information Science >
Please use this identifier to cite or link to this item:
http://hdl.handle.net/10061/8169
|
| Title: | Comparison of Methods for Topic Classification in a Speech-Oriented Guidance System |
| Authors: | Rafael Torres Shota Takeuchi Hiromichi Kawanami Tomoko Matsui Hiroshi Saruwatari Kiyohiro Shikano |
| Keywords: | Speech Oriented Guidance System Topic Classification Support Vector Machine PrefixSpan Boosting Maximum Entropy |
| Issue Date: | Sep-2010 |
| Start page: | 1261 |
| End page: | 1264 |
| Abstract: | This work addresses the classification in topics of utterances in Japanese, received by a speech-oriented guidance system operating in a real environment. For this, we compare the performance of Support Vector Machine and PrefixSpan Boosting, against a conventional Maximum Entropy classification method. We are interested in evaluating their strength against automatic speech recognition (ASR) errors and the sparseness of the features present in spontaneous speech. To deal with the shortness of the utterances, we also proposed to use characters as features instead of words, which is possible with the Japanese language due to the presence of kanji; ideograms from Chinese characters that represent not only sound but meaning. Experimental results show a classification performance improvement from 92.2% to 94.4%, with Support Vector Machine using character unigrams and bigrams as features, in comparison to the conventional method. |
| Description: | INTERSPEECH2010: 11th Annual Conference of the International Speech Communication Association, September 26-30, 2010, Chiba, Japan. |
| URI: | http://hdl.handle.net/10061/8169 |
| Rights: | Copyright 2010 ISCA |
| Text Version: | Publisher |
| Appears in Collections: | 情報科学研究科 / Graduate School of Information Science
|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
|