DSpace Repository

A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-modal Telephone Directory Assistance System

Show simple item record

dc.contributor.author Yasuhiro Minami en
dc.contributor.author Kiyohiro Shikano en
dc.contributor.author Osamu Yoshioka en
dc.contributor.author Satoshi Takahashi en
dc.contributor.author Tomokazu Yamada en
dc.contributor.author Sadaoki Furui en
dc.date.accessioned 2012-08-22T07:58:44Z en
dc.date.available 2012-08-22T07:58:44Z en
dc.date.issued 1997-03 en
dc.identifier.isbn 1558603573 en
dc.identifier.uri http://hdl.handle.net/10061/7988 en
dc.description HLT1994: Workshop on Human Language Technology , March 8-11, 1994, Plainsboro, New Jerey, USA. en
dc.description.abstract This paper describes an accurate and efficient algorithm for very-large-vocabulary continuous speech recognition based on an HMM-LR algorithm. The HMM-LR algorithm uses a generalized LR parser as a language model and hidden Markov models (HMMs) as phoneme models. To reduce the search space without pruning the correct candidate, we use forward and backward trellis likelihoods, an adjusting window for choosing only the probable part of the trellis for each predicted phoneme, and an algorithm for merging candidates that have the same allophonic phoneme sequences and the same context-free grammar states. Candidates are also merged at the meaning level. This algorithm is applied to a telephone directory assistance system that recognizes spontaneous speech containing the names and addresses of more than 70,000 subscribers (vocabulary size is about 80,000). The experimental results show that the system performs well in spite of the large perplexity. This algorithm was also applied to a multi-modal telephone directory assistance system, and the system was evaluated from the human-interface point of view. To cope with the problem of background noise, an HMM composition technique which combines a noise-source HMM and a clean phoneme HMM into a noise-added phoneme HMM was investigated and incorporated into the system. en
dc.language.iso en en
dc.rights Copyright 1994 Association for Computational Linguistics en
dc.title A Large-Vocabulary Continuous Speech Recognition Algorithm and its Application to a Multi-modal Telephone Directory Assistance System en
dc.type.nii Conference Paper en
dc.textversion Publisher en
dc.identifier.spage 387 en
dc.identifier.epage 392 en
dc.relation.doi 10.3115/1075812.1075902 en


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account