TOC
フクスウニン カイワ シーン ブンセキ ニ オケル マイクロホン アレイ オンセイ ショリ
荒木章子
生駒 : 奈良先端科学技術大学院大学, 2012.7
Lecture ArchiveNo. | Printing year | Location | Call Number | Material ID | Circulation class | Status | Waiting |
---|---|---|---|---|---|---|---|
1 |
|
|
M009763 |
|
|
|
Recognition of conversation scenes has recently been tackled to achieve a variety of tasks such as automatic annotation, minute taking, and meeting assistance. Since participants speak spontaneously in a conversation, a recorded conversation includes many speaker overlaps and ambient noise. To handle such complicated recordings, speech signal processing techniques play an important role. In this lecture, I will focus on some multi-channel speech enhancement and "who spoke when" estimation (speaker diarization) techniques for conversation scene analysis. Prototype meeting recognition and meeting assistance systems are also introduced.
2012
電子化映像資料(1時間29分53秒)
Speech processing techniques for conversation scene analysis
情報科学研究科・ゼミナール講演 ; 平成24年度
講演者所属: 日本電信電話株式会社 NTTコミュニケーション科学基礎研究所
講演日: 平成24年7月9日
講演場所: 情報科学研究科大講義室L1
Japan
Japanese (jpn)
Japanese (jpn)
荒木, 章子 (アラキ, ショウコ)