Exploiting Web Data for NLP Research: from Multilingual Text to Social Media
Table of contents available

荒瀬由紀

Ikoma : Nara Institute of Science and Technology, 2013.01

Lecture Archive

Volume information

1 item in total

No.	Print year	Location	Call number	Material ID	Loan category	Status	Reservations
1		Digitized material	LA-I-R[MPDASH][Mobile]	M010314

Abstract: The Web is a gold mine of data of diverse categories and characteristics, including data that was hardly available in the past, such as large amounts of text in different languages and data from social media. Such data helps accelerate research progress and, at the same time, brings new challenges. In this talk, I introduce our recent research efforts exploiting such novel data on the Web. First, we exploit Twitter data to classify spiking queries into their topical categories. Spiking queries show sudden spikes in search engines, reflecting strong user attention to them. Accurate classification of spiking queries is therefore important for search engines. Next, I introduce our effort to extract Japanese-English parallel sentence pairs from the Web. We took three approaches to mining such data and carefully developed a data-cleaning framework to extract only the high-quality portion. Finally, I briefly introduce our approach to Japanese-English statistical machine translation.

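Details

Publication year: 2013

Format: Digitized video material (1 hr 30 min 40 sec)

Series: Graduate School of Information Science, Seminar Lectures ; academic year Heisei 24 (2012)

Notes

Speaker affiliation: Microsoft Research Asia

Lecture date: January 29, 2013 (Heisei 25)

Venue: Large Lecture Room L1, Graduate School of Information Science

Title language: English (eng)

Text language: English (eng)

Author: 荒瀬, 由紀 (Arase, Yuki)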
