Library

Top
Details (Local collection)

Exploiting Web Data for NLP Research: from Multilingual Text to Social Media
TOC

荒瀬由紀

生駒 : 奈良先端科学技術大学院大学, 2013.01

Lecture Archive

Volume No.

Total: 1

No.	Printing year	Location	Call Number	Material ID	Circulation class	Status	Waiting
1		Digital Library	LA-I-R[MPDASH][Mobile]	M010314

Back to volume top

Contents Intro.: The Web is a gold-mine of data in diverse categories and characteristics, which contains ones that had been hardly available in past, such as a large amount of text in different languages and data from social media. Such data contributes to accelerate progress of research, and at the same time, brings new challenges. In this talk, I introduce our recent research efforts exploiting such novel data on the Web. First, we exploit Twitter data for classifying spiking queries into their topical categories. Spiking queries show sudden spikes in search engines, which represents users' hot attention to them. Therefore, accurate classification of spiking query is important for search engines. Next, I introduce our effort to extract Japanese-English parallel sentence pairs from the Web. We took 3 approaches to mine such data and carefully developed data cleaning framework to extract only high-quality portion. Then I briefly introduce our approach on Japanese-English statistical machine translation.

Details

Publication year: 2013

Form: 電子化映像資料(1時間30分40秒)

Series title: 情報科学研究科・ゼミナール講演 ; 平成24年度

Note

講演者所属: Microsoft Research Asia

講演日: 平成25年1月29日

講演場所: 情報科学研究科大講義室L1

Country of publication: Japan

Title language: English (eng)

Language of texts: English (eng)

Author information: 荒瀬, 由紀 (アラセ, ユキ)

Find Materials

Series title

情報科学研究科・ゼミナール講演 ; 平成24年度

Author information

荒瀬, 由紀 (アラセ, ユキ)

Back Next

Exploiting Web Data for NLP Research: from Multilingual Text to Social Media
TOC

Send by email

Address

Subject

Volume No.

Send by email

Address

Subject

Details

Find Materials

Series title

Author information

Edit bookmark

Select list

Memo

Select list

Add to bookmark

Select list

Memo

Exploiting Web Data for NLP Research: from Multilingual Text to Social Media TOC

Send by email

Address

Subject

Volume No.

Send by email

Address

Subject

Details

Find Materials

Series title

Author information

Edit bookmark

Select list

Memo

Select list

Add to bookmark

Select list

Memo

Exploiting Web Data for NLP Research: from Multilingual Text to Social Media
TOC