Misc.

Lead author
Dec 12, 2011

Extraction of new abbreviated words using Crowdsourcing System

IEICE technical report. Speech
  • SAKAI Toshihiko
  • ,
  • ASHIKAWA Masayuki
  • ,
  • HIROKAWA Sachio

Volume
111
Number
365
First page
13
Last page
17
Language
Japanese
Publishing type
Publisher
The Institute of Electronics, Information and Communication Engineers

New words and abbreviated words are being born every day in CGM (consumer generated media) on the Web, such as Facebook and Twitter. Those words are not in the standard dictionaries and cause many difficulties in morphological analysis. This paper proposes a method to increase vocabularies from Twitter using Crowdsourcing. At the first stage, unknown words are chosen as candidates of new abbreviated words using a standard morphological analysis. At the second stage, Crowdsourcing System is used to determine if a word is an abbreviated word. Couwdsourcing System is used at the third stage to obtain the correct reading and the proper word.

Link information
CiNii Articles
http://ci.nii.ac.jp/naid/10031110512
CiNii Books
http://ci.nii.ac.jp/ncid/AN10013221
URL
http://id.ndl.go.jp/bib/023379295
ID information
  • ISSN : 0913-5685
  • CiNii Articles ID : 10031110512
  • CiNii Books ID : AN10013221

Export
BibTeX RIS