Dec 12, 2011
Extraction of new abbreviated words using Crowdsourcing System
IEICE technical report. Speech
- Volume
- 111
- Number
- 365
- First page
- 13
- Last page
- 17
- Language
- Japanese
- Publisher
- The Institute of Electronics, Information and Communication Engineers
New words and abbreviations appear every day in consumer-generated media (CGM) on the Web, such as Facebook and Twitter. Because these words are absent from standard dictionaries, they cause many difficulties in morphological analysis. This paper proposes a method for expanding the vocabulary with words from Twitter using crowdsourcing. In the first stage, unknown words are selected as candidate new abbreviations using standard morphological analysis. In the second stage, a crowdsourcing system is used to determine whether each candidate is an abbreviation. In the third stage, the crowdsourcing system is used again to obtain the correct reading and the proper word for each abbreviation.
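The three stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. A small hard-coded lexicon stands in for a real morphological analyzer, the tweets are assumed to be already tokenized, and majority voting is an assumed way of aggregating crowd answers (the abstract does not say how answers are combined). All function names and sample data are hypothetical.

```python
from collections import Counter

# Hypothetical toy lexicon standing in for a morphological analyzer's dictionary.
KNOWN_WORDS = {"を", "買っ", "た", "見"}


def find_unknown_words(tokenized_tweets, known_words=KNOWN_WORDS):
    """Stage 1: collect tokens missing from the lexicon as candidates."""
    candidates = []
    for tokens in tokenized_tweets:
        for token in tokens:
            if token not in known_words and token not in candidates:
                candidates.append(token)
    return candidates


def majority(answers):
    """Return the most frequent answer (assumed aggregation rule)."""
    return Counter(answers).most_common(1)[0][0]


def filter_abbreviations(candidates, votes):
    """Stage 2: keep candidates that most workers judged to be abbreviations."""
    return [word for word in candidates if majority(votes[word])]


def collect_expansions(abbreviations, answers):
    """Stage 3: pick the most frequent (reading, proper word) pair per word."""
    return {word: majority(answers[word]) for word in abbreviations}


if __name__ == "__main__":
    # Made-up, pre-tokenized sample tweets.
    tweets = [["スマホ", "を", "買っ", "た"], ["ぴよ太", "を", "見", "た"]]
    # Made-up worker judgments: True means "this is an abbreviation".
    votes = {"スマホ": [True, True, False], "ぴよ太": [False, False, True]}
    # Made-up worker answers as (reading, proper word) pairs.
    answers = {
        "スマホ": [
            ("スマホ", "スマートフォン"),
            ("スマホ", "スマートフォン"),
            ("スマホ", "スマートホン"),
        ]
    }

    candidates = find_unknown_words(tweets)
    abbreviations = filter_abbreviations(candidates, votes)
    print(collect_expansions(abbreviations, answers))
```

In a real system, stage 1 would rely on the unknown-word output of an actual morphological analyzer, and stages 2 and 3 would gather judgments from workers through a crowdsourcing platform rather than from hard-coded dictionaries.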
- Link information
- CiNii Articles
- http://ci.nii.ac.jp/naid/10031110512
- CiNii Books
- http://ci.nii.ac.jp/ncid/AN10013221
- URL
- http://id.ndl.go.jp/bib/023379295
- ID information
- ISSN : 0913-5685
- CiNii Articles ID : 10031110512
- CiNii Books ID : AN10013221