論文

査読有り
2014年7月

Adding Twitter-Specific Features to Stylistic Features for Classifying Tweets by User Type and Number of Retweets

JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY
  • Yui Arakawa
  • ,
  • Akihiro Kameda
  • ,
  • Akiko Aizawa
  • ,
  • Takafumi Suzuki

65
7
開始ページ
1416
終了ページ
1423
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1002/asi.23126
出版者・発行元
WILEY-BLACKWELL

Recently, Twitter has received much attention, both from the general public and researchers, as a new method of transmitting information. Among others, the number of retweets (RTs) and user types are the two important items of analysis for understanding the transmission of information on Twitter. To analyze this point, we applied text classification and feature extraction experiments using random forests machine learning with conventional stylistic and Twitter-specific features. We first collected tweets from 40 accounts with a high number of followers and created tweet texts from 28,756 tweets. We then conducted 15 types of classification experiments using a variety of combinations of features such as function words, speech terms, Twitter's descriptive grammar, and information roles. We deliberately observed the effects of features for classification performance. The results indicated that class classification per user indicated the best performance. Furthermore, we observed that certain features had a greater impact on classification. In the case of the experiments that assessed the level of RT quantity, information roles had an impact. In the case of user experiments, important features, such as the honorific postpositional particle and auxiliary verbs, such as "desu" and "masu," had an impact. This research clarifies the features that are useful for categorizing tweets according to the number of RTs and user types.

リンク情報
DOI
https://doi.org/10.1002/asi.23126
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000337030700008&DestApp=WOS_CPL
ID情報
  • DOI : 10.1002/asi.23126
  • ISSN : 2330-1635
  • eISSN : 2330-1643
  • Web of Science ID : WOS:000337030700008

エクスポート
BibTeX RIS