MISC

2016年1月1日

Statistical analysis of automatic seed word acquisition to improve harmful expression extraction in cyberbullying detection

International Journal of Engineering and Technology Innovation
  • Suzuha Hatakeyama
  • ,
  • Fumito Masui
  • ,
  • Michal Ptaszynski
  • ,
  • Kazuhide Yamamoto

6
2
開始ページ
165
終了ページ
172

© TAETI. We study the social problem of cyberbullying, defined as a new form of bullying that takes place in the Internet space. This paper proposes a method for automatic acquisition of seed words to improve performance of the original method for the cyberbullying detection by Nitta et al. [1]. We conduct an experiment exactly in the same settings to find out that the method based on a Web mining technique, lost over 30% points of its performance since being proposed in 2013. Thus, we hypothesize on the reasons for the decrease in the performance and propose a number of improvements, from which we experimentally choose the best one. Furthermore, we collect several seed word sets using different approaches, evaluate and their precision. We found out that the influential factor in extraction of harmful expressions is not the number of seed words, but the way the seed words were collected and filtered.

リンク情報
URL
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=84986232791&origin=inward
Scopus Citedby
https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=84986232791&origin=inward
ID情報
  • ISSN : 2223-5329
  • eISSN : 2226-809X
  • SCOPUS ID : 84986232791

エクスポート
BibTeX RIS