MISC

2008年12月

BioCaster: detecting public health rumors with a Web-based text mining system

BIOINFORMATICS
  • Nigel Collier
  • Son Doan
  • Ai Kawazoe
  • Reiko Matsuda Goodwin
  • Mike Conway
  • Yoshio Tateno
  • Quoc-Hung Ngo
  • Dinh Dien
  • Asanee Kawtrakul
  • Koichi Takeuchi
  • Mika Shigematsu
  • Kiyosu Taniguchi
  • 全て表示

24
24
開始ページ
2940
終了ページ
2941
記述言語
英語
掲載種別
DOI
10.1093/bioinformatics/btn534
出版者・発行元
OXFORD UNIV PRESS

BioCaster is an ontology-based text mining system for detecting and tracking the distribution of infectious disease outbreaks from linguistic signals on the Web. The system continuously analyzes documents reported from over 1700 RSS feeds, classifies them for topical relevance and plots them onto a Google map using geocoded information. The background knowledge for bridging the gap between Layman's terms and formal-coding systems is contained in the freely available BioCaster ontology which includes information in eight languages focused on the epidemiological role of pathogens as well as geographical locations with their latitudes/longitudes. The system consists of four main stages: topic classification, named entity recognition (NER), disease/location detection and event recognition. Higher order event analysis is used to detect more precisely specified warning signals that can then be notified to registered users via email alerts. Evaluation of the system for topic recognition and entity identification is conducted on a gold standard corpus of annotated news articles.

リンク情報
DOI
https://doi.org/10.1093/bioinformatics/btn534
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000261456700027&DestApp=WOS_CPL
ID情報
  • DOI : 10.1093/bioinformatics/btn534
  • ISSN : 1367-4803
  • eISSN : 1460-2059
  • identifiers.cinii_nr_id : 9000239248799
  • Web of Science ID : WOS:000261456700027

エクスポート
BibTeX RIS