# Doctoral Degrees:1996, Ph.D. (UMIST, UK)
# Research Fields:Natural Language Processing, Machine Learning, Information Extraction
Web surveillance for public health risk management:
Suppose a serious public health crisis occurs in Japan, or somewhere else in the world. What's the most effective way to minimize the risk (risk management) of problems caused by the crisis? The first course of action is to identify the crisis and take all possible response measures. To do this successfully, it's important to obtain accurate information on the crisis as quickly as possible.
Developing a system to prevent the spread of communicable diseases:
If an infectious disease outbreak occurs somewhere in the world, one of the most important things to do at the initial stage of risk management is finding a way to allow national governments and medical specialists to get accurate and correct information as quickly as possible. In the past, information was acquired by manual efforts including monitoring local news broadcasts in the region where the disease broke out. But due to several factors, including language differences and the massive volume of news, this isn't an especially efficient way to collect information. A project called BioCaster was initiated in 2006 to reduce this problem.
BioCaster aims to automatically scan news in the regions or countries where the outbreak of a disease is reported and produces a brief summary of the incident in various languages. This allows public health workers around the world to find out about the outbreaks of dangerous diseases in real-time. To start, the system is scheduled to provide information in four languages, English, Japanese, Vietnamese, and Thai.
Creating a medical information infrastructure based on BioCaster:
Several current systems scan public health information on the Web, but they all have drawbacks-limited language compatibility or lack of ease of use. The basic technologies for solving these problems are being developed in BioCaster. Some of the benefits of the research can be seen on our website (http://biocaster.nii.ac.jp) including the global health monitor for mapping disease and a searchable ontology.
In addition to obtaining information on disease outbreaks, we also aim to provide intelligent links to biomedical journals into the system to give medical specialists easy access to the latest in research information.
My field of expertise is natural language processing, but I've been studying text mining (text data analysis methods) since participating in the University of Tokyo's GENIA project in 1998. The goal of the GENIA project was to develop a system that automatically analyzes and retrieves research results from the Web to reduce the significant delays in getting biomedical research results into databases. Text mining technologies can be applied to various fields and offer great potential for helping professional workers find specific information more efficiently. Implementing these technologies will require the mutual efforts of experts from various fields.
Rebholz-Schuhmann D, Yepes AJ, Li C, Kafkas S, Lewin I, Kang N, Corbett P, Milward D, Buyko E, Beisswanger E, Hornbostel K, Kouznetsov A, Witte R, Laurila JB, Baker CJ, Kuo CJ, Clematide S, Rinaldi F, Farkas R, Móra G, Hara K, Furlong LI, Rautschka M, Neves ML, Pascual-Montano A, Wei Q, Collier N, Chowdhury MF, Lavelli A, Berlanga R, Morante R, Van Asch V, Daelemans W, Marina JL, van Mulligen E, Kors J, Hahn U
Journal of biomedical semantics 2 Suppl 5 S11 Oct 2011 [Refereed]
Recent studies have shown strong correlation between social networking data
and national influenza rates. We expanded upon this success to develop an
automated text mining system that classifies Twitter messages in real time into
six syndromic categories based on key terms from a public health ontology.
10-fold cross validation tests were used to compare Naive Bayes (NB) and
Support Vector Mach...
Journal of Biomedical Semantics 2011, 2(Suppl 5):S10 Oct 2011
Background: Online news reports are increasingly becoming a source for event
based early warning systems that detect natural disasters. Harnessing the
massive volume of information available from multilingual newswire presents as
many challenges as opportunities due to the patterns of reporting complex
spatiotemporal events. Results: In this article we study the problem of
utilising correlated ...
Social media such as Facebook and Twitter have proven to be a useful resource
to understand public opinion towards real world events. In this paper, we
investigate over 1.5 million Twitter messages (tweets) for the period 9th March
2011 to 31st May 2011 in order to track awareness and anxiety levels in the
Tokyo metropolitan district to the 2011 Tohoku Earthquake and subsequent
tsunami and nucl...
This paper look at how the Hopfield neural network can be used to store and
recall patterns constructed from natural language sentences. As a pattern
recognition and storage tool, the Hopfield neural network has received much
attention. This attention however has been mainly in the field of statistical
physics due to the model's simple abstraction of spin glass systems. A
discussion is made of ...