論文

査読有り 本文へのリンクあり 国際誌
2022年6月18日

Automatic data extraction to support meta-analysis statistical analysis: a case study on breast cancer.

BMC medical informatics and decision making
  • Faith Wavinya Mutinda
  • ,
  • Kongmeng Liew
  • ,
  • Shuntaro Yada
  • ,
  • Shoko Wakamiya
  • ,
  • Eiji Aramaki

22
1
開始ページ
158
終了ページ
158
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1186/s12911-022-01897-4

BACKGROUND: Meta-analyses aggregate results of different clinical studies to assess the effectiveness of a treatment. Despite their importance, meta-analyses are time-consuming and labor-intensive as they involve reading hundreds of research articles and extracting data. The number of research articles is increasing rapidly and most meta-analyses are outdated shortly after publication as new evidence has not been included. Automatic extraction of data from research articles can expedite the meta-analysis process and allow for automatic updates when new results become available. In this study, we propose a system for automatically extracting data from research abstracts and performing statistical analysis. MATERIALS AND METHODS: Our corpus consists of 1011 PubMed abstracts of breast cancer randomized controlled trials annotated with the core elements of clinical trials: Participants, Intervention, Control, and Outcomes (PICO). We proposed a BERT-based named entity recognition (NER) model to identify PICO information from research abstracts. After extracting the PICO information, we parse numeric outcomes to identify the number of patients having certain outcomes for statistical analysis. RESULTS: The NER model extracted PICO elements with relatively high accuracy, achieving F1-scores greater than 0.80 in most entities. We assessed the performance of the proposed system by reproducing the results of an existing meta-analysis. The data extraction step achieved high accuracy, however the statistical analysis step achieved low performance because abstracts sometimes lack all the required information. CONCLUSION: We proposed a system for automatically extracting data from research abstracts and performing statistical analysis. We evaluated the performance of the system by reproducing an existing meta-analysis and the system achieved a relatively good performance, though more substantiation is required.

リンク情報
DOI
https://doi.org/10.1186/s12911-022-01897-4 本文へのリンクあり
PubMed
https://www.ncbi.nlm.nih.gov/pubmed/35717167
PubMed Central
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9206132
ID情報
  • DOI : 10.1186/s12911-022-01897-4
  • PubMed ID : 35717167
  • PubMed Central 記事ID : PMC9206132

エクスポート
BibTeX RIS