論文

査読有り 国際誌
2020年7月

Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition.

ACL-2020 (Student Research Workshop)
  • Takuma Kato
  • ,
  • Kaori Abe
  • ,
  • Hiroki Ouchi
  • ,
  • Shumpei Miyawaki
  • ,
  • Jun Suzuki
  • ,
  • Kentaro Inui

開始ページ
222
終了ページ
229
記述言語
英語
掲載種別
DOI
10.18653/v1/2020.acl-srw.30
出版者・発行元
Association for Computational Linguistics

In general, the labels used in sequence labeling consist of different types
of elements. For example, IOB-format entity labels, such as B-Person and
I-Person, can be decomposed into span (B and I) and type information (Person).
However, while most sequence labeling models do not consider such label
components, the shared components across labels, such as Person, can be
beneficial for label prediction. In this work, we propose to integrate label
component information as embeddings into models. Through experiments on English
and Japanese fine-grained named entity recognition, we demonstrate that the
proposed method improves performance, especially for instances with
low-frequency labels.

リンク情報
DOI
https://doi.org/10.18653/v1/2020.acl-srw.30
DBLP
https://dblp.uni-trier.de/rec/conf/acl/KatoAOMSI20
arXiv
http://arxiv.org/abs/arXiv:2006.01372
URL
https://www.aclweb.org/anthology/2020.acl-srw.30/
URL
https://dblp.uni-trier.de/conf/acl/2020-s
URL
https://dblp.uni-trier.de/db/conf/acl/acl2020-s.html#KatoAOMSI20
ID情報
  • DOI : 10.18653/v1/2020.acl-srw.30
  • DBLP ID : conf/acl/KatoAOMSI20
  • arXiv ID : arXiv:2006.01372

エクスポート
BibTeX RIS