2020年7月
Embeddings of Label Components for Sequence Labeling: A Case Study of Fine-grained Named Entity Recognition.
ACL-2020 (Student Research Workshop)
- ,
- ,
- ,
- ,
- ,
- 開始ページ
- 222
- 終了ページ
- 229
- 記述言語
- 英語
- 掲載種別
- DOI
- 10.18653/v1/2020.acl-srw.30
- 出版者・発行元
- Association for Computational Linguistics
In general, the labels used in sequence labeling consist of different types
of elements. For example, IOB-format entity labels, such as B-Person and
I-Person, can be decomposed into span (B and I) and type information (Person).
However, while most sequence labeling models do not consider such label
components, the shared components across labels, such as Person, can be
beneficial for label prediction. In this work, we propose to integrate label
component information as embeddings into models. Through experiments on English
and Japanese fine-grained named entity recognition, we demonstrate that the
proposed method improves performance, especially for instances with
low-frequency labels.
of elements. For example, IOB-format entity labels, such as B-Person and
I-Person, can be decomposed into span (B and I) and type information (Person).
However, while most sequence labeling models do not consider such label
components, the shared components across labels, such as Person, can be
beneficial for label prediction. In this work, we propose to integrate label
component information as embeddings into models. Through experiments on English
and Japanese fine-grained named entity recognition, we demonstrate that the
proposed method improves performance, especially for instances with
low-frequency labels.
- リンク情報
-
- DOI
- https://doi.org/10.18653/v1/2020.acl-srw.30
- DBLP
- https://dblp.uni-trier.de/rec/conf/acl/KatoAOMSI20
- arXiv
- http://arxiv.org/abs/arXiv:2006.01372
- URL
- https://www.aclweb.org/anthology/2020.acl-srw.30/
- URL
- https://dblp.uni-trier.de/conf/acl/2020-s
- URL
- https://dblp.uni-trier.de/db/conf/acl/acl2020-s.html#KatoAOMSI20
- ID情報
-
- DOI : 10.18653/v1/2020.acl-srw.30
- DBLP ID : conf/acl/KatoAOMSI20
- arXiv ID : arXiv:2006.01372