2019年
Recognition of Japanese historical text lines by an attention-based encoder-decoder and text line generation
PROCEEDINGS OF THE 2019 WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING (HIP' 19)
- ,
- ,
- ,
- ,
- 開始ページ
- 37
- 終了ページ
- 41
- 記述言語
- 英語
- 掲載種別
- 研究論文(国際会議プロシーディングス)
- DOI
- 10.1145/3352631.3352641
- 出版者・発行元
- ASSOC COMPUTING MACHINERY
Inspired by the recent successes of attention based encoder-decoder (AED) approach on image captioning, machine translation, we present an AED model as an end-to-end recognition system for recognizing Japanese historical documents. The recognition system has two main modules: a dense convolution neural network for extracting features, and a Long Shor Term Memory (LSTM) decoder integrating with attention model for generating target text. We can train the model end-to-end. The model requires only input text line images and corresponding output characters. Therefore, we don't need annotations for characters and save a lot of time for making annotations. We also present a method to generate artificial text lines to solve the imbalance problem of the current annotated database. The results of experiments on the annotated and artificial databases demonstrate the effectiveness of the text line generation. Our recognition system achieved Character Error Rate of 23.76% and 22.52% by training with and without artificial text lines, respectively. Moreover, our recognition system outperforms the CNN-LSTM system, which achieved the state-of-art results in other document recognition tasks.
- リンク情報
-
- DOI
- https://doi.org/10.1145/3352631.3352641
- Web of Science
- https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000518196200007&DestApp=WOS_CPL
- Scopus
- https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85074797503&origin=inward
- Scopus Citedby
- https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85074797503&origin=inward
- ID情報
-
- DOI : 10.1145/3352631.3352641
- ISSN : 2153-1633
- SCOPUS ID : 85074797503
- Web of Science ID : WOS:000518196200007