論文

2019年

Recognition of Japanese historical text lines by an attention-based encoder-decoder and text line generation

PROCEEDINGS OF THE 2019 WORKSHOP ON HISTORICAL DOCUMENT IMAGING AND PROCESSING (HIP' 19)
  • Anh Duc Le
  • ,
  • Daichi Mochihashi
  • ,
  • Katsuya Masuda
  • ,
  • Hideki Mima
  • ,
  • Nam Tuan Ly

開始ページ
37
終了ページ
41
記述言語
英語
掲載種別
研究論文(国際会議プロシーディングス)
DOI
10.1145/3352631.3352641
出版者・発行元
ASSOC COMPUTING MACHINERY

Inspired by the recent successes of attention based encoder-decoder (AED) approach on image captioning, machine translation, we present an AED model as an end-to-end recognition system for recognizing Japanese historical documents. The recognition system has two main modules: a dense convolution neural network for extracting features, and a Long Shor Term Memory (LSTM) decoder integrating with attention model for generating target text. We can train the model end-to-end. The model requires only input text line images and corresponding output characters. Therefore, we don't need annotations for characters and save a lot of time for making annotations. We also present a method to generate artificial text lines to solve the imbalance problem of the current annotated database. The results of experiments on the annotated and artificial databases demonstrate the effectiveness of the text line generation. Our recognition system achieved Character Error Rate of 23.76% and 22.52% by training with and without artificial text lines, respectively. Moreover, our recognition system outperforms the CNN-LSTM system, which achieved the state-of-art results in other document recognition tasks.

リンク情報
DOI
https://doi.org/10.1145/3352631.3352641
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000518196200007&DestApp=WOS_CPL
Scopus
https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=85074797503&origin=inward
Scopus Citedby
https://www.scopus.com/inward/citedby.uri?partnerID=HzOxMe3b&scp=85074797503&origin=inward
ID情報
  • DOI : 10.1145/3352631.3352641
  • ISSN : 2153-1633
  • SCOPUS ID : 85074797503
  • Web of Science ID : WOS:000518196200007

エクスポート
BibTeX RIS