Paper

Peer-reviewed
2018

Efficient and Interactive Spatial-Semantic Image Retrieval

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
  • Ryosuke Furuta
  • Naoto Inoue
  • Toshihiko Yamasaki

Volume
10704
First page
190
Last page
202
Language
English
Publication type
Research paper (international conference proceedings)
DOI
10.1007/978-3-319-73603-7_16
Publisher
Springer Verlag

This paper proposes an efficient image retrieval system. When users wish to retrieve images with semantic and spatial constraints (e.g., a horse is located at the center of the image, and a person is riding on the horse), it is difficult for conventional text-based retrieval systems to retrieve such images exactly. In contrast, the proposed system can consider both semantic and spatial information, because it is based on semantic segmentation using fully convolutional networks (FCN). The proposed system can accept three types of images as queries: a segmentation map sketched by the user, a natural image, or a combination of the two. The distance between the query and each image in the database is calculated based on the output probability maps from the FCN. In order to make the system efficient in terms of both the computation time and memory usage, we employ the product quantization technique (PQ). The experimental results show that the PQ is compatible with the FCN-based image retrieval system, and that the quantization process results in little information loss. It is also shown that our method outperforms a conventional text-based search system.
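The abstract does not state the exact distance function or the PQ configuration, so the following is only a minimal sketch of the general idea under stated assumptions: each image is represented by its flattened per-class probability map, distances are squared Euclidean, and the database is compressed with a basic product quantizer searched by asymmetric distance (the query is left unquantized). All function names (`kmeans`, `pq_train`, `pq_encode`, `pq_decode`, `pq_search`) and all sizes are illustrative, and random softmax outputs stand in for real FCN outputs.

```python
import numpy as np

def kmeans(x, k, iters=10, seed=0):
    """Plain Lloyd's k-means; returns a (k, d) array of centroids."""
    rng = np.random.default_rng(seed)
    centroids = x[rng.choice(len(x), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared distance from every point to every centroid.
        d2 = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        assign = d2.argmin(axis=1)
        for j in range(k):
            members = x[assign == j]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[j] = members.mean(axis=0)
    return centroids

def pq_train(vectors, n_sub, k):
    """Learn one codebook per sub-vector; returns (n_sub, k, d_sub)."""
    subs = np.split(vectors, n_sub, axis=1)
    return np.stack([kmeans(s, k, seed=i) for i, s in enumerate(subs)])

def pq_encode(vectors, codebooks):
    """Replace each sub-vector by the index of its nearest centroid."""
    subs = np.split(vectors, codebooks.shape[0], axis=1)
    codes = [
        ((s[:, None, :] - cb[None, :, :]) ** 2).sum(axis=2).argmin(axis=1)
        for s, cb in zip(subs, codebooks)
    ]
    return np.stack(codes, axis=1).astype(np.uint8)  # requires k <= 256

def pq_decode(codes, codebooks):
    """Reconstruct approximate vectors from their codes."""
    parts = [codebooks[m][codes[:, m]] for m in range(codebooks.shape[0])]
    return np.concatenate(parts, axis=1)

def pq_search(query, codes, codebooks):
    """Asymmetric squared distances from an unquantized query to all codes."""
    n_sub = codebooks.shape[0]
    q_subs = np.split(query, n_sub)
    # table[m, j] = squared distance from query sub-vector m to centroid j.
    table = np.stack(
        [((cb - q) ** 2).sum(axis=1) for q, cb in zip(q_subs, codebooks)]
    )
    # Sum the looked-up entries over sub-vectors for every database item.
    return table[np.arange(n_sub), codes].sum(axis=1)

# Stand-in for FCN outputs (assumption): 200 images, 4 classes, 4x4 grid.
rng = np.random.default_rng(0)
logits = rng.normal(size=(200, 4, 4, 4))
probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
database = probs.reshape(200, -1)           # 64-dimensional vectors

codebooks = pq_train(database, n_sub=8, k=16)
codes = pq_encode(database, codebooks)      # 8 one-byte codes per image
query = database[0]                         # probability map of a query image
dists = pq_search(query, codes, codebooks)
ranking = np.argsort(dists)[:5]             # indices of the 5 nearest images
```

In this sketch each 64-dimensional vector is stored as eight one-byte codes, and a query costs one small lookup table plus eight table reads per database image, which is the kind of time and memory saving the abstract attributes to PQ. The asymmetric distance equals the exact squared distance between the query and the reconstructed (decoded) database vector, so any retrieval error comes only from the quantization itself.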

Links
DOI
https://doi.org/10.1007/978-3-319-73603-7_16
DBLP
https://dblp.uni-trier.de/rec/conf/mmm/FurutaIY18
URL
http://dblp.uni-trier.de/db/conf/mmm/mmm2018-1.html#conf/mmm/FurutaIY18
Identifiers
  • DOI : 10.1007/978-3-319-73603-7_16
  • ISSN : 1611-3349
  • ISSN : 0302-9743
  • DBLP ID : conf/mmm/FurutaIY18
  • SCOPUS ID : 85042112521
