January 19, 2018
Temporal Localization and Spatial Segmentation of Joint Attention in Multiple First-Person Videos
Proceedings - 2017 IEEE International Conference on Computer Vision Workshops, ICCVW 2017
- Volume
- 2018-
- Issue
- Start page
- 2313
- End page
- 2321
- Language
- English
- Publication type
- Research paper (international conference proceedings)
- DOI
- 10.1109/ICCVW.2017.273
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
This work aims to develop a computer-vision technique for understanding objects jointly attended by a group of people during social interactions. As a key tool to discover such objects of joint attention, we rely on a collection of wearable eye-tracking cameras that provide a first-person video of interaction scenes and points-of-gaze data of interacting parties. Technically, we propose a hierarchical conditional random field-based model that can 1) localize events of joint attention temporally and 2) segment objects of joint attention spatially. We show that by alternating these two procedures, objects of joint attention can be discovered reliably even from cluttered scenes and noisy points-of-gaze data. Experimental results demonstrate that our approach outperforms several state-of-the-art methods for co-segmentation and joint attention discovery.
- Link information
- ID information
- DOI : 10.1109/ICCVW.2017.273
- DBLP ID : conf/iccvw/HuangCKYHS17
- SCOPUS ID : 85046298107