論文

2017年7月

An in-depth study on diversity evaluation: The importance of intrinsic diversity

INFORMATION PROCESSING & MANAGEMENT
  • Hai-Tao Yu
  • ,
  • Adam Jatowt
  • ,
  • Roi Blanco
  • ,
  • Hideo Joho
  • ,
  • Joemon M. Jose

53
4
開始ページ
799
終了ページ
813
記述言語
英語
掲載種別
研究論文(学術雑誌)
DOI
10.1016/j.ipm.2017.03.001
出版者・発行元
ELSEVIER SCI LTD

Diversified document ranking has been recognized as an effective strategy to tackle ambiguous and/or underspecified queries. In this paper, we conduct an in-depth study on diversity evaluation that provides insights for assessing the performance of a diversified retrieval system. By casting the widely used diversity metrics (e.g., ERR-IA, alpha-nDCG and D#-nDCG) into a unified framework based on marginal utility, we analyze how these metrics capture extrinsic diversity and intrinsic diversity. Our analyses show that the prior metrics (ERR-IA, a-nDCG and D#-nDCG) are not able to precisely measure intrinsic diversity if we merely feed a set of subtopics into them in a traditional manner (i.e., without fine-grained relevance knowledge per subtopic). As the redundancy of relevant documents with respect to each specific information need (i.e., subtopic) can not be then detected and solved, the overall diversity evaluation may not be reliable. Furthermore, a series of experiments are conducted on a gold standard collection (English and Chinese) and a set of submitted runs, where the intent-square metrics that extend the diversity metrics through incorporating hierarchical subtopics are used as references. The experimental results show that the intent square metrics disagree with the diversity metrics (ERR-IA and a-nDCG) being used in a traditional way oh top-ranked runs, and that the average precision correlation scores between intent-square metrics and the prior diversity metrics (ERR-IA and a-nDCG) are fairly low. These results justify our analyses, and uncover the previously-unknown importance of intrinsic diversity to the overall diversity evaluation. (C) 2017 Elsevier Ltd. All rights reserved.

リンク情報
DOI
https://doi.org/10.1016/j.ipm.2017.03.001
Web of Science
https://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=JSTA_CEL&SrcApp=J_Gate_JST&DestLinkType=FullRecord&KeyUT=WOS:000401400000004&DestApp=WOS_CPL
ID情報
  • DOI : 10.1016/j.ipm.2017.03.001
  • ISSN : 0306-4573
  • eISSN : 1873-5371
  • Web of Science ID : WOS:000401400000004

エクスポート
BibTeX RIS