This paper adopts the premise that the 'semantic gap' is an incompletely surveyed feature in the landscape of visual image retrieval, and proposes a framework within which this deficiency might be made good. Simple classifications of types of image and of types of user are proposed. Consideration is then given in outline to how semantic content is realised by each class of user within each class of image. The argument is advanced that this realisation finds expression in perceptual, generic interpretive and specific interpretive content. This analytic framework provides the basis for the specification of a broadly encompassing evaluation study, which will employ the image/user type classification and the expert domain knowledge of selected user groups in the construction of segmented test collections of real queries, images and relevance judgements. From this study should come a better-informed view on the nature of semantic information need, and on the representation and recovery of semantic content across a broad spectrum of image retrieval activity.
|Number of pages||9|
|Journal||Lecture Notes in Computer Science|
|Publication status||Published - 2003|
Bibliographical noteThe final publication is available at link.springer.com
Enser, P. G. B., & Sandom, C. J. (2003). Towards a comprehensive survey of the semantic gap in visual image retrieval. Lecture Notes in Computer Science, 2728, 291-299. http://www.springerlink.com/openurl.asp?genre=article&issn=0302-9743&volume=2728&spage=291