Options
Adapting a spatial access structure for document representations in vector space
Henrich, Andreas (1996): Adapting a spatial access structure for document representations in vector space, in: M. Tamer Özsu, Ken Barker, M. Tamer Özsu, u. a. (Hrsg.), CIKM ’96 : Proceedings of the fifth international conference on Information and knowledge management, New York u.a.: ACM, S. 19–26, doi: 10.1145/238355.238367.
Author:
Title of the compilation:
CIKM '96 : Proceedings of the fifth international conference on Information and knowledge management
Editors:
Özsu, M. Tamer
Barker, Ken
Conference:
1996 ACM CIKM International Conference on Information and Knowledge Management : November 12 - 16, 1996 ; Rockville, Maryland, USA
Publisher Information:
Year of publication:
1996
Pages:
ISBN:
978-0-89791-873-2
Language:
English
Abstract:
In the field of information-retrieval the vector space model has been proposed. In this model queries and documents are represented ae term vectors where each coefficient represents the relevance of a given term with respect to the document or query. A typical task in this context is to search for the documents most similar to a given query vector. On the other hand, algorithms to perform nearest neighbor and distance scan queries have been proposed for various types of spatial access structures. Unfortunately, these access structures assume implicitly that the number of dimensions is relatively small — which is not the case for document representation vectors. In this paper we discuss the adaptation of spatial access structures for document representation vectors. We describe how some peculiarities of document representation vectors can be exploited to overcome the problems with higher dimensions to a certain extend. We exploit these peculiarities introducing a new cluster split technique and a sophisticated algorithm to calculate an upper bound for the similarity of the documents located in a subtree of the access structure.
Keywords:
spatial access structure
Type:
Conferenceobject
Activation date:
July 13, 2015
Versioning
Question on publication
Permalink
https://fis.uni-bamberg.de/handle/uniba/36034