Abstract
The SPY-TEC (Spherical Pyramid-Technique) [8] was proposed as a new indexing method for high-dimensional data spaces using a special partitioning strategy that divides a d-dimensional data space into 2d spherical pyramids. Although the authors of [8] proposed an efficient algorithm for processing hyperspherical range queries, they did not propose an algorithm for processing k-nearest neighbor queries that are frequently used in similarity search. In this paper, we propose an efficient algorithm for processing exact nearest neighbor queries on the SPY-TEC by extending the incremental nearest neighbor algorithm proposed in [10]. We also introduce a metric that can be used to guide an ordered best-first traversal when finding nearest neighbors on the SPYTEC. Finally, we show that our technique significantly outperforms the related techniques in processing k-nearest neighbor queries by comparing it to the R*-tree, the X-tree, and the sequential scan through extensive experiments.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
A. Guttman. “R-trees: a dynamic index structure for spatial searching”. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 47–57, June 1984.
A. Henrich. “The LSDh-Tree: An Access Structure for Feature Vectors”. Proc. 14th Int. Conf on Data Engineering, pages 362–369, 1998.
C. Faloutsos, R. Barber, M. Flickner, J. Hafner, W. Niblack, D. Petkovic, and W. Equiz. “Efficient and Effective Querying by Image Content”. Journal of Intelligent Information System(JIIS), 3(3):231–262, July 1994.
B.C. Ooi, K.L. Tan, T.S. Chua, and W. Hsu. “Fast image retrieval using colorspatial information”. The VLDB Journal, 7(2):115–128, 1998.
C.E. Jacobs, A. Finkelstein, and D.H. Salesin. “Fast Multiresolution Image Query”. Proc. of the 1995 ACM SIGGRAPH, New York, 1995.
D.A. White and R. Jain. “Similarity Indexing with the SS-tree”. Proc. 12th Int. Conf on Data Engineering, pages 516–523, 1996.
D.B. Lomet and B. Salzberg. “The hB-Tree: A Multiattribute Indexing Method with Good Guaranteed Performance”. ACM Transaction on Database Systems, 15(4):625–658, 1990.
D.H. Lee and H.J. Kim. “SPY-TEC: An Efficient Indexing Method for Similarity Search in High-Dimensional Data Spaces”. Data & Knowledge Engineering, 34(1):77–97, 2000.
C. Faloutsos. “Fast Searching by Content in Multimedia Databases”. Data Engineering Bulletin, 18(4), 1995.
G.R. Hjaltason and H. Samet. “Distance Browsing in Spatial Databases”. ACM Transaction on Database Systems, 24(2):265–318, 1999.
J. Bentley. “Mutidimensional binary search trees used for associative searching”. Communications of the ACM, 18(9):509–517, 1975.
J.R. Smith and S.-F. Chang. “VisualSEEk: a fully automated content-based image query system”. ACM Multimedia 96, Boston, MA, 1996.
J.T. Robinson. “The K-D-B-tree: a Search Structure for Large Multidimensional Dynamic Indexes”. Proc. ACM SIGMOD, Ann Arbor, USA, pages 10–18, April 1981.
K.-I. Lin, H.V. Jagadish, and C. Faloutsos. “The TV-tree: An Index Structure for High-Dimensional Data”. The VLDB Journal, 3(4):517–542, 1994.
L. Leithold. “Trigonometry”. Addison-Wesley, 1989.
D.H. Lee and H.J. Kim. “An Efficient Nearest Neighbor Search in High-Dimensional Data Spaces”. Seoul National University, CE Technical Report (OOPSLA-TR1028), http://oopsla.snu.ac.kr/~dhlee/OOPSLA-TR1028.ps , 2000.
N. Katayama and S. Satoh. “The SR-tree: An Index Structure for High-Dimensional Nearest Neighbor Queries”. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 517–542, May 1997.
N. Roussopoulos, S. Kelley, and F. Vincent. “Nearest Neighbor Queries”. Proc. ACM SIGMOD Int. Conf. on Management of Data, pages 71–79, 1995.
K. Beyer, J. Goldstein, R. Ramakrishnan, and U. Shaft. “When Is “Nearest Neighbor” Meaningful ? ”. Proc. 7th Int. Conf. on Database Teory, pages 217–235, January 1999.
S. Berchtold, C. Böhm, and H.-P. Kriegel. “The Pyramid-Technique: Towards Breaking the Curse of Dimensionality”. Proc. ACM SIGMOD Int. Conf. on Management of Data, 1998.
S. Berchtold, C. Böhm, D.A. Keim, and H.-P. Kriegel. “A Cost Model For Nearest Neighbor Search in High-Dimensional Data Space”. ACM PODS Symposium on Principles of Database Systems, Tucson, Arizona, 1997.
S. Berchtold, D.A. Keim, and H.-P. Kriegel. “The X-tree: An Indexing Structure for High-Dimensional Data”. Proc. 22nd Int. Conf. on Very Large Database, pages 28–39, September 1996.
P.M. Kelly, T.M. Cannon and D.R. Hush. “Query by image example: the CANDID approach”. Proc. SPIE Storage and Retrieval for Image and Video Databases III, 2420: 238–248, 1995.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lee, DH., Lee, HD., Choi, IH., Kim, HJ. (2001). An Algorithm for Incremental Nearest Neighbor Search in High-Dimensional Data Spaces. In: Kim, W., Ling, TW., Lee, YJ., Park, SS. (eds) The Human Society and the Internet Internet-Related Socio-Economic Issues. HSI 2001. Lecture Notes in Computer Science, vol 2105. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-47749-7_35
Download citation
DOI: https://doi.org/10.1007/3-540-47749-7_35
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42313-3
Online ISBN: 978-3-540-47749-5
eBook Packages: Springer Book Archive