A new access method, called M-tree, is proposed to organize and search large data sets from a generic "metric space", i.e. where object proximity is only denned by a distance function satisfying the positivity, symmetry, and triangle inequality postulates. We detail algorithms for insertion of objects and split management, which keep the M-tree always balanced-several heuristic split alternatives are considered and experimentally evaluated. Algorithms for similarity (range and k-nearest neighbors) queries are also described. Results from extensive experimentation with a prototype system are reported, considering as the performance criteria the number of page I/O's and the number of distance computations. The results demonstrate that the M-tree indeed extends the domain of applicability beyond the traditional vector spaces, performs reasonably well in high-dimensional data spaces, and scales well in case of growing files.

M-tree: An efficient access method for similarity search in metric spaces / Ciaccia P.; Patella M.; Zezula P.. - STAMPA. - (1997), pp. 426-435. (Intervento presentato al convegno 23rd International Conference on Very Large Databases, VLDB 1997 tenutosi a Athens, Greece nel August 25-29, 1997).

M-tree: An efficient access method for similarity search in metric spaces

Ciaccia P.;Patella M.;
1997

Abstract

A new access method, called M-tree, is proposed to organize and search large data sets from a generic "metric space", i.e. where object proximity is only denned by a distance function satisfying the positivity, symmetry, and triangle inequality postulates. We detail algorithms for insertion of objects and split management, which keep the M-tree always balanced-several heuristic split alternatives are considered and experimentally evaluated. Algorithms for similarity (range and k-nearest neighbors) queries are also described. Results from extensive experimentation with a prototype system are reported, considering as the performance criteria the number of page I/O's and the number of distance computations. The results demonstrate that the M-tree indeed extends the domain of applicability beyond the traditional vector spaces, performs reasonably well in high-dimensional data spaces, and scales well in case of growing files.
1997
Proceedings of the 23rd International Conference on Very Large Databases, VLDB 1997
426
435
M-tree: An efficient access method for similarity search in metric spaces / Ciaccia P.; Patella M.; Zezula P.. - STAMPA. - (1997), pp. 426-435. (Intervento presentato al convegno 23rd International Conference on Very Large Databases, VLDB 1997 tenutosi a Athens, Greece nel August 25-29, 1997).
Ciaccia P.; Patella M.; Zezula P.
File in questo prodotto:
Eventuali allegati, non sono esposti

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11585/918389
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1276
  • ???jsp.display-item.citation.isi??? 705
social impact