Faster proximity searching with the distal SAT. (July 2016)
- Record Type:
- Journal Article
- Title:
- Faster proximity searching with the distal SAT. (July 2016)
- Main Title:
- Faster proximity searching with the distal SAT
- Authors:
- Chávez, Edgar
Ludueña, Verónica
Reyes, Nora
Roggero, Patricia - Abstract:
- Abstract: Searching by proximity has been a source of puzzling behaviors and counter-intuitive findings for well established algorithmic design rules. One example is a linked list; it is the worst data structure for exact searching, and one of the most competitive for proximity searching. Common sense also dictates that an online data structure is less competitive than the full-knowledge, static version. A counter example in proximity searching is the static Spatial Approximation Tree ( SAT ), which is slower than its dynamic version ( DSAT ). In this paper we show that changing only the insertion policy of the SAT, leaving every other aspect of the data structure untouched, can produce a systematically faster index. We call the index Distal Spatial Approximation Tree ( DiSAT ). We found that even a random insertion policy produce a faster version of the SAT, which explains why the DSAT is faster than SAT . In brief, the SAT is improved by selecting distal, instead of proximal, nodes. This is the exact opposite of the insertion policy proposed in the original paper, and can be used in main or secondary memory versions of the index. We tested our approach with representatives of the state of the art in exact proximity searching. As it happens often in experimental setups, there are no absolute winners in all the aspects tested. Our data structure has no parameters to tune-up and a small memory footprint. In addition it can be constructed quickly. Our approach is among theAbstract: Searching by proximity has been a source of puzzling behaviors and counter-intuitive findings for well established algorithmic design rules. One example is a linked list; it is the worst data structure for exact searching, and one of the most competitive for proximity searching. Common sense also dictates that an online data structure is less competitive than the full-knowledge, static version. A counter example in proximity searching is the static Spatial Approximation Tree ( SAT ), which is slower than its dynamic version ( DSAT ). In this paper we show that changing only the insertion policy of the SAT, leaving every other aspect of the data structure untouched, can produce a systematically faster index. We call the index Distal Spatial Approximation Tree ( DiSAT ). We found that even a random insertion policy produce a faster version of the SAT, which explains why the DSAT is faster than SAT . In brief, the SAT is improved by selecting distal, instead of proximal, nodes. This is the exact opposite of the insertion policy proposed in the original paper, and can be used in main or secondary memory versions of the index. We tested our approach with representatives of the state of the art in exact proximity searching. As it happens often in experimental setups, there are no absolute winners in all the aspects tested. Our data structure has no parameters to tune-up and a small memory footprint. In addition it can be constructed quickly. Our approach is among the most competitive, those outperforming DiSAT achieve this at the expense of larger memory usage or an impractical construction time. … (more)
- Is Part Of:
- Information systems. Volume 59(2016)
- Journal:
- Information systems
- Issue:
- Volume 59(2016)
- Issue Display:
- Volume 59, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 59
- Issue:
- 2016
- Issue Sort Value:
- 2016-0059-2016-0000
- Page Start:
- 15
- Page End:
- 47
- Publication Date:
- 2016-07
- Subjects:
- Similarity search -- Metric spaces -- Metric access methods
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2015.10.014 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 366.xml