Fast in‐memory XPath search using compressed indexes. (3rd October 2013)
- Record Type:
- Journal Article
- Title:
- Fast in‐memory XPath search using compressed indexes. (3rd October 2013)
- Main Title:
- Fast in‐memory XPath search using compressed indexes
- Authors:
- Arroyuelo, Diego
Claude, Francisco
Maneth, Sebastian
Mäkinen, Veli
Navarro, Gonzalo
Nguyễn, Kim
Sirén, Jouni
Välimäki, Niko - Abstract:
- <abstract abstract-type="main" id="spe2227-abs-0001"> <title>Summary</title> <p>Extensible Markup Language (XML) documents consist of text data plus structured data (markup). XPath allows to query both text and structure. Evaluating such hybrid queries is challenging. We present a system for in‐memory evaluation of <italic>XPath search queries</italic>, that is, queries with text and structure predicates, yet without advanced features such as backward axes, arithmetics, and joins. We show that for this query fragment, which contains <italic>Forward Core XPath</italic>, our system, dubbed Succinct XML Self‐Index ('SXSI'), outperforms existing systems by 1–3 orders of magnitude. SXSI is based on state‐of‐the‐art indexes for text and structure data. It combines two novelties. On one hand, it represents the XML data in a compact indexed form, which allows it to handle larger collections in main memory while supporting powerful search and navigation operations over the text and the structure. On the other hand, it features an execution engine that uses tree automata and cleverly chooses evaluation orders that leverage the speeds of the respective indexes. SXSI is modular and allows seamless replacement of its indexes. This is demonstrated through experiments with (1) a text index specialized for search of bio sequences, and (2) a word‐based text index specialized for natural language search. Copyright © 2013 John Wiley & Sons, Ltd.</p> </abstract>
- Is Part Of:
- Software, practice & experience. Volume 45:Number 3(2015)
- Journal:
- Software, practice & experience
- Issue:
- Volume 45:Number 3(2015)
- Issue Display:
- Volume 45, Issue 3 (2015)
- Year:
- 2015
- Volume:
- 45
- Issue:
- 3
- Issue Sort Value:
- 2015-0045-0003-0000
- Page Start:
- 399
- Page End:
- 434
- Publication Date:
- 2013-10-03
- Subjects:
- Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/spe.2227 ↗
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 3363.xml