Robust and efficient memory management in Apache AsterixDB. (17th February 2020)
- Record Type:
- Journal Article
- Title:
- Robust and efficient memory management in Apache AsterixDB. (17th February 2020)
- Main Title:
- Robust and efficient memory management in Apache AsterixDB
- Authors:
- Kim, Taewoo
Behm, Alexander
Blow, Michael
Borkar, Vinayak
Bu, Yingyi
Carey, Michael J.
Hubail, Murtadha
Jahangiri, Shiva
Jia, Jianfeng
Li, Chen
Luo, Chen
Maxon, Ian
Pirzadeh, Pouria - Abstract:
- Summary: Traditional relational database systems handle data by dividing their memory into sections such as a buffer cache and working memory, assigning a memory budget to each section to efficiently manage a limited amount of overall memory. They also assign memory budgets to memory‐intensive operators such as sorts and joins and control the allocation of memory to these operators; each memory‐intensive operator attempts to maximize its memory usage to reduce disk I/O cost. Implementing such memory‐intensive operators requires a careful design and application of appropriate algorithms that properly utilize memory. Today's Big Data management systems need the ability to handle large amounts of data similarly, as it is unrealistic to assume that truly big data will fit into memory. In this article, we share our memory management experiences in Apache AsterixDB, an open‐source Big Data management software platform that scales out horizontally on shared‐nothing commodity computing clusters. We describe the implementation of AsterixDB's memory‐intensive operators and their designs related to memory management. We also discuss memory management at the global (cluster) level. We conducted an experimental study using several synthetic and real datasets to explore the impact of this work. We believe that future Big Data management system builders can benefit from these experiences.
- Is Part Of:
- Software, practice & experience. Volume 50:Number 7(2020)
- Journal:
- Software, practice & experience
- Issue:
- Volume 50:Number 7(2020)
- Issue Display:
- Volume 50, Issue 7 (2020)
- Year:
- 2020
- Volume:
- 50
- Issue:
- 7
- Issue Sort Value:
- 2020-0050-0007-0000
- Page Start:
- 1114
- Page End:
- 1151
- Publication Date:
- 2020-02-17
- Subjects:
- Apache AsterixDB -- big data management system -- group by -- hash join -- inverted‐index search -- memory management -- sort
Computer software -- Periodicals
Computer programming -- Periodicals
Computer programs -- Periodicals
005.3 - Journal URLs:
- http://onlinelibrary.wiley.com/ ↗
- DOI:
- 10.1002/spe.2799 ↗
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 8321.453000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 14816.xml