A feature-based intelligent deduplication compression system with extreme resemblance detection. Issue 3 (3rd July 2021)
- Record Type:
- Journal Article
- Title:
- A feature-based intelligent deduplication compression system with extreme resemblance detection. Issue 3 (3rd July 2021)
- Main Title:
- A feature-based intelligent deduplication compression system with extreme resemblance detection
- Authors:
- Wu, Xiaotong
Gao, Jiaquan
Ji, Genlin
Wu, Taotao
Tian, Yuan
Al-Nabhan, Najla - Abstract:
- ABSTRACT: With the fast development of various computing paradigms, the amount of data is rapidly increasing that brings the huge storage overhead. However, the existing data deduplication techniques do not make full use of similarity detection to improve the storage efficiency and data transmission rate. In this paper, we study the problem of utilising the duplicate and resemblance detection techniques to further compress data. We first present a framework of FIDCS-ERD, a feature-based intelligent deduplication compression system with extreme resemblance detection. We also introduce the main components and the detailed workflow of our compression system. We propose a content-defined chunking algorithm for duplicate detection and a Bloom filter-based resemblance detection algorithm. FIDCS-ERD implements the intelligent file chunking and the fast duplicate and resemblance detection. By extensive experiments over the real datasets, we demonstrate that FIDCS-ERD has better compression effect and more accurate resemblance detection compared to the existing approaches.
- Is Part Of:
- Connection science. Volume 33:Issue 3(2021)
- Journal:
- Connection science
- Issue:
- Volume 33:Issue 3(2021)
- Issue Display:
- Volume 33, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 33
- Issue:
- 3
- Issue Sort Value:
- 2021-0033-0003-0000
- Page Start:
- 576
- Page End:
- 604
- Publication Date:
- 2021-07-03
- Subjects:
- Data storage -- resemblance detection -- delta compression -- deduplication compression
Neural computers -- Periodicals
Artificial intelligence -- Periodicals
Cognitive science -- Periodicals
Connectionism -- Periodicals
006.3 - Journal URLs:
- http://www.tandfonline.com/toc/ccos20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/09540091.2020.1862058 ↗
- Languages:
- English
- ISSNs:
- 0954-0091
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3417.662450
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 18883.xml