Improving Read Throughput of Deduplicated Cloud Storage using Frequent Pattern-Based Prefetching Technique. (18th March 2016)
- Record Type:
- Journal Article
- Title:
- Improving Read Throughput of Deduplicated Cloud Storage using Frequent Pattern-Based Prefetching Technique. (18th March 2016)
- Main Title:
- Improving Read Throughput of Deduplicated Cloud Storage using Frequent Pattern-Based Prefetching Technique
- Authors:
- Balasundaram, Prabavathy
Babu, Chitra
Devi M, Subha - Abstract:
- Abstract: In a cloud storage, while deduplication enables optimal storage space utilization, it also incurs a substantial overhead in maintaining the metadata, namely, the Fingerprint Index and the File Recipe . As this metadata is huge, it has to be stored in the disk which causes considerable read-latency in Deduplicated Cloud Storage (DCS). In order to improve this read-latency, it would be highly beneficial to prefetch the relevant fingerprints in a cache. Many existing research solutions have utilized either the spatial locality or the similarity among the files to prefetch the relevant fingerprints. However, the DCS that has been designed and implemented in this paper is intended to cater to non-backup workloads that do not exhibit significant spatial locality or similarity among the files. Hence, this paper proposes a suitable alternative prefetching approach that mines the pattern of client read accesses to find the most frequently accessed files. The proposed prefetching approach has been implemented and incorporated in the DCS. The experimental investigations indicate that the proposed prefetching approach improves the cache hit rates by 140% and increases the read throughput by 88% when compared with the Extreme Binning approach (Bhagwat, D., Eshghi, K. Long, D.D.E. and Lillibridge, M. (2009) Extreme Binning: Scalable, Parallel Deduplication for Chunk-based File Backup. Proc. MASCOTS'09, London, UK, September 21–23, pp. 1–9. IEEE) while incurring only a marginalAbstract: In a cloud storage, while deduplication enables optimal storage space utilization, it also incurs a substantial overhead in maintaining the metadata, namely, the Fingerprint Index and the File Recipe . As this metadata is huge, it has to be stored in the disk which causes considerable read-latency in Deduplicated Cloud Storage (DCS). In order to improve this read-latency, it would be highly beneficial to prefetch the relevant fingerprints in a cache. Many existing research solutions have utilized either the spatial locality or the similarity among the files to prefetch the relevant fingerprints. However, the DCS that has been designed and implemented in this paper is intended to cater to non-backup workloads that do not exhibit significant spatial locality or similarity among the files. Hence, this paper proposes a suitable alternative prefetching approach that mines the pattern of client read accesses to find the most frequently accessed files. The proposed prefetching approach has been implemented and incorporated in the DCS. The experimental investigations indicate that the proposed prefetching approach improves the cache hit rates by 140% and increases the read throughput by 88% when compared with the Extreme Binning approach (Bhagwat, D., Eshghi, K. Long, D.D.E. and Lillibridge, M. (2009) Extreme Binning: Scalable, Parallel Deduplication for Chunk-based File Backup. Proc. MASCOTS'09, London, UK, September 21–23, pp. 1–9. IEEE) while incurring only a marginal computational overhead of 1.7 s. … (more)
- Is Part Of:
- Computer journal. Volume 60:Number 3(2017)
- Journal:
- Computer journal
- Issue:
- Volume 60:Number 3(2017)
- Issue Display:
- Volume 60, Issue 3 (2017)
- Year:
- 2017
- Volume:
- 60
- Issue:
- 3
- Issue Sort Value:
- 2017-0060-0003-0000
- Page Start:
- 444
- Page End:
- 456
- Publication Date:
- 2016-03-18
- Subjects:
- cloud storage -- deduplication -- storage optimization -- prefetching -- association rule mining
Computers -- Periodicals
005.1 - Journal URLs:
- http://comjnl.oxfordjournals.org/ ↗
http://ukcatalogue.oup.com/ ↗ - DOI:
- 10.1093/comjnl/bxw013 ↗
- Languages:
- English
- ISSNs:
- 0010-4620
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.060000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21744.xml