Parallel and distributed clustering framework for big spatial data mining. Issue 6 (2nd November 2019)
- Record Type:
- Journal Article
- Title:
- Parallel and distributed clustering framework for big spatial data mining. Issue 6 (2nd November 2019)
- Main Title:
- Parallel and distributed clustering framework for big spatial data mining
- Authors:
- Bendechache, Malika
Tari, A-Kamel
Kechadi, M-Tahar - Abstract:
- ABSTRACT: Clustering techniques are very attractive for identifying and extracting patterns of interests from datasets. However, their application to very large spatial datasets presents numerous challenges such as high-dimensionality, heterogeneity, and high complexity of some algorithms. Distributed clustering techniques constitute a very good alternative to the Big Data challenges (e.g., Volume, Variety, Veracity, and Velocity). In this paper, we developed and implemented a Dynamic Parallel and Distributed clustering (DPDC) approach that can analyse Big Data within a reasonable response time and produce accurate results, by using existing and current computing and storage infrastructure, such as cloud computing. The DPDC approach consists of two phases. The first phase is fully parallel and it generates local clusters and the second phase aggregates the local results to obtain global clusters. The aggregation phase is designed in such a way that the final clusters are compact and accurate while the overall process is efficient in time and memory allocation. DPDC was thoroughly tested and compared to well-known clustering algorithms BIRCH and CURE. The results show that the approach not only produces high-quality results but also scales up very well by taking advantage of the Hadoop MapReduce paradigm or any distributed system. GRAPHICAL ABSTRACT:
- Is Part Of:
- International journal of parallel, emergent and distributed systems. Volume 34:Issue 6(2019)
- Journal:
- International journal of parallel, emergent and distributed systems
- Issue:
- Volume 34:Issue 6(2019)
- Issue Display:
- Volume 34, Issue 6 (2019)
- Year:
- 2019
- Volume:
- 34
- Issue:
- 6
- Issue Sort Value:
- 2019-0034-0006-0000
- Page Start:
- 671
- Page End:
- 689
- Publication Date:
- 2019-11-02
- Subjects:
- Big Data -- MapReduce -- Hadoop -- spatial data mining -- clustering -- distributed clustering -- parallel clustering -- DBSCAN -- dynamic K-means
Parallel computers -- Periodicals
Electronic data processing -- Distributed processing -- Periodicals
Computer algorithms -- Periodicals
004.35 - Journal URLs:
- http://www.tandfonline.com/toc/gpaa20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/17445760.2018.1446210 ↗
- Languages:
- English
- ISSNs:
- 1744-5760
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.441300
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 11648.xml