Euclidean distance stratified random sampling based clustering model for big data mining. Issue 6 (26th October 2021)
- Record Type:
- Journal Article
- Title:
- Euclidean distance stratified random sampling based clustering model for big data mining. Issue 6 (26th October 2021)
- Main Title:
- Euclidean distance stratified random sampling based clustering model for big data mining
- Authors:
- Pandey, Kamlesh Kumar
Shukla, Diwakar - Abstract:
- Abstract: Big data mining is related to large‐scale data analysis and faces computational cost‐related challenges due to the exponential growth of digital technologies. Classical data mining algorithms suffer from computational deficiency, memory utilization, resource optimization, scale‐up, and speed‐up related challenges in big data mining. Sampling is one of the most effective data reduction techniques that reduces the computational cost, improves scalability and computational speed with high efficiency for any data mining algorithm in single and multiple machine execution environments. This study suggested a Euclidean distance‐based stratum method for stratum creation and a stratified random sampling‐based big data mining model using the K‐Means clustering (SSK‐Means) algorithm in a single machine execution environment. The performance of the SSK‐Means algorithm has achieved better cluster quality, speed‐up, scale‐up, and memory utilization against the random sampling‐based K‐Means and classical K‐Means algorithms using silhouette coefficient, Davies Bouldin index, Calinski Harabasz index, execution time, and speedup ratio internal measures.
- Is Part Of:
- Computational and mathematical methods. Volume 3:Issue 6(2021)
- Journal:
- Computational and mathematical methods
- Issue:
- Volume 3:Issue 6(2021)
- Issue Display:
- Volume 3, Issue 6 (2021)
- Year:
- 2021
- Volume:
- 3
- Issue:
- 6
- Issue Sort Value:
- 2021-0003-0006-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2021-10-26
- Subjects:
- big data mining -- big data sampling -- big data clustering -- Euclidean distance based stratum -- random sampling -- sample extension -- SSK‐Means -- stratified sampling
Mathematics -- Data processing -- Periodicals
Numerical analysis -- Periodicals
Numerical analysis
Mathematics -- Data processing
Periodicals
004.0151 - Journal URLs:
- https://onlinelibrary.wiley.com/loi/25777408 ↗
https://www.hindawi.com/journals/cmm/ ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1002/cmm4.1206 ↗
- Languages:
- English
- ISSNs:
- 2577-7408
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3390.572700
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 20767.xml