Estimating data stream tendencies to adapt clustering parameters. (2018)
- Record Type:
- Journal Article
- Title:
- Estimating data stream tendencies to adapt clustering parameters. (2018)
- Main Title:
- Estimating data stream tendencies to adapt clustering parameters
- Authors:
- Albertini, Marcelo Keese
Mello, Rodrigo Fernandes De - Abstract:
- A wide-range of applications based on processing of data streams have emerged in the last decade. They require specialised techniques to obtain representative models and extract information. Traditional data clustering algorithms have been adapted to include continuously arriving data by updating the current model. Most of data stream clustering algorithms aggregate new data into models according to parameters usually set by users. Problems arise when choosing the values of given parameters. When the phenomenon under study is stable, an analysis of a sample of the data stream or a priori knowledge can be used. However, when the behaviour changes over collection, parameters become obsolete and, consequently, the performance is degraded. In this paper, we study the problem of how to automatically adapt control parameters of data stream clustering algorithms. In this sense, we introduce a novel approach to estimate and use data tendencies in order to automatically modify control parameters. We present a proof of the convergence of our approach towards an ideal and unknown value of the control parameter. Experimental results confirm the estimation of data tendency improves learning control parameterisation.
- Is Part Of:
- International journal of high performance computing and networking. Volume 11:Number 1(2018)
- Journal:
- International journal of high performance computing and networking
- Issue:
- Volume 11:Number 1(2018)
- Issue Display:
- Volume 11, Issue 1 (2018)
- Year:
- 2018
- Volume:
- 11
- Issue:
- 1
- Issue Sort Value:
- 2018-0011-0001-0000
- Page Start:
- 34
- Page End:
- 44
- Publication Date:
- 2018
- Subjects:
- big data -- data clustering -- data stream -- data sequence -- adaptive clustering -- data analysis
High performance computing -- Periodicals
Computer networks -- Periodicals
High performance computing
Periodicals
004.05 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijhpcn ↗
http://www.metapress.com/openurl.asp?genre=journal&issn=1740-0562 ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1740-0562
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 9032.xml