An Effective Multi-clustering Anonymization Approach Using Discrete Component Task for Non Binary High Dimensional Data Spaces. (2016)
- Record Type:
- Journal Article
- Title:
- An Effective Multi-clustering Anonymization Approach Using Discrete Component Task for Non Binary High Dimensional Data Spaces. (2016)
- Main Title:
- An Effective Multi-clustering Anonymization Approach Using Discrete Component Task for Non Binary High Dimensional Data Spaces
- Authors:
- Shalin, L.V. Arun
Prasadh, K.
- Abstract:
- Clustering in common is a process of grouping elements together, so that the elements assigned to the same cluster are more comparable to each other than the remaining data points. Certain difficulties related to dealing with high dimensional data are ubiquitous and abundant. Research works conducted using anonymization method for high dimensional data spaces failed to address the problem related to dimensionality reduction for non binary databases. In this paper, Discrete Component Task Specific Multi-Clustering (DCTSM) approach is presented for dimensionality reduction on non binary database. To start with the analysis of attribute in the non binary database takes place and the process of projecting clusters identifies sparseness degree of dimensions. Then with the quantum distribution on multi cluster dimension, the solution for relevancy of attribute and redundancy on non-binary data spaces is provided. As a result, dimensionality reduction on non binary data leads to performance improvement on the basis of tag based feature. Multi clustering tag based feature reduction extracts individual features and are correspondingly replaced by the equivalent feature clusters (i.e.) tag clusters. During training, the DCTSM approach, multi clusters are used instead of the individual tag features and then during decoding the individual features are replaced by the corresponding multi clusters. To measure the effectiveness of the method, experiments are conducted on existing anonymization method for high dimensional data spaces and compared with the DCTSM approach using Statlog German Credit Data Set. DCTSM approach obtained results of 7.05% improved accuracy and was observed that it took minimal time during tag feature extraction and resulted in lesser error rate.
- Is Part Of:
- Procedia technology. Volume 25(2016)
- Journal:
- Procedia technology
- Issue:
- Volume 25(2016)
- Issue Display:
- Volume 25, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 25
- Issue:
- 2016
- Issue Sort Value:
- 2016-0025-2016-0000
- Page Start:
- 208
- Page End:
- 215
- Publication Date:
- 2016
- Subjects:
- High-Dimensional Data Space -- Non-Binary Database -- Discrete Component Task Specific -- Quantum Distribution -- Dimensionality Reduction
Technology -- Congresses
Technology -- Periodicals
Engineering -- Congresses
Engineering -- Periodicals
Engineering
Technology
Conference proceedings
Periodicals
605
- Journal URLs:
- http://www.sciencedirect.com/science/journal/22120173
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.protcy.2016.08.099
- Languages:
- English
- ISSNs:
- 2212-0173
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 7363.xml