A modified Fuzzy k-Partition based on indiscernibility relation for categorical data clustering. (August 2016)
- Record Type:
- Journal Article
- Title:
- A modified Fuzzy k-Partition based on indiscernibility relation for categorical data clustering. (August 2016)
- Main Title:
- A modified Fuzzy k-Partition based on indiscernibility relation for categorical data clustering
- Authors:
- Yanto, Iwan Tri Riyadi
Ismail, Maizatul Akmar
Herawan, Tutut
- Abstract:
- Categorical data clustering has been adopted by many scientific communities to classify objects from large databases. To classify the objects, the Fuzzy k-Partition approach has been proposed for categorical data clustering. However, existing Fuzzy k-Partition approaches suffer from high computational time and low clustering accuracy. Moreover, the parameter that maximizes the classification likelihood function in the Fuzzy k-Partition approach will always have the same categories, hence producing the same results. To overcome these issues, we propose a modified Fuzzy k-Partition based on the indiscernibility relation. The indiscernibility relation induces an approximation space constructed from equivalence classes of indiscernible objects, so it can be applied to classify categorical data. The novelty of the proposed approach is that, unlike previous approaches that use the likelihood function of multivariate multinomial distributions, it is based on the indiscernibility relation. We performed an extensive theoretical analysis of the proposed approach to show its effectiveness in achieving lower computational complexity. Further, we compared the proposed approach with the Fuzzy Centroid and Fuzzy k-Partition approaches in terms of response time and clustering accuracy on several UCI benchmark and real-world datasets. The results show that the proposed approach achieves lower response time and higher clustering accuracy than other Fuzzy k-based approaches. …
- Is Part Of:
- Engineering applications of artificial intelligence. Volume 53(2016:May)
- Journal:
- Engineering applications of artificial intelligence
- Issue:
- Volume 53(2016:May)
- Issue Display:
- Volume 53 (2016)
- Year:
- 2016
- Volume:
- 53
- Issue Sort Value:
- 2016-0053-0000-0000
- Page Start:
- 41
- Page End:
- 52
- Publication Date:
- 2016-08
- Subjects:
- Clustering -- Categorical data -- Fuzzy k-Partition -- Indiscernibility relation
Engineering -- Data processing -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Ingénierie -- Informatique -- Périodiques
Intelligence artificielle -- Périodiques
Systèmes experts (Informatique) -- Périodiques
Artificial intelligence
Engineering -- Data processing
Expert systems (Computer science)
Periodicals
620.00285
- Journal URLs:
- http://www.sciencedirect.com/science/journal/09521976
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.engappai.2016.01.026
- Languages:
- English
- ISSNs:
- 0952-1976
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3755.704500
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 970.xml