Clustering biomedical and gene expression datasets with kernel density and unique neighborhood set based vein detection. (July 2020)
- Record Type:
- Journal Article
- Title:
- Clustering biomedical and gene expression datasets with kernel density and unique neighborhood set based vein detection. (July 2020)
- Main Title:
- Clustering biomedical and gene expression datasets with kernel density and unique neighborhood set based vein detection
- Authors:
- Rahman, Md Anisur
Ang, Li-Minn
Seng, Kah Phooi - Abstract:
- Abstract: It is a crucial need for a clustering technique to produce high-quality clusters from biomedical and gene expression datasets without requiring any user inputs. Therefore, in this paper we present a clustering technique called KUVClust that produces high-quality clusters when applied on biomedical and gene expression datasets without requiring any user inputs. The KUVClust algorithm uses three concepts namely multivariate kernel density estimation, unique closest neighborhood set and vein-based clustering. Although these concepts are known in the literature, KUVClust combines the concepts in a novel manner to achieve high-quality clustering results. The performance of KUVClust is compared with established clustering techniques on real-world biomedical and gene expression datasets. The comparisons were evaluated in terms of three criteria (purity, entropy, and sum of squared error (SSE)). Experimental results demonstrated the superiority of the proposed technique over the existing techniques for clustering both the low dimensional biomedical and high dimensional gene expressions datasets used in the experiments. Highlights: KUVClust does not require any user input to produce high quality clusters. KUVClust produces high quality clusters from biomedical & gene expression datasets. KUVClust is combination of three concepts namely Vein-based clustering, KDE and UNS. KUVClust works well on both high dimensional and low dimensional datasets.
- Is Part Of:
- Information systems. Volume 91(2020)
- Journal:
- Information systems
- Issue:
- Volume 91(2020)
- Issue Display:
- Volume 91, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 91
- Issue:
- 2020
- Issue Sort Value:
- 2020-0091-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-07
- Subjects:
- Clustering -- Biomedical -- Gene expression -- Kernel density estimation -- Vein-based clustering -- Unique neighborhood set
Database management -- Periodicals
Electronic data processing -- Periodicals
Bases de données -- Gestion -- Périodiques
Informatique -- Périodiques
Database management
Electronic data processing
Periodicals
005.7 - Journal URLs:
- http://www.sciencedirect.com/science/journal/03064379 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.is.2020.101490 ↗
- Languages:
- English
- ISSNs:
- 0306-4379
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.367300
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 13537.xml