Bregmannian consensus clustering for cancer subtypes analysis. (June 2020)
- Record Type:
- Journal Article
- Title:
- Bregmannian consensus clustering for cancer subtypes analysis. (June 2020)
- Main Title:
- Bregmannian consensus clustering for cancer subtypes analysis
- Authors:
- Li, Jianqiang
Xie, Liyang
Xie, Yunshen
Wang, Fei - Abstract:
- Highlights: The highlights of this paper named 'Bregmannian Consensus Clustering for Cancer Subtypes Analysis' can be summarized as follow: Unlike some traditional methods which use Euclidean distance or KL divergence to measure matrix consensus, we adopt a much more general criterion– Bregman divergence, which can include many commonly used distortion measures as its special cases. We extend our framework to formulate the weighted consensus cluster problem, showing that both the unweighted and weighted problems are convex and can be efficiently solved. And we generalize our framework to incorporate pairwise constraints, and show that the resulting problem can be efficiently solved. We apply the new consensus clustering methods to the analysis of molecular profiles for cancer subtypes, aims to develop effective treatments for this aggressive type of cancer. Abstract: Cancer subtype analysis, as an extension of cancer diagnosis, can be regarded as a consensus clustering problem. This analysis is beneficial for providing patients with more accurate treatment. Consensus clustering refers to a situation in which several different clusters have been obtained for a particular data set, and it is desired to aggregate those clustering results to get a better clustering solution. In this paper, we propose to generalize the traditional consensus clustering methods in three manners: (1) We provide Bregmannian consensus clustering (BCC), where the loss between the consensus clusteringHighlights: The highlights of this paper named 'Bregmannian Consensus Clustering for Cancer Subtypes Analysis' can be summarized as follow: Unlike some traditional methods which use Euclidean distance or KL divergence to measure matrix consensus, we adopt a much more general criterion– Bregman divergence, which can include many commonly used distortion measures as its special cases. We extend our framework to formulate the weighted consensus cluster problem, showing that both the unweighted and weighted problems are convex and can be efficiently solved. And we generalize our framework to incorporate pairwise constraints, and show that the resulting problem can be efficiently solved. We apply the new consensus clustering methods to the analysis of molecular profiles for cancer subtypes, aims to develop effective treatments for this aggressive type of cancer. Abstract: Cancer subtype analysis, as an extension of cancer diagnosis, can be regarded as a consensus clustering problem. This analysis is beneficial for providing patients with more accurate treatment. Consensus clustering refers to a situation in which several different clusters have been obtained for a particular data set, and it is desired to aggregate those clustering results to get a better clustering solution. In this paper, we propose to generalize the traditional consensus clustering methods in three manners: (1) We provide Bregmannian consensus clustering (BCC), where the loss between the consensus clustering result and all the input clusterings are generalized from a traditional Euclidean distance to a general Bregman loss; (2) we generalize the BCC to a weighted case, where each input clustering has different weights, providing a better solution for the final clustering result; and (3) we propose a novel semi-supervised consensus clustering, which adds some must-link and cannot-link constraints compared with the first two methods. Then, we obtain three cancer (breast, lung, colorectal cancer) data sets from The Cancer Genome Atlas (TCGA). Each data set has three data types (mRNA, mircoRNA, methylation), and each is respectively used to test the accuracy of the proposed algorithms for clusterings. The experimental results demonstrate that the highest aggregation accuracy of the weighted BCC (WBCC) on cancer data sets is 90.2%. Moreover, although the lowest accuracy is 62.3%, it is higher than other methods on the same data set. Therefore, we conclude that as compared with the competition, our method is more effective. … (more)
- Is Part Of:
- Computer methods and programs in biomedicine. Volume 189(2020)
- Journal:
- Computer methods and programs in biomedicine
- Issue:
- Volume 189(2020)
- Issue Display:
- Volume 189, Issue 2020 (2020)
- Year:
- 2020
- Volume:
- 189
- Issue:
- 2020
- Issue Sort Value:
- 2020-0189-2020-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-06
- Subjects:
- Cancer subtypes analysis -- Consensus Clustering -- Bregman divergence
Medicine -- Computer programs -- Periodicals
Biology -- Computer programs -- Periodicals
Computers -- Periodicals
Medicine -- Periodicals
Médecine -- Logiciels -- Périodiques
Biologie -- Logiciels -- Périodiques
Biology -- Computer programs
Medicine -- Computer programs
Periodicals
Electronic journals
610.28 - Journal URLs:
- http://www.sciencedirect.com/science/journal/01692607 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.cmpb.2020.105337 ↗
- Languages:
- English
- ISSNs:
- 0169-2607
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.095000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 14592.xml