CProtMEDIAS: clustering of amino acid sequences encoded by gene families by MErging and DIgitizing Aligned Sequences. Issue 4 (15th July 2022)
- Record Type:
- Journal Article
- Title:
- CProtMEDIAS: clustering of amino acid sequences encoded by gene families by MErging and DIgitizing Aligned Sequences. Issue 4 (15th July 2022)
- Main Title:
- CProtMEDIAS: clustering of amino acid sequences encoded by gene families by MErging and DIgitizing Aligned Sequences
- Authors:
- Zhang, Zhe
Zhu, Miaomiao
Xie, Qi
Larkin, Robert M
Shi, Xueping
Zheng, Bo - Abstract:
- Abstract: Protein phylogenetic analysis focuses on the evolutionary relationships among related protein sequences and can help researchers infer protein functions and developmental trajectories. With the advent of the big data era, the existing protein phylogenetic methods, including distance matrix and character-based methods, are facing challenges in both running time and application scope. Here, we developed an R package that we call CProtMEDIAS that is useful for protein phylogenetic analysis. In contrast to existing phylogenetic analysis methods, CProtMEDIAS utilizes dimensionality reduction algorithms to digitize multiple sequence alignments and quickly conduct phylogenetic analysis with a large number of amino acid sequences from similarly distant protein families and species. We used CProtMEDIAS to perform a dimensionality reduction, clustering, pseudotime, specific residue and evolutionary trajectory analysis of the plant homeobox superfamily. We found that CProtMEDIAS delivers consistent clustering, fast running and elegant presentation and thus provides powerful new tools and methods for protein clustering and evolutionary analysis.
- Is Part Of:
- Briefings in bioinformatics. Volume 23:Issue 4(2022)
- Journal:
- Briefings in bioinformatics
- Issue:
- Volume 23:Issue 4(2022)
- Issue Display:
- Volume 23, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 23
- Issue:
- 4
- Issue Sort Value:
- 2022-0023-0004-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-07-15
- Subjects:
- phylogenetic analysis -- amino acid sequence -- sequence digitization -- dimensionality reduction -- developmental trajectory inference
Genetics -- Data processing -- Periodicals
Molecular biology -- Data processing -- Periodicals
Genomes -- Data processing -- Periodicals
572.80285 - Journal URLs:
- http://bib.oxfordjournals.org ↗
http://www.oxfordjournals.org/content?genre=journal&issn=1477-4054 ↗
http://ukcatalogue.oup.com/ ↗
http://firstsearch.oclc.org ↗ - DOI:
- 10.1093/bib/bbac276 ↗
- Languages:
- English
- ISSNs:
- 1467-5463
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 2283.958363
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22546.xml