MrIML: Multi‐response interpretable machine learning to model genomic landscapes. (6th September 2021)
- Record Type:
- Journal Article
- Title:
- MrIML: Multi‐response interpretable machine learning to model genomic landscapes. (6th September 2021)
- Main Title:
- MrIML: Multi‐response interpretable machine learning to model genomic landscapes
- Authors:
- Fountain‐Jones, Nicholas M.
Kozakiewicz, Christopher P.
Forester, Brenna R.
Landguth, Erin L.
Carver, Scott
Charleston, Michael
Gagne, Roderick B.
Greenwell, Brandon
Kraberger, Simona
Trumbo, Daryl R.
Mayer, Michael
Clark, Nicholas J.
Machado, Gustavo - Other Names:
- Fountain‐Jones Nicholas M. guestEditor.
Smith Megan L. guestEditor.
Austerlitz Frédéric guestEditor. - Abstract:
- Abstract: We introduce a new R package "MrIML" ("Mister iml"; Multi‐response Interpretable Machine Learning). MrIML provides a powerful and interpretable framework that enables users to harness recent advances in machine learning to quantify multilocus genomic relationships, to identify loci of interest for future landscape genetics studies, and to gain new insights into adaptation across environmental gradients. Relationships between genetic variation and environment are often nonlinear and interactive; these characteristics have been challenging to address using traditional landscape genetic approaches. Our package helps capture this complexity and offers functions that fit and interpret a wide range of highly flexible models that are routinely used for single‐locus landscape genetics studies but are rarely extended to estimate response functions for multiple loci. To demonstrate the package's broad functionality, we test its ability to recover landscape relationships from simulated genomic data. We also apply the package to two empirical case studies. In the first, we model genetic variation of North American balsam poplar ( Populus balsamifera, Salicaceae) populations across environmental gradients. In the second case study, we recover the landscape and host drivers of feline immunodeficiency virus genetic variation in bobcats ( Lynx rufus ). The ability to model thousands of loci collectively and compare models from linear regression to extreme gradient boosting, withinAbstract: We introduce a new R package "MrIML" ("Mister iml"; Multi‐response Interpretable Machine Learning). MrIML provides a powerful and interpretable framework that enables users to harness recent advances in machine learning to quantify multilocus genomic relationships, to identify loci of interest for future landscape genetics studies, and to gain new insights into adaptation across environmental gradients. Relationships between genetic variation and environment are often nonlinear and interactive; these characteristics have been challenging to address using traditional landscape genetic approaches. Our package helps capture this complexity and offers functions that fit and interpret a wide range of highly flexible models that are routinely used for single‐locus landscape genetics studies but are rarely extended to estimate response functions for multiple loci. To demonstrate the package's broad functionality, we test its ability to recover landscape relationships from simulated genomic data. We also apply the package to two empirical case studies. In the first, we model genetic variation of North American balsam poplar ( Populus balsamifera, Salicaceae) populations across environmental gradients. In the second case study, we recover the landscape and host drivers of feline immunodeficiency virus genetic variation in bobcats ( Lynx rufus ). The ability to model thousands of loci collectively and compare models from linear regression to extreme gradient boosting, within the same analytical framework, has the potential to be transformative. The MrIML framework is also extendable and not limited to modelling genetic variation; for example, it can quantify the environmental drivers of microbiomes and coinfection dynamics. … (more)
- Is Part Of:
- Molecular ecology resources. Volume 21:Number 8(2021)
- Journal:
- Molecular ecology resources
- Issue:
- Volume 21:Number 8(2021)
- Issue Display:
- Volume 21, Issue 8 (2021)
- Year:
- 2021
- Volume:
- 21
- Issue:
- 8
- Issue Sort Value:
- 2021-0021-0008-0000
- Page Start:
- 2766
- Page End:
- 2781
- Publication Date:
- 2021-09-06
- Subjects:
- artificial intelligence -- community ecology -- ecological genetics -- gradient boosting models -- landscape genetics -- random forest
Molecular ecology -- Periodicals
572.8 - Journal URLs:
- http://onlinelibrary.wiley.com/journal/10.1111/(ISSN)1755-0998 ↗
http://onlinelibrary.wiley.com/ ↗ - DOI:
- 10.1111/1755-0998.13495 ↗
- Languages:
- English
- ISSNs:
- 1755-098X
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5900.817368
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 20036.xml