Extraction of phenotypic traits from taxonomic descriptions for the tree of life using natural language processing. (31st March 2018)
- Record Type:
- Journal Article
- Title:
- Extraction of phenotypic traits from taxonomic descriptions for the tree of life using natural language processing. (31st March 2018)
- Main Title:
- Extraction of phenotypic traits from taxonomic descriptions for the tree of life using natural language processing
- Authors:
- Endara, Lorena
Cui, Hong
Burleigh, J. Gordon
- Abstract:
- Premise of the Study: Phenotypic data sets are necessary to elucidate the genealogy of life, but assembling phenotypic data for taxa across the tree of life can be technically challenging and prohibitively time consuming. We describe a semi‐automated protocol to facilitate and expedite the assembly of phenotypic character matrices of plants from formal taxonomic descriptions. This pipeline uses new natural language processing (NLP) techniques and a glossary of over 9000 botanical terms. Methods and Results: Our protocol includes the Explorer of Taxon Concepts (ETC), an online application that assembles taxon‐by‐character matrices from taxonomic descriptions, and MatrixConverter, a Java application that enables users to evaluate and discretize the characters extracted by ETC. We demonstrate this protocol using descriptions from Araucariaceae. Conclusions: The NLP pipeline unlocks the phenotypic data found in taxonomic descriptions and makes them usable for evolutionary analyses.
- Is Part Of:
- Applications in plant sciences. Volume 6:Number 3(2018)
- Journal:
- Applications in plant sciences
- Issue:
- Volume 6:Number 3(2018)
- Issue Display:
- Volume 6, Issue 3 (2018)
- Year:
- 2018
- Volume:
- 6
- Issue:
- 3
- Issue Sort Value:
- 2018-0006-0003-0000
- Page Start:
- n/a
- Page End:
- n/a
- Publication Date:
- 2018-03-31
- Subjects:
- morphological matrices -- natural language processing -- phenotypic traits -- taxonomic descriptions
Plants -- Periodicals
Plant physiology -- Periodicals
Plant Physiological Phenomena
Plant physiology
Plants
Periodicals
Fulltext
Internet Resources
580
- Journal URLs:
- http://bibpurl.oclc.org/web/83301
http://onlinelibrary.wiley.com/journal/10.1002/(ISSN)2168-0450
http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/aps3.1035
- Languages:
- English
- ISSNs:
- 2168-0450
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 6306.xml