Advances in scientific literature mining for interpreting materials characterization. Issue 4 (15th July 2021)
- Record Type:
- Journal Article
- Title:
- Advances in scientific literature mining for interpreting materials characterization. Issue 4 (15th July 2021)
- Main Title:
- Advances in scientific literature mining for interpreting materials characterization
- Authors:
- Park, Gilchan
Pouchard, Line - Abstract:
- Abstract: Using synchrotron light sources, such as the National Synchrotron Light Source II at Brookhaven National Laboratory, scientists in fields as diverse as physics, biology, and materials science, identify the atomic structure, chemical composition, or other important properties of varied specimens. x-ray spectroscopy from light sources is particularly valuable for materials research with vast information available about reference spectra in the scientific literature. However, as the technique is applicable to many science domains, searching for information about select x-ray spectroscopy spectra is impeded by the sheer number of publications. Moreover, useful information about the context of an experiment or figures presented in papers can be buried among the details, which takes time to assess. This work presents a scientific literature mining system that supports data acquisition, information extraction, and user interaction for referencing x-ray spectra identification and spectral interpretation. The goal is to provide efficient access to useful spectral data to researchers who may spend only a few days at a synchrotron light source. With this system, users browse a classification tree for papers arranged according to x-ray spectroscopic methods, chemical elements, and x-ray absorption spectroscopy edges. Relevant figures are extracted with sentences from the paper that explain them, known as 'figure explanatory text.' Notably, this system focuses on semanticAbstract: Using synchrotron light sources, such as the National Synchrotron Light Source II at Brookhaven National Laboratory, scientists in fields as diverse as physics, biology, and materials science, identify the atomic structure, chemical composition, or other important properties of varied specimens. x-ray spectroscopy from light sources is particularly valuable for materials research with vast information available about reference spectra in the scientific literature. However, as the technique is applicable to many science domains, searching for information about select x-ray spectroscopy spectra is impeded by the sheer number of publications. Moreover, useful information about the context of an experiment or figures presented in papers can be buried among the details, which takes time to assess. This work presents a scientific literature mining system that supports data acquisition, information extraction, and user interaction for referencing x-ray spectra identification and spectral interpretation. The goal is to provide efficient access to useful spectral data to researchers who may spend only a few days at a synchrotron light source. With this system, users browse a classification tree for papers arranged according to x-ray spectroscopic methods, chemical elements, and x-ray absorption spectroscopy edges. Relevant figures are extracted with sentences from the paper that explain them, known as 'figure explanatory text.' Notably, this system focuses on semantic aspects (logical analysis) to find figure explanatory text using deep contextualized word embeddings techniques and contains an interface to obtain labeled data from domain experts that is used to evaluate and improve the model. … (more)
- Is Part Of:
- Machine learning: science and technology. Volume 2:Issue 4(2021)
- Journal:
- Machine learning: science and technology
- Issue:
- Volume 2:Issue 4(2021)
- Issue Display:
- Volume 2, Issue 4 (2021)
- Year:
- 2021
- Volume:
- 2
- Issue:
- 4
- Issue Sort Value:
- 2021-0002-0004-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-07-15
- Subjects:
- XAS -- x-ray spectroscopy -- figure explanatory text -- scientific literature mining -- text extraction -- deep learning -- natural language processing
006.31 - Journal URLs:
- https://iopscience.iop.org/journal/2632-2153 ↗
- DOI:
- 10.1088/2632-2153/abf751 ↗
- Languages:
- English
- ISSNs:
- 2632-2153
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 17566.xml