Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications. (13th December 2021)
- Record Type:
- Journal Article
- Title:
- Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications. (13th December 2021)
- Main Title:
- Towards combined semantic and lexical scores based on a new representation of textual data to extract experimental data from scientific publications
- Authors:
- Lentschat, Martin
Buche, Patrice
Dibie-Barthelemy, Juliette
Roche, Mathieu - Abstract:
- This article presents an ontological and terminological resource guided process for targeted extraction of scientific experimental data. Our method relies on the scientific publication representation ( SciPuRe ) describing the extracted data through ontological, lexical and structural (using segments in the scientific documents) features. Relevance scores based on these features are computed to rank the results and filter out the numerous false positives. Linear and sequential combinations of these scores are presented and evaluated. Experiments were carried out on a corpus of 50 English language scientific papers in the food packaging field. They revealed that article segment are an effective criterion for filtering out a majority of the quantitative entity false positives using lexical scores. Moreover the best symbolic entity extraction results were obtained with a sequential combinations of semantic and lexical scores. These results enable the ranking of entities by relevance and the filtering of false positive results.
- Is Part Of:
- International journal of intelligent information and database systems. Volume 15:Number 1(2022)
- Journal:
- International journal of intelligent information and database systems
- Issue:
- Volume 15:Number 1(2022)
- Issue Display:
- Volume 15, Issue 1 (2022)
- Year:
- 2022
- Volume:
- 15
- Issue:
- 1
- Issue Sort Value:
- 2022-0015-0001-0000
- Page Start:
- 78
- Page End:
- 103
- Publication Date:
- 2021-12-13
- Subjects:
- data extraction -- data relevance -- data representation -- ontological and terminological resource -- OTR -- information retrieval -- web scientific documents
Database management -- Computer programs -- Periodicals
Information retrieval -- Computer programs -- Periodicals
Information storage and retrieval systems -- Computer programs -- Periodicals
Artificial intelligence -- Periodicals
Expert systems (Computer science) -- Periodicals
Intelligent agents (Computer software) -- Periodicals
006.33 - Journal URLs:
- http://www.inderscience.com/jhome.php?jcode=ijiids ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1751-5858
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 18151.xml