A new method to extract n-Ary relation instances from scientific documents. (15th December 2022)
- Record Type:
- Journal Article
- Title:
- A new method to extract n-Ary relation instances from scientific documents. (15th December 2022)
- Main Title:
- A new method to extract n-Ary relation instances from scientific documents
- Authors:
- Lentschat, Martin
Buche, Patrice
Dibie-Barthelemy, Juliette
Roche, Mathieu - Abstract:
- Abstract: A new method to extract knowledge structured as n-Ary relations from scientific articles is presented. We designed and assessed different approaches to reconstruct instances of n-Ary relations extracted from scientific articles in experimental domains, driven by an Ontological and Terminological Resource (OTR) and based on multi-feature representation of relations and their arguments. The proposed method starts with the identification of partial n-Ary relations in tables of scientific articles and then seeks to reconstruct them with argument instances in the article texts. Based on the so-called Scientific Publication Representation (SciPuRe) of textual arguments and Scientific Table Representation (STaRe) of n-Ary relations representation of an n-Ary relation called STaRe (Scientific Table Representation, originating from partial n-Ary relations extracted from document tables), here we propose and evaluate different approaches for the selection of textual argument instances that could complement partial n-Ary relations: structural, frequentist and word embedding models. The application domain concerns food packaging, especially composition and permeability data. Experiments were conducted on a corpus of 332 relation instances composed of 1547 arguments. Corpora of full and partial relations recognized in document tables and argument instances extracted from texts are available online. Different methods and strategies were measured with an f-score ranging from . 34Abstract: A new method to extract knowledge structured as n-Ary relations from scientific articles is presented. We designed and assessed different approaches to reconstruct instances of n-Ary relations extracted from scientific articles in experimental domains, driven by an Ontological and Terminological Resource (OTR) and based on multi-feature representation of relations and their arguments. The proposed method starts with the identification of partial n-Ary relations in tables of scientific articles and then seeks to reconstruct them with argument instances in the article texts. Based on the so-called Scientific Publication Representation (SciPuRe) of textual arguments and Scientific Table Representation (STaRe) of n-Ary relations representation of an n-Ary relation called STaRe (Scientific Table Representation, originating from partial n-Ary relations extracted from document tables), here we propose and evaluate different approaches for the selection of textual argument instances that could complement partial n-Ary relations: structural, frequentist and word embedding models. The application domain concerns food packaging, especially composition and permeability data. Experiments were conducted on a corpus of 332 relation instances composed of 1547 arguments. Corpora of full and partial relations recognized in document tables and argument instances extracted from texts are available online. Different methods and strategies were measured with an f-score ranging from . 34 to . 74 . These results show that n-Ary relations reconstruction approach depends on the number of selected candidate argument instances. Highlights: A generic method to extract n-Ary relations based on a domain ontology. An approach using partial n-Ary relation in tables completed by text arguments. Multi-criteria approaches to complete partial n-Ary relation with argument instances. A complete pipeline for the extraction of n-Ary relations driven by an OTR. A corpus in the food packaging field to assess the method. … (more)
- Is Part Of:
- Expert systems with applications. Volume 209(2022)
- Journal:
- Expert systems with applications
- Issue:
- Volume 209(2022)
- Issue Display:
- Volume 209, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 209
- Issue:
- 2022
- Issue Sort Value:
- 2022-0209-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-12-15
- Subjects:
- N-ary relations -- Natural language processing -- Knowledge extraction -- Information extraction -- Ontological and terminological resource -- Smart data
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2022.118332 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 23342.xml