Logical-linguistic model for multilingual Open Information Extraction. Issue 1 (1st January 2020)
- Record Type:
- Journal Article
- Title:
- Logical-linguistic model for multilingual Open Information Extraction. Issue 1 (1st January 2020)
- Main Title:
- Logical-linguistic model for multilingual Open Information Extraction
- Authors:
- Khairova, Nina
Mamyrbayev, Orken
Mukhsina, Kuralay
Kolesnyk, Anastasiia - Editors:
- Pratap, Saurabh
- Abstract:
- Abstract: Open Information Extraction (OIE) is a modern strategy to extract the triplet of facts from Web-document collections. However, most part of the current OIE approaches is based on NLP techniques such as POS tagging and dependency parsing, which tools are accessible not to all languages. In this paper, we suggest the logical-linguistic model, which basic mathematical means are logical-algebraic equations of finite predicates algebra. These equations allow expressing a semantic role of the participant of a triplet of the fact (Subject-Predicate-Object) due to the relations of grammatical characteristics of words in the sentence. We propose the model that extracts the unlimited domain-independent number of facts from sentences of different languages. The use of our model allows extracting the facts from unstructured texts without requiring a pre-specified vocabulary, by identifying relations in phrases and associated arguments in arbitrary sentences of English, Kazakh, and Russian languages. We evaluate our approach on corpora of three languages based on English and Kazakh bilingual news websites. We achieve the precision of facts extraction over 87% for English corpus, over 82% for Russian corpus and 71% for Kazakh corpus.
- Is Part Of:
- Cogent engineering. Volume 7:Issue 1(2020)
- Journal:
- Cogent engineering
- Issue:
- Volume 7:Issue 1(2020)
- Issue Display:
- Volume 7, Issue 1 (2020)
- Year:
- 2020
- Volume:
- 7
- Issue:
- 1
- Issue Sort Value:
- 2020-0007-0001-0000
- Page Start:
- Page End:
- Publication Date:
- 2020-01-01
- Subjects:
- Open Information Extraction -- fact extraction from unstructured texts -- Kazakh bilingual news websites -- criminal subject -- logical-linguistic model -- finite predicates algebra
Engineering -- Periodicals
Technology -- Periodicals
Engineering
Technology
Periodicals
620 - Journal URLs:
- http://bibpurl.oclc.org/web/73324 ↗
http://cogentoa.tandfonline.com/journal/oaen20 ↗
http://www.tandfonline.com/toc/oaen20/1/1 ↗
http://www.tandfonline.com/ ↗
http://cogentoa.tandfonline.com/journal/oaps20 ↗ - DOI:
- 10.1080/23311916.2020.1714829 ↗
- Languages:
- English
- ISSNs:
- 2331-1916
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21972.xml