Ontology population for open‐source intelligence: A GATE‐based solution. (16th September 2018)
- Record Type:
- Journal Article
- Title:
- Ontology population for open‐source intelligence: A GATE‐based solution. (16th September 2018)
- Main Title:
- Ontology population for open‐source intelligence: A GATE‐based solution
- Authors:
- Ganino, Giulio
- Lembo, Domenico
- Mecella, Massimo
- Scafoglieri, Federico
- Abstract:
- Summary: Open‐Source INTelligence is intelligence based on publicly available sources such as news sites, blogs, forums, etc. The Web is the primary source of information, but once data are crawled, they need to be interpreted and structured. Ontologies may play a crucial role in this process, but because of the vast amount of documents available, automatic mechanisms for their population are needed, starting from the crawled text. This paper presents an approach for the automatic population of predefined ontologies with data extracted from text and discusses the design and realization of a pipeline based on the General Architecture for Text Engineering system, which is interesting for both researchers and practitioners in the field. Some experimental results that are encouraging in terms of extracted correct instances of the ontology are also reported. Furthermore, the paper also describes an alternative approach and provides additional experiments for one of the phases of our pipeline, which requires the use of predefined dictionaries for relevant entities. Through such a variant, the manual workload required in this phase was reduced, still obtaining promising results.
- Is Part Of:
- Software, practice & experience. Volume 48:Number 12(2018)
- Journal:
- Software, practice & experience
- Issue:
- Volume 48:Number 12(2018)
- Issue Display:
- Volume 48, Issue 12 (2018)
- Year:
- 2018
- Volume:
- 48
- Issue:
- 12
- Issue Sort Value:
- 2018-0048-0012-0000
- Page Start:
- 2302
- Page End:
- 2330
- Publication Date:
- 2018-09-16
- Subjects:
- general architecture for text engineering (GATE) -- information extraction -- internet as a data source -- ontology population
- Computer software -- Periodicals
- Computer programming -- Periodicals
- Computer programs -- Periodicals
- 005.3
- Journal URLs:
- http://onlinelibrary.wiley.com/
- DOI:
- 10.1002/spe.2640
- Languages:
- English
- ISSNs:
- 0038-0644
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 8321.453000
- British Library DSC - BLDSS-3PM
- British Library STI - ELD Digital store
- Ingest File:
- 8482.xml