A method for archaeological and dendrochronological concept annotation using domain knowledge in information extraction. (18th May 2022)
- Record Type:
- Journal Article
- Title:
- A method for archaeological and dendrochronological concept annotation using domain knowledge in information extraction. (18th May 2022)
- Main Title:
- A method for archaeological and dendrochronological concept annotation using domain knowledge in information extraction
- Authors:
- Vlachidis, Andreas
Tudhope, Douglas - Abstract:
- Advances in Natural Language Processing allow the process of deriving information from large volumes of text to be automated. Attention is turned to one of the most important, but traditionally difficult to access resources in archaeology, commonly known as 'grey literature'. This paper presents the development of two separate Named-Entity Recognition (NER) pipelines aimed at the extraction of Archaeological and of Dendrochronological concepts in Dutch, respectively. The role of domain vocabulary is discussed for the development of a Knowledge Organisation System (KOS)-driven, Rule-Based method of NER which makes complementary use of ontology, thesauri and domain vocabulary for information extraction and attribute assignment of semantic annotations. The NER task is challenged by a series of domain and language-oriented aspects and evaluated against a human-annotated Gold Standard. The results suggest the suitability of Rule-based KOS driven approaches for attaining the low-hanging fruits of NER, using a combination of quality vocabulary and rules.
- Is Part Of:
- International journal of metadata, semantics and ontologies. Volume 15:Number 3(2021)
- Journal:
- International journal of metadata, semantics and ontologies
- Issue:
- Volume 15:Number 3(2021)
- Issue Display:
- Volume 15, Issue 3 (2021)
- Year:
- 2021
- Volume:
- 15
- Issue:
- 3
- Issue Sort Value:
- 2021-0015-0003-0000
- Page Start:
- 192
- Page End:
- 203
- Publication Date:
- 2022-05-18
- Subjects:
- information extraction -- knowledge organisation systems -- named entity recognition -- archaeology -- dendrochronology -- grey literature -- semantic annotation
Metadata -- Periodicals
Semantic Web -- Periodicals
Ontologies (Information retrieval) -- Periodicals
Data structures (Computer science) -- Periodicals
Information theory -- Periodicals
005.74 - Journal URLs:
- http://www.inderscience.com/browse/index.php?journalID=152 ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1744-2621
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 21569.xml