Mining annotators' common knowledge for automatic text revision. (18th May 2019)
- Record Type:
- Journal Article
- Title:
- Mining annotators' common knowledge for automatic text revision. (18th May 2019)
- Main Title:
- Mining annotators' common knowledge for automatic text revision
- Authors:
- Siragusa, Giovanni
Caro, Luigi Di
Tosalli, Marco - Abstract:
- Many natural language understanding tasks require clean input textual data in order to train systems with the highest precision. Such data, usually collected from surveys or the web, are manually processed in order to remove morphosyntactic variability, spelling errors and incoherence in naming entities. Since these operations are conducted by domain experts and annotators, they are usually costly and time-consuming. Furthermore, this scenario is very common in industrial tasks where annotators are hired. In this context, we propose an innovative and simple method that extracts correction patterns, i.e., <expression, replacement> pairs, where expression is a matching string and replacement indicates how to re-write the matched string. Such tool can be used both to evaluate annotators (since it provides a deep understanding of their work) and to automatically revise the texts. We extensively tested our method in a multilingual setting, obtaining outstanding results over baseline approaches.
- Is Part Of:
- International journal of metadata, semantics and ontologies. Volume 13:Number 3(2019)
- Journal:
- International journal of metadata, semantics and ontologies
- Issue:
- Volume 13:Number 3(2019)
- Issue Display:
- Volume 13, Issue 3 (2019)
- Year:
- 2019
- Volume:
- 13
- Issue:
- 3
- Issue Sort Value:
- 2019-0013-0003-0000
- Page Start:
- 254
- Page End:
- 263
- Publication Date:
- 2019-05-18
- Subjects:
- pattern extraction -- natural language understanding -- annotation learning -- correction patterns
Metadata -- Periodicals
Semantic Web -- Periodicals
Ontologies (Information retrieval) -- Periodicals
Data structures (Computer science) -- Periodicals
Information theory -- Periodicals
005.74 - Journal URLs:
- http://www.inderscience.com/browse/index.php?journalID=152 ↗
http://www.inderscience.com/ ↗ - Languages:
- English
- ISSNs:
- 1744-2621
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 10644.xml