A survey of diacritic restoration in abjad and alphabet writing systems. (20th November 2017)
- Record Type:
- Journal Article
- Title:
- A survey of diacritic restoration in abjad and alphabet writing systems. (20th November 2017)
- Main Title:
- A survey of diacritic restoration in abjad and alphabet writing systems
- Authors:
- ASAHIAH, FRANKLIN ỌLÁDIÍPỌ̀
ỌDẸ́JỌBÍ, ỌDẸ́TÚNJÍ ÀJÀDÍ
ADÁGÚNODÒ, EMMANUEL RÓTÌMÍ - Abstract:
- Abstract: A diacritic is a mark placed near or through a character to alter its original phonetic or orthographic value. Many languages around the world use diacritics in their orthography, whatever the writing system the orthography is based on. In many languages, diacritics are ignored either by convention or as a matter of convenience. For users who are not familiar with the text domain, the absence of diacritics within text has been known to cause mild to serious readability and comprehension problems. However, the absence of diacritics in text causes near-intractable problems for natural language processing systems. This situation has led to extensive research on diacritization. Several techniques have been applied to address diacritic restoration (or diacritization) but the existing surveys of techniques have been restricted to some languages and hence left gaps for practitioners to fill. Our survey examined diacritization from the angle of resources deployed and various formulation employed for diacritization. It was concluded by recommending that (a) any proposed technique for diacritization should consider the language features and the purpose served by diacritics, (b) that evaluation metrics needed to be more rigorously defined for easy comparison of performance of models.
- Is Part Of:
- Natural language engineering. Volume 24:Part 1(2018)
- Journal:
- Natural language engineering
- Issue:
- Volume 24:Part 1(2018)
- Issue Display:
- Volume 24, Issue 1, Part 1 (2018)
- Year:
- 2018
- Volume:
- 24
- Issue:
- 1
- Part:
- 1
- Issue Sort Value:
- 2018-0024-0001-0001
- Page Start:
- 123
- Page End:
- 154
- Publication Date:
- 2017-11-20
- Subjects:
- Natural language processing (Computer science) -- Periodicals
Software engineering -- Periodicals
006.35 - Journal URLs:
- http://journals.cambridge.org/action/displayJournal?jid=NLE ↗
- DOI:
- 10.1017/S1351324917000407 ↗
- Languages:
- English
- ISSNs:
- 1351-3249
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 5947.xml