A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages. (6th August 2018)
- Record Type:
- Journal Article
- Title:
- A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages. (6th August 2018)
- Main Title:
- A neural approach for inducing multilingual resources and natural language processing tools for low-resource languages
- Authors:
- ZENNAKI, O.
SEMMAR, N.
BESACIER, L. - Abstract:
- Abstract: This work focuses on the rapid development of linguistic annotation tools for low-resource languages (languages that have no labeled training data). We experiment with several cross-lingual annotation projection methods using recurrent neural networks (RNN) models. The distinctive feature of our approach is that our multilingual word representation requires only a parallel corpus between source and target languages. More precisely, our approach has the following characteristics: (a) it does not use word alignment information, (b) it does not assume any knowledge about target languages (one requirement is that the two languages (source and target) are not too syntactically divergent), which makes it applicable to a wide range of low-resource languages, (c) it provides authentic multilingual taggers (one tagger for N languages). We investigate both uni and bidirectional RNN models and propose a method to include external information (for instance, low-level information from part-of-speech tags) in the RNN to train higher level taggers (for instance, Super Sense taggers). We demonstrate the validity and genericity of our model by using parallel corpora (obtained by manual or automatic translation). Our experiments are conducted to induce cross-lingual part-of-speech and Super Sense taggers. We also use our approach in a weakly supervised context, and it shows an excellent potential for very low-resource settings (less than 1k training utterances).
- Is Part Of:
- Natural language engineering. Volume 25:Part 1(2019)
- Journal:
- Natural language engineering
- Issue:
- Volume 25:Part 1(2019)
- Issue Display:
- Volume 25, Issue 1, Part 1 (2019)
- Year:
- 2019
- Volume:
- 25
- Issue:
- 1
- Part:
- 1
- Issue Sort Value:
- 2019-0025-0001-0001
- Page Start:
- 43
- Page End:
- 67
- Publication Date:
- 2018-08-06
- Subjects:
- Natural language processing (Computer science) -- Periodicals
Software engineering -- Periodicals
006.35 - Journal URLs:
- http://journals.cambridge.org/action/displayJournal?jid=NLE ↗
- DOI:
- 10.1017/S1351324918000293 ↗
- Languages:
- English
- ISSNs:
- 1351-3249
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 9521.xml