DEXTER: A workbench for automatic term extraction with specialized corpora. (5th October 2017)
- Record Type:
- Journal Article
- Title:
- DEXTER: A workbench for automatic term extraction with specialized corpora. (5th October 2017)
- Main Title:
- DEXTER: A workbench for automatic term extraction with specialized corpora
- Authors:
- PERIÑAN-PASCUAL, CARLOS
- Abstract:
- Automatic term extraction has become a priority area of research within corpus processing. Despite the extensive literature in this field, there are still some outstanding issues that should be dealt with during the construction of term extractors, particularly those oriented to support research in terminology and terminography. In this regard, this article describes the design and development of DEXTER, an online workbench for the extraction of simple and complex terms from domain-specific corpora in English, French, Italian and Spanish. In this framework, three issues contribute to placing the most important terms in the foreground. First, unlike the elaborate morphosyntactic patterns proposed by most previous research, shallow lexical filters have been constructed to discard term candidates. Second, a large number of common stopwords are automatically detected by means of a method that relies on the IATE database together with the frequency distribution of the domain-specific corpus and a general corpus. Third, the term-ranking metric, which is grounded on the notions of salience, relevance and cohesion, is guided by the IATE database to display an adequate distribution of terms.
- Is Part Of:
- Natural language engineering. Volume 24:Part 2(2018)
- Journal:
- Natural language engineering
- Issue:
- Volume 24:Part 2(2018)
- Issue Display:
- Volume 24, Issue 2, Part 2 (2018)
- Year:
- 2018
- Volume:
- 24
- Issue:
- 2
- Part:
- 2
- Issue Sort Value:
- 2018-0024-0002-0002
- Page Start:
- 163
- Page End:
- 198
- Publication Date:
- 2017-10-05
- Subjects:
- Natural language processing (Computer science) -- Periodicals
- Software engineering -- Periodicals
- 006.35
- Journal URLs:
- http://journals.cambridge.org/action/displayJournal?jid=NLE
- DOI:
- 10.1017/S1351324917000365
- Languages:
- English
- ISSNs:
- 1351-3249
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 6959.xml