A new approach for textual feature selection based on N-composite isolated labels. (29th March 2020)
- Record Type:
- Journal Article
- Title:
- A new approach for textual feature selection based on N-composite isolated labels. (29th March 2020)
- Main Title:
- A new approach for textual feature selection based on N-composite isolated labels
- Authors:
- Elloumi, Samir
- Abstract:
- Abstract: Textual Feature Selection (TFS) aims to extract relevant parts or segments from text as being the most relevant ones w.r.t. the information it expresses. The selected features are useful for automatic indexing, summarization, document categorization, knowledge discovery, so on. Regarding the huge amount of electronic textual data daily published, many challenges related to the semantic aspect as well as the processing efficiency are addressed. In this paper, we propose a new approach for TFS based on Formal Concept Analysis background. Mainly, we propose to extract textual features by exploring the regularities in a formal context where isolated points exist. We introduce the notion of N -composite isolated points as a set of N words to be considered as a unique textual feature. We show that a reduced value of N (between 1 and 3) allows extracting significant textual features compared with existing approaches even for non-completely covering an initial formal context.
- Is Part Of:
- Natural language engineering. Volume 26:Part 2(2020)
- Journal:
- Natural language engineering
- Issue:
- Volume 26:Part 2(2020)
- Issue Display:
- Volume 26, Issue 2, Part 2 (2020)
- Year:
- 2020
- Volume:
- 26
- Issue:
- 2
- Part:
- 2
- Issue Sort Value:
- 2020-0026-0002-0002
- Page Start:
- 221
- Page End:
- 243
- Publication Date:
- 2020-03-29
- Subjects:
- Textual features, -- Formal concept, -- Composite isolated point, -- Difunctional relation
Natural language processing (Computer science) -- Periodicals
Software engineering -- Periodicals
006.35 - Journal URLs:
- http://journals.cambridge.org/action/displayJournal?jid=NLE ↗
- DOI:
- 10.1017/S1351324919000160 ↗
- Languages:
- English
- ISSNs:
- 1351-3249
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library HMNTS - ELD Digital store
- Ingest File:
- 14630.xml