Integrated concept blending with vector space models. (November 2016)
- Record Type:
- Journal Article
- Title:
- Integrated concept blending with vector space models. (November 2016)
- Main Title:
- Integrated concept blending with vector space models
- Authors:
- Calvo, Hiram
Méndez, Oscar
Moreno-Armendáriz, Marco A. - Abstract:
- Abstract : Highlights: A method for merging an arbitrary number of nouns, mixing their meanings. Successful model using vector representation, scantily explored for concept retrieval. Experiments with 3 semantic space models: WN, a thesaurus, and a topic based model. Good performance with an automatically obtained resource comparable to a manual one. Evaluation by qualified reviewers and comparison with a traditional dictionary. Abstract: Traditional concept retrieval is based on usual word definition dictionaries with simple performance: they just map words to their definitions. This approach is mostly helpful for readers and language students, but writers sometimes need to find a word that encompasses a set of ideas that they have in mind. For this task, inverse dictionaries are ready to help; however, in some cases a sought word does not correspond to a single definition but to a composite meaning of several concepts. A language producer then tends to require a concept search that starts with a group of words or a series of related terms, looking for a target word. This paper aims to assist on this task by presenting a new approach for concept blending through the development of a search-by-concept method based on vector space representation using semantic analysis and statistical natural language processing techniques. Words are represented as numeric vectors based on different semantic similarity measures and probabilistic measures; the semantic properties of a word areAbstract : Highlights: A method for merging an arbitrary number of nouns, mixing their meanings. Successful model using vector representation, scantily explored for concept retrieval. Experiments with 3 semantic space models: WN, a thesaurus, and a topic based model. Good performance with an automatically obtained resource comparable to a manual one. Evaluation by qualified reviewers and comparison with a traditional dictionary. Abstract: Traditional concept retrieval is based on usual word definition dictionaries with simple performance: they just map words to their definitions. This approach is mostly helpful for readers and language students, but writers sometimes need to find a word that encompasses a set of ideas that they have in mind. For this task, inverse dictionaries are ready to help; however, in some cases a sought word does not correspond to a single definition but to a composite meaning of several concepts. A language producer then tends to require a concept search that starts with a group of words or a series of related terms, looking for a target word. This paper aims to assist on this task by presenting a new approach for concept blending through the development of a search-by-concept method based on vector space representation using semantic analysis and statistical natural language processing techniques. Words are represented as numeric vectors based on different semantic similarity measures and probabilistic measures; the semantic properties of a word are captured in the vector elements determined by a given linguistic context. Three different sources are used as context for word vector construction: WordNet, a distributional thesaurus, and the Latent Dirichlet Allocation algorithm; each source is used for building a different semantic vector space. The concept-blender input is then conformed by a set of n-nouns. All input members are read and substituted by their corresponding vectors. Then, a semantic space analysis including a filtering and ranking process is carried out to deploy a list of target words. A test set of 50 concepts was created in order to evaluate the system's performance. A group of 30 evaluators found our integrated concept blending model to provide better results for finding an adequate word for the provided set of concepts. … (more)
- Is Part Of:
- Computer speech & language. Volume 40(2016)
- Journal:
- Computer speech & language
- Issue:
- Volume 40(2016)
- Issue Display:
- Volume 40, Issue 2016 (2016)
- Year:
- 2016
- Volume:
- 40
- Issue:
- 2016
- Issue Sort Value:
- 2016-0040-2016-0000
- Page Start:
- 79
- Page End:
- 96
- Publication Date:
- 2016-11
- Subjects:
- Computational linguistics -- Natural language processing -- Lexicography -- Vector space models -- Reverse lookup dictionaries -- Concept-blending
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454 - Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/ ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.csl.2016.01.004 ↗
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 1262.xml