Extending latent semantic analysis to manage its syntactic blindness. (1st March 2021)
- Record Type:
- Journal Article
- Title:
- Extending latent semantic analysis to manage its syntactic blindness. (1st March 2021)
- Main Title:
- Extending latent semantic analysis to manage its syntactic blindness
- Authors:
- Suleman, Raja Muhammad
Korkontzelos, Ioannis - Abstract:
- Highlights: Latent Semantic Analysis is syntactically blind. LSA is unable to handle negation in sentences. xLSA uses Syntactic Information to enhance semantic similarity results. xLSA outperforms neural network-based models on simple inverse sentences. Abstract: Natural Language Processing (NLP) is the sub-field of Artificial Intelligence that represents and analyses human language automatically. NLP has been employed in many applications, such as information retrieval, information processing and automated answer ranking. Semantic analysis focuses on understanding the meaning of text. Among other proposed approaches, Latent Semantic Analysis (LSA) is a widely used corpus-based approach that evaluates similarity of text based on the semantic relations among words. LSA has been applied successfully in diverse language systems for calculating the semantic similarity of texts. LSA ignores the structure of sentences, i.e., it suffers from a syntactic blindness problem. LSA fails to distinguish between sentences that contain semantically similar words but have opposite meanings. Disregarding sentence structure, LSA cannot differentiate between a sentence and a list of keywords. If the list and the sentence contain similar words, comparing them using LSA would lead to a high similarity score. In this paper, we propose xLSA, an extension of LSA that focuses on the syntactic structure of sentences to overcome the syntactic blindness problem of the original LSA approach. xLSA wasHighlights: Latent Semantic Analysis is syntactically blind. LSA is unable to handle negation in sentences. xLSA uses Syntactic Information to enhance semantic similarity results. xLSA outperforms neural network-based models on simple inverse sentences. Abstract: Natural Language Processing (NLP) is the sub-field of Artificial Intelligence that represents and analyses human language automatically. NLP has been employed in many applications, such as information retrieval, information processing and automated answer ranking. Semantic analysis focuses on understanding the meaning of text. Among other proposed approaches, Latent Semantic Analysis (LSA) is a widely used corpus-based approach that evaluates similarity of text based on the semantic relations among words. LSA has been applied successfully in diverse language systems for calculating the semantic similarity of texts. LSA ignores the structure of sentences, i.e., it suffers from a syntactic blindness problem. LSA fails to distinguish between sentences that contain semantically similar words but have opposite meanings. Disregarding sentence structure, LSA cannot differentiate between a sentence and a list of keywords. If the list and the sentence contain similar words, comparing them using LSA would lead to a high similarity score. In this paper, we propose xLSA, an extension of LSA that focuses on the syntactic structure of sentences to overcome the syntactic blindness problem of the original LSA approach. xLSA was tested on sentence pairs that contain similar words but have significantly different meaning. Our results showed that xLSA alleviates the syntactic blindness problem, providing more realistic semantic similarity scores. … (more)
- Is Part Of:
- Expert systems with applications. Volume 165(2021)
- Journal:
- Expert systems with applications
- Issue:
- Volume 165(2021)
- Issue Display:
- Volume 165, Issue 2021 (2021)
- Year:
- 2021
- Volume:
- 165
- Issue:
- 2021
- Issue Sort Value:
- 2021-0165-2021-0000
- Page Start:
- Page End:
- Publication Date:
- 2021-03-01
- Subjects:
- Natural Language Processing -- Natural Language Understanding -- Latent Semantic Analysis -- Semantic Similarity
Expert systems (Computer science) -- Periodicals
Systèmes experts (Informatique) -- Périodiques
Electronic journals
006.33 - Journal URLs:
- http://www.sciencedirect.com/science/journal/09574174 ↗
http://www.elsevier.com/journals ↗ - DOI:
- 10.1016/j.eswa.2020.114130 ↗
- Languages:
- English
- ISSNs:
- 0957-4174
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 3842.004220
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22337.xml