A novel word sense disambiguation approach using WordNet knowledge graph. (July 2022)
- Record Type:
- Journal Article
- Title:
- A novel word sense disambiguation approach using WordNet knowledge graph. (July 2022)
- Main Title:
- A novel word sense disambiguation approach using WordNet knowledge graph
- Authors:
- AlMousa, Mohannad
Benlamri, Rachid
Khoury, Richard
- Abstract:
- Various applications in computational linguistics and artificial intelligence rely on high-performing word sense disambiguation techniques to solve challenging tasks such as information retrieval, machine translation, question answering, and document clustering. While text comprehension is intuitive for humans, machines face tremendous challenges in processing and interpreting a human's natural language. This paper presents a novel knowledge-based word sense disambiguation algorithm, namely Sequential Contextual Similarity Matrix Multiplication (SCSMM). The SCSMM algorithm combines semantic similarity, heuristic knowledge, and document context to respectively exploit the merits of local sense-based context between consecutive terms, human knowledge about terms, and a document's main topic in disambiguating terms. Unlike other algorithms, the SCSMM algorithm guarantees the capture of the maximum sentence context while maintaining the terms' order within the sentence. The proposed algorithm outperformed all other algorithms when disambiguating nouns on the combined gold standard datasets, while demonstrating comparable results to current state-of-the-art word sense disambiguation systems when dealing with each dataset separately. Furthermore, the paper discusses the impact of granularity level, ambiguity rate, sentence size, and part of speech distribution on the performance of the proposed algorithm.
Highlights: Semantic similarity affects the overall performance of knowledge-based Word Sense Disambiguation (WSD) systems. With Semantic similarity, sense heuristics, and document context, we designed a novel knowledge-based word sense disambiguation algorithm. The Sequential Contextual Similarity Matrix Multiplication (SCSMM) algorithm captures the maximum sentence context while maintaining the words' order. The SCSMM algorithm outperforms current WSD systems when disambiguating nouns. …
- Is Part Of:
- Computer speech & language. Volume 74(2022)
- Journal:
- Computer speech & language
- Issue:
- Volume 74(2022)
- Issue Display:
- Volume 74, Issue 2022 (2022)
- Year:
- 2022
- Volume:
- 74
- Issue:
- 2022
- Issue Sort Value:
- 2022-0074-2022-0000
- Page Start:
- Page End:
- Publication Date:
- 2022-07
- Subjects:
- Semantic word sense disambiguation -- Knowledge-based -- Knowledge graph -- WordNet
Speech processing systems -- Periodicals
Automatic speech recognition -- Periodicals
Computers -- Periodicals
Linguistics -- Periodicals
Speech-Language Pathology -- Periodicals
Traitement automatique de la parole -- Périodiques
Reconnaissance automatique de la parole -- Périodiques
Automatic speech recognition
Speech processing systems
Electronic journals
Periodicals
006.454
- Journal URLs:
- http://www.journals.elsevier.com/computer-speech-and-language/
http://www.elsevier.com/journals
- DOI:
- 10.1016/j.csl.2021.101337
- Languages:
- English
- ISSNs:
- 0885-2308
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 3394.276600
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21011.xml