Unsupervised multi-sense language models for natural language processing tasks. (October 2021)

Record Type:: Journal Article
Title:: Unsupervised multi-sense language models for natural language processing tasks. (October 2021)
Main Title:: Unsupervised multi-sense language models for natural language processing tasks
Authors:: Roh, Jihyeon
Park, Sungjin
Kim, Bo-Kyeong
Oh, Sang-Hoon
Lee, Soo-Young
Abstract:: Abstract: Existing language models (LMs) represent each word with only a single representation, which is unsuitable for processing words with multiple meanings. This issue has often been compounded by the lack of availability of large-scale data annotated with word meanings. In this paper, we propose a sense-aware framework that can process multi-sense word information without relying on annotated data. In contrast to the existing multi-sense representation models, which handle information in a restricted context, our framework provides context representations encoded without ignoring word order information or long-term dependency. The proposed framework consists of a context representation stage to encode the variable-size context, a sense-labeling stage that involves unsupervised clustering to infer a probable sense for a word in each context, and a multi-sense LM (MSLM) learning stage to learn the multi-sense representations. Particularly for the evaluation of MSLMs with different vocabulary sizes, we propose a new metric, i.e., unigram-normalized perplexity ( PPLu ), which is also understood as the negated mutual information between a word and its context information. Additionally, there is a theoretical verification of PPLu on the change of vocabulary size. Also, we adopt a method of estimating the number of senses, which does not require further hyperparameter search for an LM performance. For the LMs in our framework, both unidirectional and bidirectional … (more)
Is Part Of:: Neural networks. Volume 142(2021)
Journal:: Neural networks
Issue:: Volume 142(2021)
Issue Display:: Volume 142, Issue 2021 (2021)
Year:: 2021
Volume:: 142
Issue:: 2021
Issue Sort Value:: 2021-0142-2021-0000
Page Start:: 397
Page End:: 409
Publication Date:: 2021-10
Subjects:: Language model -- Neural language processing (NLP) -- Multi-sense word modeling
Neural computers -- Periodicals
Neural networks (Computer science) -- Periodicals
Neural networks (Neurobiology) -- Periodicals
Nervous System -- Periodicals
Ordinateurs neuronaux -- Périodiques
Réseaux neuronaux (Informatique) -- Périodiques
Réseaux neuronaux (Neurobiologie) -- Périodiques
Neural computers
Neural networks (Computer science)
Neural networks (Neurobiology)
Periodicals
006.32
Journal URLs:: http://www.sciencedirect.com/science/journal/08936080 ↗
http://www.elsevier.com/journals ↗
DOI:: 10.1016/j.neunet.2021.05.023 ↗
Languages:: English
ISSNs:: 0893-6080
Deposit Type:: Legaldeposit
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library DSC - 6081.280800
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
Ingest File:: 18473.xml