Improving word vector model with part‐of‐speech and dependency grammar information. Issue 4 (2nd November 2020)
- Record Type:
- Journal Article
- Title:
- Improving word vector model with part‐of‐speech and dependency grammar information. Issue 4 (2nd November 2020)
- Main Title:
- Improving word vector model with part‐of‐speech and dependency grammar information
- Authors:
- Deng, Chunhui
Lai, Gangming
Deng, Huifang
- Abstract:
- Part‐of‐speech (POS) and dependency grammar (DG) are basic components of natural language processing. However, current word vector models have not made full use of both POS information and DG information, which limits their performance. The authors first put forward the concept of a POS vector and then, based on continuous bag‐of‐words (CBOW), constructed four models, CBOW + P, CBOW + PW, CBOW + G, and CBOW + G + P, to incorporate POS information and DG information into word vectors. The CBOW + P and CBOW + PW models are based on POS tagging, the CBOW + G model is based on DG parsing, and the CBOW + G + P model is based on both POS tagging and DG parsing. POS information is integrated into the training process of word vectors through the POS vector, which addresses the difficulty of measuring POS similarity. The POS vector correlation coefficient and a distance weighting function are used to train the POS vector together with the word vector. DG information is used to correct the information loss caused by fixed context windows, and a dependency relation weight is used to measure the differences between dependency relations. Experiments demonstrated the superior performance of these models, while the time complexity remains the same as that of the base CBOW model.
- Is Part Of:
- CAAI transactions on intelligence technology. Volume 5:Issue 4(2020)
- Journal:
- CAAI transactions on intelligence technology
- Issue:
- Volume 5:Issue 4(2020)
- Issue Display:
- Volume 5, Issue 4 (2020)
- Year:
- 2020
- Volume:
- 5
- Issue:
- 4
- Issue Sort Value:
- 2020-0005-0004-0000
- Page Start:
- 276
- Page End:
- 282
- Publication Date:
- 2020-11-02
- Subjects:
- grammars -- search engines -- natural language processing -- learning (artificial intelligence) -- vectors -- text analysis -- advertising data processing
improving word vector model -- part‐of‐speech -- dependency grammar information -- current word vector models -- POS information -- DG information -- POS vector -- bag‐of‐words -- CBOW + P -- CBOW + PW -- POS tagging -- CBOW + G model -- DG parsing -- CBOW + G + P model -- POS similarity -- distance weighting function -- information loss
Artificial intelligence -- Periodicals
Computer science -- Periodicals
Artificial intelligence
Computer science
Electronic journals
Periodicals
006.305
- Journal URLs:
- https://digital-library.theiet.org/content/journals/trit
https://ietresearch.onlinelibrary.wiley.com/journal/24682322
http://search.ebscohost.com/login.aspx?direct=true&site=edspub-live&scope=site&type=44&db=edspub&authtype=ip, guest&custid=ns011247&groupid=main&profile=eds&bquery=AN%2010129651
http://www.sciencedirect.com/
- DOI:
- 10.1049/trit.2020.0055
- Languages:
- English
- ISSNs:
- 2468-6557
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 2943.720000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 16698.xml