Compressing and interpreting word embeddings with latent space regularization and interactive semantics probing. (January 2023)
- Record Type:
- Journal Article
- Title:
- Compressing and interpreting word embeddings with latent space regularization and interactive semantics probing. (January 2023)
- Main Title:
- Compressing and interpreting word embeddings with latent space regularization and interactive semantics probing
- Authors:
- Li, Haoyu
Wang, Junpeng
Zheng, Yan
Wang, Liang
Zhang, Wei
Shen, Han-Wei - Abstract:
- Word embedding, a high-dimensional (HD) numerical representation of words generated by machine learning models, has been used for different natural language processing tasks, for example, translation between two languages. Recently, there has been an increasing trend of transforming the HD embeddings into a latent space (e.g. via autoencoders) for further tasks, exploiting various merits the latent representations could bring. To preserve the embeddings' quality, these works often map the embeddings into an even higher-dimensional latent space, making the already complicated embeddings even less interpretable and consuming more storage space. In this work, we borrow the idea ofβ VAE to regularize the HD latent space. Our regularization implicitly condenses information from the HD latent space into a much lower-dimensional space, thus compressing the embeddings. We also show that each dimension of our regularized latent space is more semantically salient, and validate our assertion by interactively probing the encoding-level of user-proposed semantics in the dimensions. To the end, we design a visual analytics system to monitor the regularization process, explore the HD latent space, and interpret latent dimensions' semantics. We validate the effectiveness of our embedding regularization and interpretation approach through both quantitative and qualitative evaluations.
- Is Part Of:
- Information visualization. Volume 22:Number 1(2023)
- Journal:
- Information visualization
- Issue:
- Volume 22:Number 1(2023)
- Issue Display:
- Volume 22, Issue 1 (2023)
- Year:
- 2023
- Volume:
- 22
- Issue:
- 1
- Issue Sort Value:
- 2023-0022-0001-0000
- Page Start:
- 52
- Page End:
- 68
- Publication Date:
- 2023-01
- Subjects:
- High-dimensional data visualization -- visual analytics -- neural networks -- word embedding
Information visualization -- Periodicals
006.605 - Journal URLs:
- http://ivi.sagepub.com/ ↗
http://www.palgrave-journals.com/ivs/index.html ↗
http://www.uk.sagepub.com ↗ - DOI:
- 10.1177/14738716221130338 ↗
- Languages:
- English
- ISSNs:
- 1473-8716
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.401000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 24258.xml