Interactive optimization of embedding-based text similarity calculations. (October 2022)
- Record Type:
- Journal Article
- Title:
- Interactive optimization of embedding-based text similarity calculations. (October 2022)
- Main Title:
- Interactive optimization of embedding-based text similarity calculations
- Authors:
- Witschard, Daniel
Jusufi, Ilir
Martins, Rafael M
Kucher, Kostiantyn
Kerren, Andreas - Abstract:
- Comparing text documents is an essential task for a variety of applications within diverse research fields, and several different methods have been developed for this. However, calculating text similarity is an ambiguous and context-dependent task, so many open challenges still exist. In this paper, we present a novel method for text similarity calculations based on the combination of embedding technology and ensemble methods. By using several embeddings, instead of only one, we show that it is possible to achieve higher quality, which in turn is a key factor for developing high-performing applications for text similarity exploitation. We also provide a prototype visual analytics tool which helps the analyst to find optimal performing ensembles and gain insights to the inner workings of the similarity calculations. Furthermore, we discuss the generalizability of our key ideas to fields beyond the scope of text analysis.
- Is Part Of:
- Information visualization. Volume 21:Number 4(2022)
- Journal:
- Information visualization
- Issue:
- Volume 21:Number 4(2022)
- Issue Display:
- Volume 21, Issue 4 (2022)
- Year:
- 2022
- Volume:
- 21
- Issue:
- 4
- Issue Sort Value:
- 2022-0021-0004-0000
- Page Start:
- 335
- Page End:
- 353
- Publication Date:
- 2022-10
- Subjects:
- Text embedding -- ensemble methods -- text similarity -- similarity calculations -- visual analytics
Information visualization -- Periodicals
006.605 - Journal URLs:
- http://ivi.sagepub.com/ ↗
http://www.palgrave-journals.com/ivs/index.html ↗
http://www.uk.sagepub.com ↗ - DOI:
- 10.1177/14738716221114372 ↗
- Languages:
- English
- ISSNs:
- 1473-8716
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4496.401000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 22142.xml