High-dimensional data analysis with subspace comparison using matrix visualization. (January 2019)
- Record Type:
- Journal Article
- Title:
- High-dimensional data analysis with subspace comparison using matrix visualization. (January 2019)
- Main Title:
- High-dimensional data analysis with subspace comparison using matrix visualization
- Authors:
- Wang, Junpeng
Liu, Xiaotong
Shen, Han-Wei
- Abstract:
- Due to the intricate relationship between different dimensions of high-dimensional data, subspace analysis is often conducted to decompose dimensions and give prominence to certain subsets of dimensions, i.e. subspaces. Exploring and comparing subspaces are important to reveal the underlying features of subspaces, as well as to portray the characteristics of individual dimensions. To date, most of the existing high-dimensional data exploration and analysis approaches rely on dimensionality reduction algorithms (e.g. principal component analysis and multi-dimensional scaling) to project high-dimensional data, or their subspaces, to two-dimensional space and employ scatterplots for visualization. However, the dimensionality reduction algorithms are sometimes difficult to fine-tune and scatterplots are not effective for comparative visualization, making subspace comparison hard to perform. In this article, we aggregate high-dimensional data or their subspaces by computing pair-wise distances between all data items and showing the distances with matrix visualizations to present the original high-dimensional data or subspaces. Our approach enables effective visual comparisons among subspaces, which allows users to further investigate the characteristics of individual dimensions by studying their behaviors in similar subspaces. Through subspace comparisons, we identify dominant, similar, and conforming dimensions in different subspace contexts of synthetic and real-world high-dimensional data sets. Additionally, we present a prototype that integrates parallel coordinates plot and matrix visualization for high-dimensional data exploration and incremental dimensionality analysis, which also allows users to further validate the dimension characterization results derived from the subspace comparisons.
- Is Part Of:
- Information visualization. Volume 18:Number 1(2019)
- Journal:
- Information visualization
- Issue:
- Volume 18:Number 1(2019)
- Issue Display:
- Volume 18, Issue 1 (2019)
- Year:
- 2019
- Volume:
- 18
- Issue:
- 1
- Issue Sort Value:
- 2019-0018-0001-0000
- Page Start:
- 94
- Page End:
- 109
- Publication Date:
- 2019-01
- Subjects:
- High-dimensional data -- matrix visualization -- subspace comparison
Information visualization -- Periodicals
006.605
- Journal URLs:
- http://ivi.sagepub.com/
http://www.palgrave-journals.com/ivs/index.html
http://www.uk.sagepub.com
- DOI:
- 10.1177/1473871617733996
- Languages:
- English
- ISSNs:
- 1473-8716
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4496.401000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 9434.xml