A dimension range representation (DRR) measure for self-organizing maps. (May 2016)
- Record Type:
- Journal Article
- Title:
- A dimension range representation (DRR) measure for self-organizing maps. (May 2016)
- Main Title:
- A dimension range representation (DRR) measure for self-organizing maps
- Authors:
- Clark, Stephanie
Sisson, Scott. A.
Sharma, Ashish - Abstract:
- Abstract: A common tool in exploratory data analysis, the self-organizing map, or SOM, is used for clustering and visualisation to discover patterns in large, high-dimensional data sets. The output map may be interpreted to gain an understanding of the structure of the original data set, correlations between variables, and the characteristics the clusters formed by placing the data on the map. However, if the map does not represent all dimensions of the data in an informative way, map interpretation may be misleading. Currently there is no measure of how well a SOM represents each dimension of a data set, and therefore how descriptive the map vectors are of the full structure of the data they represent. A dimension range representation (DRR) measure is proposed to quantify how well represented each dimension of the data set is by the map vectors of the SOM. This can be used to choose between different map size and shape options that could potentially represent a specific data set. Through examples, it is demonstrated how the DRR measure is used to inform the choice of map size and shape, leading to more informative insight into the original data set through examination of the output map. Highlights: The self-organizing map is a popular technique for clustering and visualization. If the map vectors represent all dimensions of the data well, insight into the data set will be improved. A 'dimension range representation' measure is introduced to quantify coverage of the data byAbstract: A common tool in exploratory data analysis, the self-organizing map, or SOM, is used for clustering and visualisation to discover patterns in large, high-dimensional data sets. The output map may be interpreted to gain an understanding of the structure of the original data set, correlations between variables, and the characteristics the clusters formed by placing the data on the map. However, if the map does not represent all dimensions of the data in an informative way, map interpretation may be misleading. Currently there is no measure of how well a SOM represents each dimension of a data set, and therefore how descriptive the map vectors are of the full structure of the data they represent. A dimension range representation (DRR) measure is proposed to quantify how well represented each dimension of the data set is by the map vectors of the SOM. This can be used to choose between different map size and shape options that could potentially represent a specific data set. Through examples, it is demonstrated how the DRR measure is used to inform the choice of map size and shape, leading to more informative insight into the original data set through examination of the output map. Highlights: The self-organizing map is a popular technique for clustering and visualization. If the map vectors represent all dimensions of the data well, insight into the data set will be improved. A 'dimension range representation' measure is introduced to quantify coverage of the data by the map. Information gained from the map can be improved with optimal map size and shape. The DRR measure can be used to choose output map size and shape to improve insights. … (more)
- Is Part Of:
- Pattern recognition. Volume 53(2016:May)
- Journal:
- Pattern recognition
- Issue:
- Volume 53(2016:May)
- Issue Display:
- Volume 53 (2016)
- Year:
- 2016
- Volume:
- 53
- Issue Sort Value:
- 2016-0053-0000-0000
- Page Start:
- 276
- Page End:
- 286
- Publication Date:
- 2016-05
- Subjects:
- Self-organizing maps -- Quality -- Error measure -- Dimension -- Coverage -- Map size -- Map shape -- Extreme values
Pattern perception -- Periodicals
Perception des structures -- Périodiques
Patroonherkenning
006.4 - Journal URLs:
- http://www.sciencedirect.com/science/journal/00313203 ↗
http://www.sciencedirect.com/ ↗ - DOI:
- 10.1016/j.patcog.2015.11.002 ↗
- Languages:
- English
- ISSNs:
- 0031-3203
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7799.xml