A Principal Component Analysis (PCA)-based framework for automated variable selection in geodemographic classification. Issue 4 (2nd October 2019)
- Record Type:
- Journal Article
- Title:
- A Principal Component Analysis (PCA)-based framework for automated variable selection in geodemographic classification. Issue 4 (2nd October 2019)
- Main Title:
- A Principal Component Analysis (PCA)-based framework for automated variable selection in geodemographic classification
- Authors:
- Liu, Yunzhe
Singleton, Alex
Arribas-Bel, Daniel - Abstract:
- ABSTRACT: A geodemographic classification aims to describe the most salient characteristics of a small area zonal geography. However, such representations are influenced by the methodological choices made during their construction. Of particular debate are the choice and specification of input variables, with the objective of identifying inputs that add value but also aim for model parsimony. Within this context, our paper introduces a principal component analysis (PCA)-based automated variable selection methodology that has the objective of identifying candidate inputs to a geodemographic classification from a collection of variables. The proposed methodology is exemplified in the context of variables from the UK 2011 Census, and its output compared to the Office for National Statistics 2011 Output Area Classification (2011 OAC). Through the implementation of the proposed methodology, the quality of the cluster assignment was improved relative to 2011 OAC, manifested by a lower total within-cluster sum of square score. Across the UK, more than 70.2% of the Output Areas (OAs) occupied by the newly created classification (i.e. AVS-OAC) outperform the 2011 OAC, with particularly strong performance within Scotland and Wales.
- Is Part Of:
- Geo-spatial information science. Volume 22:Issue 4(2019)
- Journal:
- Geo-spatial information science
- Issue:
- Volume 22:Issue 4(2019)
- Issue Display:
- Volume 22, Issue 4 (2019)
- Year:
- 2019
- Volume:
- 22
- Issue:
- 4
- Issue Sort Value:
- 2019-0022-0004-0000
- Page Start:
- 251
- Page End:
- 264
- Publication Date:
- 2019-10-02
- Subjects:
- Geodemographics -- variable selection -- UK census -- spatial data mining -- principal component analysis
Geographic information systems -- Periodicals
Cartography -- Data processing -- Periodicals
Surveying -- Data processing -- Periodicals
Remote sensing -- Periodicals
526.0285 - Journal URLs:
- http://www.springerlink.com/content/120480/ ↗
http://www.tandfonline.com/loi/tgsi20#.Vh45TZWFOig ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/10095020.2019.1621549 ↗
- Languages:
- English
- ISSNs:
- 1009-5020
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4158.896405
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 21378.xml