Exploring point-of-interest data from social media for artificial surface validation with decision trees. Issue 23 (2nd December 2017)
- Record Type:
- Journal Article
- Title:
- Exploring point-of-interest data from social media for artificial surface validation with decision trees. Issue 23 (2nd December 2017)
- Main Title:
- Exploring point-of-interest data from social media for artificial surface validation with decision trees
- Authors:
- Xing, Hanfa
Meng, Yuan
Hou, Dongyang
Cao, Fangjie
Xu, Haibin - Abstract:
- ABSTRACT: Artificial surfaces represent one of the key land cover types, and validation is an indispensable component of land cover mapping that ensures data quality. Traditionally, validation has been carried out by confronting the produced land cover map with reference data, which is collected through field surveys or image interpretation. However, this approach has limitations, including high costs in terms of money and time. Recently, geo-tagged photos from social media have been used as reference data. This procedure has lower costs, but the process of interpreting geo-tagged photos is still time-consuming. In fact, social media point of interest (POI) data, including geo-tagged photos, may contain useful textual information for land cover validation. However, this kind of special textual data has seldom been analysed or used to support land cover validation. This paper examines the potential of textual information from social media POIs as a new reference source to assist in artificial surface validation without photo recognition and proposes a validation framework using modified decision trees. First, POI datasets are classified semantically to divide POIs into the standard taxonomy of land cover maps. Then, a decision tree model is built and trained to classify POIs automatically. To eliminate the effects of spatial heterogeneity on POI classification, the shortest distances between each POI and both roads and villages serve as two factors in the modified decisionABSTRACT: Artificial surfaces represent one of the key land cover types, and validation is an indispensable component of land cover mapping that ensures data quality. Traditionally, validation has been carried out by confronting the produced land cover map with reference data, which is collected through field surveys or image interpretation. However, this approach has limitations, including high costs in terms of money and time. Recently, geo-tagged photos from social media have been used as reference data. This procedure has lower costs, but the process of interpreting geo-tagged photos is still time-consuming. In fact, social media point of interest (POI) data, including geo-tagged photos, may contain useful textual information for land cover validation. However, this kind of special textual data has seldom been analysed or used to support land cover validation. This paper examines the potential of textual information from social media POIs as a new reference source to assist in artificial surface validation without photo recognition and proposes a validation framework using modified decision trees. First, POI datasets are classified semantically to divide POIs into the standard taxonomy of land cover maps. Then, a decision tree model is built and trained to classify POIs automatically. To eliminate the effects of spatial heterogeneity on POI classification, the shortest distances between each POI and both roads and villages serve as two factors in the modified decision tree model. Finally, a data transformation based on a majority vote algorithm is then performed to convert the classified points into raster form for the purposes of applying confusion matrix methods to the land cover map. Using Beijing as a study area, social media POIs from Sina Weibo were collected to validate artificial surfaces in GlobeLand30 in 2010. A classification accuracy of 80.68% was achieved through our modified decision tree method. Compared with a classification method without spatial heterogeneity, the accuracy is 10% greater. This result indicates that our modified decision tree method displays considerable skill in classifying POIs with high spatial heterogeneity. In addition, a high validation accuracy of 92.76% was achieved, which is relatively close to the official result of 86.7%. These preliminary results indicate that social media POI datasets are valuable ancillary data for land cover validation, and our proposed validation framework provides opportunities for land cover validation with low costs in terms of money and time. … (more)
- Is Part Of:
- International journal of remote sensing. Volume 38:Issue 23(2017)
- Journal:
- International journal of remote sensing
- Issue:
- Volume 38:Issue 23(2017)
- Issue Display:
- Volume 38, Issue 23 (2017)
- Year:
- 2017
- Volume:
- 38
- Issue:
- 23
- Issue Sort Value:
- 2017-0038-0023-0000
- Page Start:
- 6945
- Page End:
- 6969
- Publication Date:
- 2017-12-02
- Subjects:
- Remote sensing -- Periodicals
Télédétection -- Périodiques
621.3678 - Journal URLs:
- http://www.tandfonline.com/toc/tres20/current ↗
http://www.tandfonline.com/ ↗ - DOI:
- 10.1080/01431161.2017.1368101 ↗
- Languages:
- English
- ISSNs:
- 0143-1161
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 4542.528000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store - Ingest File:
- 8061.xml