A random forest classifier with cost-sensitive learning to extract urban landmarks from an imbalanced dataset. Issue 3 (4th March 2022)
- Record Type:
- Journal Article
- Title:
- A random forest classifier with cost-sensitive learning to extract urban landmarks from an imbalanced dataset. Issue 3 (4th March 2022)
- Main Title:
- A random forest classifier with cost-sensitive learning to extract urban landmarks from an imbalanced dataset
- Authors:
- Kang, Mengjun
Liu, Yue
Wang, Mengqi
Li, Lin
Weng, Min
- Abstract:
- ABSTRACT: Urban landmarks play an important role as spatial references in spatial cognition, navigation, map design and urban planning. However, the current landmark extraction methods do not consider the imbalance between the landmark and non-landmark samples in a dataset, so the extraction results are biased toward the class with the majority of sample data, resulting in poor classification performance for the class with the fewest sample data. This study introduces a random forest (RF) classifier combined with cost-sensitive learning to extract urban landmarks automatically from a basic spatial database. First, the optimal feature set is determined according to the importance of features. Next, a cost-sensitive RF algorithm is applied to extract landmarks, which determines the misclassification cost according to the class distribution, and each decision tree is weighted by the classification results. The method has good performance, with a recall and area under the ROC curve (AUC) greater than 90%, and the model is also applicable to small sample sets, which can reduce the cost of manual labor.
- Is Part Of:
- International journal of geographical information science. Volume 36:Issue 3(2022)
- Journal:
- International journal of geographical information science
- Issue:
- Volume 36:Issue 3(2022)
- Issue Display:
- Volume 36, Issue 3 (2022)
- Year:
- 2022
- Volume:
- 36
- Issue:
- 3
- Issue Sort Value:
- 2022-0036-0003-0000
- Page Start:
- 496
- Page End:
- 513
- Publication Date:
- 2022-03-04
- Subjects:
- Urban landmark -- salience -- random forest -- class imbalance -- cost-sensitive ensemble
Geography -- Data processing -- Periodicals
Information storage and retrieval systems -- Periodicals
Géomatique -- Périodiques
Systèmes d'information -- Périodiques
910.285
- Journal URLs:
- http://www.tandfonline.com/loi/tgis20
http://www.tandfonline.com/
- DOI:
- 10.1080/13658816.2021.1977814
- Languages:
- English
- ISSNs:
- 1365-8816
- Deposit Type:
- Legal deposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4542.266150
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store
- Ingest File:
- 21168.xml
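
The abstract mentions two mechanisms: a misclassification cost derived from the class distribution, and a vote in which each decision tree carries a weight. The record does not give the article's formulas, so the following is only a minimal sketch under stated assumptions: the inverse-frequency cost `n_samples / (n_classes * class_count)` is a common heuristic chosen here for illustration, and the function names, class labels, sample counts and tree weights are all hypothetical.

```python
from collections import Counter, defaultdict


def class_costs(labels):
    """Return an inverse-frequency cost per class.

    Assumption: cost = n_samples / (n_classes * class_count), a common
    heuristic; the article's own cost formula is not given in this record.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


def weighted_vote(tree_predictions, tree_weights):
    """Combine the per-tree predictions for one sample using per-tree weights.

    Assumption: each tree's weight is simply added to the score of the class
    it predicts, and the class with the highest total score wins.
    """
    scores = defaultdict(float)
    for prediction, weight in zip(tree_predictions, tree_weights):
        scores[prediction] += weight
    return max(scores, key=scores.get)


# Hypothetical imbalanced sample: 10 landmarks against 90 non-landmarks.
labels = ["landmark"] * 10 + ["non-landmark"] * 90
costs = class_costs(labels)

# Hypothetical votes from three trees with illustrative weights.
decision = weighted_vote(
    ["landmark", "non-landmark", "non-landmark"],
    [0.9, 0.3, 0.4],
)
```

With these illustrative numbers the minority class receives the larger cost, and a single highly weighted tree can outvote two weaker ones, which is the general effect that cost-sensitive weighting aims for on imbalanced data.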