An effective implementation and assessment of a random forest classifier as a soil spatial predictive model. Issue 8 (18th April 2018)
- Record Type:
- Journal Article
- Title:
- An effective implementation and assessment of a random forest classifier as a soil spatial predictive model. Issue 8 (18th April 2018)
- Main Title:
- An effective implementation and assessment of a random forest classifier as a soil spatial predictive model
- Authors:
- Shukla, Gaurav
Garg, Rahul Dev
Srivastava, Hari Shanker
Garg, Pradeep Kumar
- Abstract:
- Mapping the spatial distribution of soil classes is important for informing soil use and management decisions. This study aimed to effectively implement a Random Forest (RF) model and to evaluate the behaviour and performance of the model for soil classification of Indian districts. Soil-forming factors, known as 'scorpan', are selected as environmental covariates to tune the RF model to classify 11 different soil categories. Thirty-five digital layers are prepared using different satellite data [ALOS (Advanced Land Observing Satellite) digital elevation model, Landsat-8, Moderate Resolution Imaging Spectroradiometer normalized difference vegetation index product, RISAT-1 (Radar Imaging Satellite-1), Sentinel-1A] and climatic data (precipitation and temperature) to represent scorpan environmental covariates in the study area. The RF parameters corresponding to the highest Cohen's kappa coefficient (κ) value and the lowest number of random split variables are considered the optimum values for the RF model. Model behaviour evaluation is based on mapping accuracy, sensitivity to data set size, and noise. Two other machine-learning methods, CART (Classification and Regression Tree) decision tree (CDT) and CART ensemble bagger (CEB), are used to provide the comparative study. To assess the behaviour of the models on false data, noise is introduced by assigning a false class to the training set in 5% increments. Comparative performance of the RF model is based on quality assessment measures. To evaluate the performance of the models, marginal rates, F-measure, Jaccard's coefficient of community, classification success index, and agreement coefficients are selected as quality assessment measures. A score is calculated to rank the algorithms. The RF model shows high stability against data set reduction in comparison to the other methods. The results show that an abrupt change in accuracy is only observed after 60% training data reduction in the RF model; however, a significant decrease in accuracy can be noted after 45% and 25% data reduction in CEB and CDT, respectively. The RF model shows comparatively greater resistance to noise. Overall, the RF model has performed better than CDT and CEB in classifying soil categories in the study area. The results of this research provide new insights into the performance of RF in the context of soil class mapping.
- Is Part Of:
- International journal of remote sensing. Volume 39:Issue 8(2018)
- Journal:
- International journal of remote sensing
- Issue:
- Volume 39:Issue 8(2018)
- Issue Display:
- Volume 39, Issue 8 (2018)
- Year:
- 2018
- Volume:
- 39
- Issue:
- 8
- Issue Sort Value:
- 2018-0039-0008-0000
- Page Start:
- 2637
- Page End:
- 2669
- Publication Date:
- 2018-04-18
- Subjects:
- Remote sensing -- Periodicals
Télédétection -- Périodiques
621.3678
- Journal URLs:
- http://www.tandfonline.com/toc/tres20/current
http://www.tandfonline.com/
- DOI:
- 10.1080/01431161.2018.1430399
- Languages:
- English
- ISSNs:
- 0143-1161
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms)
- Physical Locations:
- British Library DSC - 4542.528000
British Library DSC - BLDSS-3PM
British Library STI - ELD Digital store
- Ingest File:
- 18586.xml