Fine-resolution soil maps constitute important data for many different environmental studies. Digital soil mapping techniques represent a cost-effective method to obtain detailed information about soil types and soil properties over large areas. The main objective of the study was to extend predictions from 1:25,000 legacy soil surveys (including WRB soil groups, soil depth and soil texture classes) to the larger area of Cyprus. A multiple-trees classification technique, namely Random Forest (RF), was applied. Specific objectives were: (i) to analyze the role and importance of a large data set of environmental predictors, (ii) to investigate the effect of the number of training points, forest size (ntree), the numbers of predictors sampled per node (mtry) and tree size (nodesize) in RF; (iii) to compare RF-derived maps with maps derived with a multinomial logistic regression model, in terms of validation error (test set and independent profiles) and map uncertainty, using the confusion index and a newly developed reliability index. The optimized RF model was run using half of the input points available (over a million) and with ntree equal to 350. The mtry parameter was set to 5 (close to half the number of the environmental variables used) for both soil series and soil properties. The nodesize calibration showed no relevant performance increase and was kept at its default value (1). In terms of environmental variables, the model used 10 predictors, covering all the soil formation factors considered in the scorpan formula, to derive the three maps. Soil properties, derived from geochemistry data, showed a high importance in deriving soil groups, depths and texture. Random Forest constructed a better predictive model than multinomial logistic regression, showing comparable predictive uncertainty but much lower validation error. The RF-derived maps show very low out of bag (OOB) errors (around 10% for both soil groups and soil properties) but relatively high validation error from independent profiles (45% for soil depth, 51% for soil texture). The resulting reliability index was low in the main mountainous area of Cyprus, where predictions were extrapolations as indicated by the multivariate environmental similarity surface, but medium to high in the main agricultural areas of the country.

A high resolution map of soil types and physical properties for Cyprus : a digital soil mapping optimization / C. Camera, Z. Zomeni, J.S. Noller, A.M. Zissimos, I.C. Christoforou, A. Bruggeman. - In: GEODERMA. - ISSN 0016-7061. - 285:(2017), pp. 35-49. [10.1016/j.geoderma.2016.09.019]

A high resolution map of soil types and physical properties for Cyprus : a digital soil mapping optimization

C. Camera
Primo
;
2017

Abstract

Fine-resolution soil maps constitute important data for many different environmental studies. Digital soil mapping techniques represent a cost-effective method to obtain detailed information about soil types and soil properties over large areas. The main objective of the study was to extend predictions from 1:25,000 legacy soil surveys (including WRB soil groups, soil depth and soil texture classes) to the larger area of Cyprus. A multiple-trees classification technique, namely Random Forest (RF), was applied. Specific objectives were: (i) to analyze the role and importance of a large data set of environmental predictors, (ii) to investigate the effect of the number of training points, forest size (ntree), the numbers of predictors sampled per node (mtry) and tree size (nodesize) in RF; (iii) to compare RF-derived maps with maps derived with a multinomial logistic regression model, in terms of validation error (test set and independent profiles) and map uncertainty, using the confusion index and a newly developed reliability index. The optimized RF model was run using half of the input points available (over a million) and with ntree equal to 350. The mtry parameter was set to 5 (close to half the number of the environmental variables used) for both soil series and soil properties. The nodesize calibration showed no relevant performance increase and was kept at its default value (1). In terms of environmental variables, the model used 10 predictors, covering all the soil formation factors considered in the scorpan formula, to derive the three maps. Soil properties, derived from geochemistry data, showed a high importance in deriving soil groups, depths and texture. Random Forest constructed a better predictive model than multinomial logistic regression, showing comparable predictive uncertainty but much lower validation error. The RF-derived maps show very low out of bag (OOB) errors (around 10% for both soil groups and soil properties) but relatively high validation error from independent profiles (45% for soil depth, 51% for soil texture). The resulting reliability index was low in the main mountainous area of Cyprus, where predictions were extrapolations as indicated by the multivariate environmental similarity surface, but medium to high in the main agricultural areas of the country.
Cyprus; Digital soil mapping; Model optimization; Random Forest; Soil landscape model; World Reference Base; Soil Science
Settore GEO/04 - Geografia Fisica e Geomorfologia
Settore GEO/05 - Geologia Applicata
2017
Article (author)
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0016706116304864-main.pdf

accesso riservato

Tipologia: Publisher's version/PDF
Dimensione 2.92 MB
Formato Adobe PDF
2.92 MB Adobe PDF   Visualizza/Apri   Richiedi una copia
Camera_et_al_2017b_postprint_Digital_soil_map_Cyprus.pdf

accesso aperto

Descrizione: Versione non editata accettata dalla rivista
Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 1.74 MB
Formato Adobe PDF
1.74 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/551385
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 94
  • ???jsp.display-item.citation.isi??? 88
social impact