Content area

Abstract

The transition from conventional soil mapping (CSM) to digital soil mapping (DSM) not only affects the final map products, but it also affects the concepts of scale, resolution, and sampling intensity. This is critical because in the CSM approach, sampling intensity is intricately linked to the desired scale of soil map publication, which provided standardization of sampling. This is not the case for DSM where sample size varies widely by project, and sampling design studies have largely focused on where to sample without due consideration for sample size. Using a regional soil survey dataset with 1791 sampled and described soil profiles, we first extracted an external validation dataset using the conditioned Latin hypercube sampling (cLHS) algorithm and then created repeated (n = 10) sample plans of increasing size from the remaining calibration sites using the cLHS, feature space coverage sampling (FSCS), and simple random sampling (SRS). We then trained random forest (RF) models for four soil properties: pH, CEC, clay content, and SOC at five different depths. We identified the effective sample size based on the model learning curves and compared it to the optimal sample size determined from the Jensen–Shannon divergence (DJS) applied to the environmental covariates. Maps were then generated from models that used all the calibration points (reference maps) and from models that used the optimal sample size (optimal maps) for comparison. Our findings revealed that the optimal sample sizes based on the DJS analysis were closely aligned with the effective sample sizes from the model learning curves (815 for cLHS, 832 for FSCS, and 847 for SRS). Furthermore, the comparison of the optimal maps to the reference maps showed little difference in the global statistics (concordance correlation coefficient and root mean square error) and spatial trends of the data, confirming that the optimal sample size was sufficient for creating predictions of similar accuracy to the full calibration dataset. Finally, we conclude that the Ottawa soil survey project could have saved between CAD 330,500 and CAD 374,000 (CAD = Canadian dollars) if the determination of optimal sample size tools presented herein existed during the project planning phase. This clearly illustrates the need for additional research in determining an optimal sample size for DSM and demonstrates that operationalization of DSM in public institutions requires a sound scientific basis for determining sample size.

Details

1009240
Title
Post-hoc Evaluation of Sample Size in a Regional Digital Soil Mapping Project
Author
Saurette, Daniel D 1   VIAFID ORCID Logo  ; Heck, Richard J 2   VIAFID ORCID Logo  ; Gillespie, Adam W 2 ; Berg, Aaron A 3   VIAFID ORCID Logo  ; Biswas, Asim 2   VIAFID ORCID Logo 

 School of Environmental Sciences, University of Guelph, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada or [email protected] (D.D.S.); [email protected] (R.J.H.); [email protected] (A.W.G.); Ontario Ministry of Agriculture, Food and Agribusiness, 1 Stone Rd West, Guelph, ON N1G 2Y4, Canada 
 School of Environmental Sciences, University of Guelph, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada or [email protected] (D.D.S.); [email protected] (R.J.H.); [email protected] (A.W.G.) 
 Department of Geography, Environment & Geomatics, University of Guelph, 50 Stone Rd East, Guelph, ON N1G 2W1, Canada; [email protected] 
Publication title
Land; Basel
Volume
14
Issue
3
First page
545
Publication year
2025
Publication date
2025
Publisher
MDPI AG
Place of publication
Basel
Country of publication
Switzerland
Publication subject
e-ISSN
2073445X
Source type
Scholarly Journal
Language of publication
English
Document type
Journal Article
Publication history
 
 
Online publication date
2025-03-05
Milestone dates
2024-12-22 (Received); 2025-03-03 (Accepted)
Publication history
 
 
   First posting date
05 Mar 2025
ProQuest document ID
3181561293
Document URL
https://www.proquest.com/scholarly-journals/post-hoc-evaluation-sample-size-regional-digital/docview/3181561293/se-2?accountid=208611
Copyright
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2025-03-27
Database
ProQuest One Academic