Distributed Estimation of the Curve Number (CN) in Continental Ecuador Using Machine Learning, Official Geo-Pedological Data, and Field-Based Hydrological Validation

Archive/Distributed Estimation of the Curve Number (CN) in Continental Ecuador Using Machine Learning, Official Geo-Pedological Data, and Field-Based Hydrological Validation

Carlos Andrés Maldonado Chávez, Benito Guillermo Mendoza Trujillo, Andrés Santiago Cisneros Barahona et al.

3. Juli 2026

Abstract

The Curve Number (CN) remains one of the most widely applied parameters for estimating direct surface runoff. However, its conventional application based on watershed-aggregated tabulated values conceals hydrological variability in regions with contrasting soils and steep topographic gradients. A recurring limitation of distributed CN approaches is the absence of independent hydrological validation; most machine learning models are trained and evaluated against the same SCS-USDA lookup values used to construct the training target, a circular scheme that measures statistical agreement rather than physical credibility. This study develops a reproducible geospatial workflow for distributed CN estimation across continental Ecuador, combining official MAG land use, soil surface texture natural drainage, and topographic slope layers at 1:25,000 scale with a Random Forest regression model at 10 m spatial resolution. The CN reference raster was derived from official geo-pedological layers and independently validated, not against tabulated assumptions, but against observed hydrological behaviour. Field hydraulic characterization across four dominant land cover classes in the Guamote microwatershed (Chimborazo Province), combined with HEC-HMS (US Army Corps of Engineers, Davis, CA, USA) rainfall-runoff modelling over 41 years (1981–2021), confirmed a mean annual discharge of 0.1568 m3 s−1 consistent with the tabulated CN assignments. To our knowledge, this is the first nationally distributed CN map with field-anchored hydrological benchmarking for an Andean country. The Random Forest model achieved an RMSE = 10.4, an R2 = 0.42, and an NSE = 0.41, a performance consistent with published field-based CN estimation studies and expected given the inherent scatter of the SCS-USDA method under real-world conditions. Zonal CN comparisons confirmed a mean absolute error below 5 CN units across the Andean highland and Amazon watersheds; the Guamote watershed showed a mean ∆CN below 4 units against the field-calibrated model. Land use and surface texture emerged as the dominant CN predictors, with natural drainage providing critical discrimination in volcanic and poorly drained soil environments. The resulting 10 m national CN map offers a physically grounded, spatially explicit parameterization layer for distributed hydrological modeling and water resources planning across data-scarce Andean and tropical territories, with direct relevance for flood risk screening, irrigation planning, watershed conservation, and climate adaptation under SDG 6, SDG 11, SDG 13 and SDG 15.

Metadata

DOI: 10.3390/hydrology13070177 CC BY 4.0 license

IPC Classification

G06A01B60

Keywords

distributedestimationcurvenumbercontinentalecuadormachinelearningofficialgeo-pedologicaldatafield-basedhydrologicalvalidationhydrologyremainsmostwidelyappliedparametersestimatingdirectsurfacerunoff

Diese Veröffentlichung zitieren

€ 4.00

← Back to Archive