A hybrid physics and machine learning approach highlights and alleviates soil moisture downscaling challenges
Abstract. Soil moisture information is crucial for predicting natural hazards like droughts and landslides, improving climate and weather forecasts, understanding soil-atmosphere exchanges and feedback, and better managing water resources. Soil moisture networks are sparse, and remote sensing products lack necessary resolution. Downscaling techniques aim to enhance resolution, but face challenges because the ancillary data they rely on are often sparse and lack representativeness.
Here, a ”downscaling playground” is created over the Contiguous USA using physics-based hydrological simulations that allow downscaling techniques to be trained and tested without practical limitations. We show that existing biases in soil moisture station locations lead to spatial overfitting, resulting in poorer performance than, for example, if those stations had been randomly located. The biases lead also to unwarranted confidence in the improvement achieved with downscaling, as testing is carried out on stations in similar locations. Finally, we propose a new paradigm that fuses physics-based modelling (ParFlow-CLM) with machine learning (XGBoost) and downscales remote sensing observations (e.g., SMAP) to produce a 1km2 soil moisture product over CONUS. We test the approach thoroughly against in situ observations.
Competing interests: At least one of the (co-)authors is a member of the editorial board of Hydrology and Earth System Sciences.
Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.