Evaluating the Statistical Agreement between Gridded Evapotranspiration Data Sets in the Conterminous United States via Triple Collocation
Abstract. Evapotranspiration (ET) is a critical component in hydrologic budgets and one of the most difficult to measure, which can create substantial inconsistencies among ET datasets. This is particularly an issue for benchmarking the performance of hydrologic model simulations, as we lack any ground truth gridded data for verifying ET datasets. Commonly, some form of collocation analysis is used to estimate the error variance of ET data from multiple independent datasets. However, this technique only assesses the error variances and does not estimate the biases of the datasets from the truth, which can be more important for some applications. Although the biases from the truth cannot be determined, relative biases between dataset can. To assess these, we developed a novel method that combines the temporal median of the relative bias between datasets and the error covariance matrices estimated from the Extended Collocation method to derive dataset agreement probabilities. We then applied this method to six gridded monthly ET datasets that cover the Conterminous United States (CONUS): SSEBop, GLEAM, ERA-5-Land, NLDAS-2, TerraClimate, and the water balance ET dataset from Reitz et al. (2023a). From these probabilities, we found that all but one dataset pair had >70 % of grid cells with p-values > 0.16 (and >85 % of grid cells with p-values > 0.05) across CONUS over the full time period, which indicates reasonable agreement. When split by season, winter has 36.8 % of grid cells across all dataset pairs with p-values > 0.16 (56.7 % with p-values > 0.05), with spring, summer, and fall having 41.9 % (63.3 %), 54.2 % (74.4 %), and 63.3 % (82.2 %), respectively. However, when looking at certain regions of concern for water resources management, estimated agreement probabilities split by season showed that the Central Valley had a majority of dataset pairs with p-values < 0.005 in the summer. This may be directly attributed to the lack of an irrigation component in the GLEAM, ERA5-Land, NLDAS, and TerraClimate datasets.