Preprints
https://doi.org/10.5194/egusphere-2024-864
https://doi.org/10.5194/egusphere-2024-864
02 Apr 2024
 | 02 Apr 2024

HESS Opinions: A few camels or a whole caravan?

Franziska Maria Clerc-Schwarzenbach, Giovanni Selleri, Mattia Neri, Elena Toth, Ilja van Meerveld, and Jan Seibert

Abstract. Large-sample datasets containing hydrometeorological time series and catchment attributes for hundreds of catchments in a country, many of them known as “Camels” (catchment attributes and meteorology for large-sample studies), have revolutionized hydrological modelling and enabled comparative analyses. The Caravan dataset is a compilation of several (“Camels” and other) large-sample datasets with uniform attribute names and data structure. This simplifies large-sample hydrology across regions, continents, or the globe. However, the use of the Caravan dataset instead of the original Camels or other large-sample datasets may affect model results and the conclusions derived thereof. For the Caravan dataset, the meteorological forcing data are based on ERA5-Land reanalysis data. Here, we describe the differences between the original precipitation, temperature, and potential evapotranspiration (Epot) data for 1252 catchments in the CAMELS-US, CAMELS-BR, and CAMELS-GB datasets and the forcing data for these catchments in the Caravan dataset. The Epot in the Caravan dataset is unrealistically high for many catchments but there are, not surprisingly, also considerable differences in the precipitation data. We show that the use of the forcing data from the Caravan dataset impairs hydrological model calibration for the vast majority of catchments, i.e., there is a drop in the calibration performance when using the forcing data from the Caravan dataset compared to the original Camels datasets. This drop is mainly due to the differences in the precipitation data. Therefore, we suggest extending the Caravan dataset with the forcing data included in the original Camels datasets wherever possible, so that users can choose which forcing data they want to use, or at least indicating clearly that the forcing data in Caravan come with a data quality loss and using the original datasets is recommended. Moreover, we suggest not using the Epot data (and derived catchment attributes, such as the aridity index) from the Caravan dataset and replacing these with (or based on) alternative Epot estimates.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Franziska Maria Clerc-Schwarzenbach, Giovanni Selleri, Mattia Neri, Elena Toth, Ilja van Meerveld, and Jan Seibert

Status: final response (author comments only)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-864', Thorsten Wagener, 27 Apr 2024
    • AC1: 'Reply on RC1', Franziska Clerc-Schwarzenbach, 06 Jun 2024
  • RC2: 'Comment on egusphere-2024-864', François Brissette, 24 May 2024
    • AC2: 'Reply on RC2', Franziska Clerc-Schwarzenbach, 06 Jun 2024
Franziska Maria Clerc-Schwarzenbach, Giovanni Selleri, Mattia Neri, Elena Toth, Ilja van Meerveld, and Jan Seibert

Model code and software

HESS Opinions: A few camels or a whole caravan? Franziska M. Clerc-Schwarzenbach https://doi.org/10.5281/zenodo.10784701

Franziska Maria Clerc-Schwarzenbach, Giovanni Selleri, Mattia Neri, Elena Toth, Ilja van Meerveld, and Jan Seibert

Viewed

Total article views: 947 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
709 207 31 947 18 14
  • HTML: 709
  • PDF: 207
  • XML: 31
  • Total: 947
  • BibTeX: 18
  • EndNote: 14
Views and downloads (calculated since 02 Apr 2024)
Cumulative views and downloads (calculated since 02 Apr 2024)

Viewed (geographical distribution)

Total article views: 903 (including HTML, PDF, and XML) Thereof 903 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 16 Jul 2024
Download
Large sample hydrology datasets such as Caravan provides the community with hydrometeorological information and catchment attributes for many catchments in the world and offers the opportunity for hydrological research. However, there are considerable differences between the forcing data of Caravan compared to the CAMELS datasets, especially with potential evaporation. This can lead to wrong conclusions on catchment hydrological drivers and affect regionalization. This papers shows the important of robustness of large sample datasets and the need to keep assessing that.
Short summary
We compare the catchment forcing data provided in large-sample datasets, namely the Caravan dataset and three of the original CAMELS datasets (US, BR, GB). We show that the differences affect hydrological model performance and that the data quality in the Caravan dataset is lower than the one in the CAMELS datasets, both for precipitation and potential evapotranspiration. We want to raise awareness of the lower data quality in Caravan and we suggest possible improvements for the Caravan dataset.