Preprints
https://doi.org/10.5194/egusphere-2026-3249
https://doi.org/10.5194/egusphere-2026-3249
16 Jun 2026
 | 16 Jun 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

New insights on the suitability of NetCDF/HDF5 as storage format for climate cloud repositories

Ezequiel Cimadevilla, David Hassell, and Bryan N. Lawrence

Abstract. Climate data analysis increasingly relies on cloud infrastructures to offer new and efficient methods of accessing the necessary climate data. Cloud repositories allow access to such data in a remote data access basis, which allows users to retrieve and manipulate their data without requiring file downloads, reducing storage costs resulting in more efficient systems. Together, climate cloud repositories and remote data access are evolving fast, due to the necessity of collaboration that brings together diverse communities to address challenges in climate science. In recent years, a prevailing discourse has emerged suggesting that traditional climate data storage formats are inherently unsuitable for remote data access. In this work, we present new insights that challenge this discourse and demonstrate that established storage formats such as NetCDF/HDF5 can continue to operate efficiently in cloud environments when accessed remotely. These findings contrast with the widespread perception that such formats are inherently unsuitable for cloud based workflows. In the context of the onset of CMIP7, these insights have the potential to substantially enhance climate data access and analysis for the broader research community without incurring major maintenance burdens.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Ezequiel Cimadevilla, David Hassell, and Bryan N. Lawrence

Status: open (until 11 Aug 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Ezequiel Cimadevilla, David Hassell, and Bryan N. Lawrence
Ezequiel Cimadevilla, David Hassell, and Bryan N. Lawrence
Metrics will be available soon.
Latest update: 16 Jun 2026
Download
Short summary
Climate science is increasingly adopting cloud-based infrastructures for data access and analysis. Contrary to the widespread belief that traditional formats such as NetCDF/HDF5 are unsuitable for remote cloud access, this study demonstrates that they can perform efficiently in cloud environments. These findings challenge current assumptions and provide practical insights that could support CMIP7 data workflows while reducing the need for major changes to existing climate data infrastructures.
Share