Preprints
https://doi.org/10.5194/egusphere-2026-1246
https://doi.org/10.5194/egusphere-2026-1246
17 Mar 2026
 | 17 Mar 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

CMIP6 data usage: Lessons learned from more than 200 million downloads

Juliette Lavoie, Aude Carreric, Alistair Duffey, Giovanni Chellini, and Elisa Ziegler

Abstract. Earth system simulations from the Coupled Model Intercomparison Project (CMIP) are considered the gold standard in terms of representation of the Earth’s climate system, its past and present states, and future evolution. As CMIP moves into its seventh phase, the increasing complexity of Earth system models (ESMs) means that there is a greater need for infrastructure resources to store, distribute and utilize CMIP simulations. Statistics on the usage of data during CMIP6 has the potential of offering guidance to prepare for CMIP7. Here, we analyse the usage of CMIP6 data and propose recommendations for optimizing the production and accessibility of future CMIP data. Our analysis focuses on CMIP6 data usage statistics from the Earth System Grid Federation (ESGF), the main database of CMIP and other ESMs simulation data. We perform an analysis of CMIP6 ESGF data usage statistics, with a focus on the usage of variables, experiments, individual thematic Model Intercomparison Projects (MIPs), sources and institutions, and related geographical usage trends. We further include statistics on usage from other sources hosting CMIP6 data, including some curated by community portals (Pangeo) through commercial clouds (Google Cloud and Amazon Web Services) and by climate services (Copernicus Climate Change Service). We conclude with recommendations for centres involved in the production and distribution of data to optimise resources based on usage statistics, and to implement improved approaches to track usage.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Juliette Lavoie, Aude Carreric, Alistair Duffey, Giovanni Chellini, and Elisa Ziegler

Status: open (until 12 May 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Juliette Lavoie, Aude Carreric, Alistair Duffey, Giovanni Chellini, and Elisa Ziegler
Juliette Lavoie, Aude Carreric, Alistair Duffey, Giovanni Chellini, and Elisa Ziegler
Metrics will be available soon.
Latest update: 17 Mar 2026
Download
Short summary
The Coupled Model Intercomparison Project (CMIP) is a large collaborative project to better understand the Earth’s climate system. The data produced through this project is downloaded by users around the world. In this paper, we analyze the patterns of downloads and the usage of this massive dataset. From this analysis, we make some recommendations for future data production and usage tracking.
Share