Preprints
https://doi.org/10.5194/egusphere-2025-3147
https://doi.org/10.5194/egusphere-2025-3147
18 Jul 2025
 | 18 Jul 2025

Compression of ERA5 meteorological reanalysis data and their application to simulations with the Lagrangian model for Massive Parallel Trajectory Calculations (MPTRAC v2.7)

Farahnaz Khosrawi and Lars Hoffmann

Abstract. Computer performance has increased immensely in recent years, but the ability to store data has only increased slightly. The storage requirements for the current version of the ERA5 meteorological reanalysis data provided by the European Centre for Medium-Range Weather Forecasts (ECMWF) have increased by a factor of ∼80 compared to its predecessor ERA-Interim. This presents scientists with major challenges, especially if data covering several decades is to be stored on local computer systems. Accordingly, many compression methods have been developed in recent years with which data can be stored either lossless or lossy. Here we test three of these methods: two lossy compression methods, ZFP and Layer Packing (PCK), and the lossless compressor ZStandard (ZSTD). We investigate how the use of these compressed data affects the results of Lagrangian air parcel trajectory calculations with the Lagrangian model for Massive-Parallel Trajectory Calculations (MPTRAC). We analyzed 10-day forward trajectories that were globally distributed over the free troposphere and stratosphere. The largest transport deviations (up to 1600 km) were derived when using ZFP with the largest compression (CR=25). Using a less strong compression we could reduce the transport deviation (up to 100 km) and still obtain a significant compression (CR=7). Since ZSTD is a lossless compressor, we derive no transport deviations when using these compressed files for trajectory calculations, but do not reduce the use of disk space significantly using this compressor (reduction of ∼30 %, CR=1.5). The best compromise concerning compression efficiency and transport deviations is derived with the layer packing method PCK. The data is compressed by about 50 % (CR=2) but horizontal transport deviations do not exceed 40 km. Thus, our study shows that the PCK compression method would be valuable for application in atmospheric sciences and that with compression of the ERA5 meteorological reanalyses data one can overcome the challenges of high demand of disk space from this data set.

Competing interests: Lars Hoffmann is an Editor of Geoscientific Model Development.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Farahnaz Khosrawi and Lars Hoffmann

Status: final response (author comments only)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2025-3147', Anonymous Referee #1, 08 Aug 2025
    • AC1: 'Reply on RC1', Farahnaz Khosrawi, 17 Sep 2025
      • EC1: 'Reply on AC1', Peter Caldwell, 17 Sep 2025
  • RC2: 'Comment on egusphere-2025-3147', Anonymous Referee #2, 17 Aug 2025
Farahnaz Khosrawi and Lars Hoffmann
Farahnaz Khosrawi and Lars Hoffmann

Viewed

Total article views: 973 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
922 40 11 973 19 17
  • HTML: 922
  • PDF: 40
  • XML: 11
  • Total: 973
  • BibTeX: 19
  • EndNote: 17
Views and downloads (calculated since 18 Jul 2025)
Cumulative views and downloads (calculated since 18 Jul 2025)

Viewed (geographical distribution)

Total article views: 958 (including HTML, PDF, and XML) Thereof 958 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 17 Sep 2025
Download
Short summary
Computer performance has increased immensely in recent years, but the ability to store data has only increased slightly. This presents scientists with major challenges. Many compression methods have been developed in recent years with which data can be stored either lossless or lossy. Here we test three of these methods: two lossy compression methods and one lossless compressor. Our study shows that compression is a valuable tool to cope with the high demand of disk space from these data sets.
Share