Preprints
https://doi.org/10.5194/egusphere-2024-753
https://doi.org/10.5194/egusphere-2024-753
25 Apr 2024
 | 25 Apr 2024

The effect of lossy compression of numerical weather prediction data on data analysis: a case study using enstools-compression 2023.11

Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig

Abstract. The increasing amount of data in meteorological science requires effective data reduction methods. Our study demonstrates the use of advanced scientific lossy compression techniques to significantly reduce the size of these large datasets, achieving reductions ranging from 5x to over 150x, while ensuring data integrity is maintained. A key aspect of our work is the development of the 'enstools-compression' Python library. This user-friendly tool simplifies the application of lossy compression for Earth scientists and is integrated into the commonly used NetCDF file format workflows in atmospheric sciences. Based on the HDF5 compression filter architecture, 'enstools-compression' is easily used in Python scripts or via command line, enhancing its accessibility for the scientific community. A series of examples, drawn from current atmospheric science research, shows how lossy compression can efficiently manage large meteorological datasets while maintaining a balance between reducing data size and preserving scientific accuracy. This work addresses the challenge of making lossy compression more accessible, marking a significant step forward in efficient data handling in Earth sciences.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Journal article(s) based on this preprint

19 Dec 2024
The effect of lossy compression of numerical weather prediction data on data analysis: a case study using enstools-compression 2023.11
Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig
Geosci. Model Dev., 17, 8909–8925, https://doi.org/10.5194/gmd-17-8909-2024,https://doi.org/10.5194/gmd-17-8909-2024, 2024
Short summary
Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-753', Anonymous Referee #1, 28 May 2024
    • AC1: 'Reply on RC1', Oriol Tinto, 19 Aug 2024
  • RC2: 'Comment on egusphere-2024-753', Anonymous Referee #2, 21 Jun 2024
    • AC2: 'Reply on RC2', Oriol Tinto, 19 Aug 2024

Interactive discussion

Status: closed

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-753', Anonymous Referee #1, 28 May 2024
    • AC1: 'Reply on RC1', Oriol Tinto, 19 Aug 2024
  • RC2: 'Comment on egusphere-2024-753', Anonymous Referee #2, 21 Jun 2024
    • AC2: 'Reply on RC2', Oriol Tinto, 19 Aug 2024

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload
AR by Oriol Tinto on behalf of the Authors (13 Sep 2024)  Author's response   Author's tracked changes   Manuscript 
ED: Referee Nomination & Report Request started (19 Sep 2024) by Sylwester Arabas
RR by Anonymous Referee #2 (02 Oct 2024)
RR by Anonymous Referee #1 (05 Oct 2024)
ED: Publish subject to technical corrections (06 Oct 2024) by Sylwester Arabas
AR by Oriol Tinto on behalf of the Authors (14 Oct 2024)  Author's response   Manuscript 

Journal article(s) based on this preprint

19 Dec 2024
The effect of lossy compression of numerical weather prediction data on data analysis: a case study using enstools-compression 2023.11
Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig
Geosci. Model Dev., 17, 8909–8925, https://doi.org/10.5194/gmd-17-8909-2024,https://doi.org/10.5194/gmd-17-8909-2024, 2024
Short summary
Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig

Model code and software

enstools-compression Oriol Tintó and Robert Redl https://doi.org/10.5281/zenodo.10998676

Interactive computing environment

The effect of lossy compression of numerical weather prediction data on data analysis: software to reproduce figures using enstools-compression Oriol Tintó Prims https://doi.org/10.5281/zenodo.10998604

Oriol Tintó Prims, Robert Redl, Marc Rautenhaus, Tobias Selz, Takumi Matsunobu, Kameswar Rao Modali, and George Craig

Viewed

Total article views: 565 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
388 148 29 565 25 18
  • HTML: 388
  • PDF: 148
  • XML: 29
  • Total: 565
  • BibTeX: 25
  • EndNote: 18
Views and downloads (calculated since 25 Apr 2024)
Cumulative views and downloads (calculated since 25 Apr 2024)

Viewed (geographical distribution)

Total article views: 585 (including HTML, PDF, and XML) Thereof 585 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 19 Dec 2024
Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Short summary
Advanced compression techniques can drastically reduce the size of meteorological datasets (by 5x to 150x) without compromising the data's scientific value. We developed a user-friendly tool called 'enstools-compression' that makes this compression simple for Earth scientists. This tool works seamlessly with common weather and climate data formats. Our work shows that lossy compression can significantly improve how researchers store and analyze large meteorological datasets.