Preprints
https://doi.org/10.5194/egusphere-2025-4811
https://doi.org/10.5194/egusphere-2025-4811
29 Oct 2025
 | 29 Oct 2025
Status: this preprint is open for discussion and under review for Atmospheric Chemistry and Physics (ACP).

Technical note: A Flexible Framework for Precision Truncation and Lossless Compression in WRF Simulations with Application over the United States

Shang Wu, David C. Wong, Jiandong Wang, Yuzhi Jin, Junjun Li, and Chunsong Lu

Abstract. As climate simulations generate increasingly large datasets, reducing storage demands without compromising scientific integrity has become a critical challenge. This study evaluates the effectiveness of precision truncation, applied prior to lossless compression, in balancing storage efficiency and fidelity within regional Weather Research and Forecasting (WRF) simulations over the United States. We examine input-only, output-only, and combined input–output truncation strategies across both routine meteorological variables and extreme precipitation indices. Results show that conventional atmospheric fields remain robust when outputs are truncated to 5 or 4 significant digits, keeping biases within acceptable limits. Wind speed is largely insensitive to truncation, temperature and humidity are more vulnerable under aggressive output truncation (3 significant digits). Precipitation shows mixed responses, with deviations dominated by input perturbations. Extreme precipitation indices display more complex sensitivities: percentile- and maximum-based indices are highly susceptible to nonlinear, regionally heterogeneous biases under input truncation, whereas frequency- and intensity-based indices respond more systematically to output truncation, with substantial distortions emerging at 3 digits. These findings demonstrate that truncation strategies cannot be applied uniformly but must be tailored to variable type and diagnostic. Within this study, output-only truncation emerges as the most reliable strategy, with 4 significant digits identified as a safe lower bound and 5 digits preferable when fidelity of extreme-event is critical. To implement this in practice, we introduce a flexible error-tolerance framework that applies a predefined threshold across all indices and adapts truncation levels by region and season, enabling substantial storage savings while safeguarding the integrity of climate diagnostics.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Shang Wu, David C. Wong, Jiandong Wang, Yuzhi Jin, Junjun Li, and Chunsong Lu

Status: open (until 10 Dec 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Shang Wu, David C. Wong, Jiandong Wang, Yuzhi Jin, Junjun Li, and Chunsong Lu

Data sets

Dataset for the paper 'A Flexible Framework for Precision Truncation and Lossless Compression in WRF Simulations: Method and Application over the United States' Shang Wu https://doi.org/10.5281/zenodo.17139028

Model code and software

Truncation tool for the paper 'A Flexible Framework for Precision Truncation and Lossless Compression in WRF Simulations: Method and Application over the United States' David C. Wong https://doi.org/10.5281/zenodo.17156737

Shang Wu, David C. Wong, Jiandong Wang, Yuzhi Jin, Junjun Li, and Chunsong Lu
Metrics will be available soon.
Latest update: 29 Oct 2025
Download
Short summary
Climate simulations create huge amounts of data that are difficult to store and share. In this study, we developed a simple method to reduce file sizes while keeping the scientific information accurate. By carefully shortening numbers before applying compression, we tested different settings on U.S. weather simulations and found ways to save space without losing key results. This approach helps scientists work more efficiently and supports better access to climate data for the wider community.
Share