Preprints
https://doi.org/10.5194/egusphere-2024-2088
https://doi.org/10.5194/egusphere-2024-2088
14 Oct 2024
 | 14 Oct 2024

Accelerating research through community open source software for a standardized file format to improve process representation in numerical weather prediction models

Johanna Tjernström, Michael Gallagher, Jareth Holt, Gunilla Svensson, Matthew D. Shupe, Jonathan J. Day, Lara Ferrighi, Siri Jodha Khalsa, Leslie M. Hartten, Ewan O'Connor, Zen Mariani, and Øystein Godøy

Abstract. Improvements in process representation in numerical weather prediction (NWP) models requires informed collaboration between scientists making research-grade observations and scientist developing state-of-the-art NWP models. As a result, progress in model quality relies heavily on the ability to efficiently evaluate and reliably reconcile these two sources of information. To facilitate such progress, with focus on enhanced model skill in polar regions, the Year of Polar Prediction site Model Intercomparison Project (YOPPsiteMIP) community defined the Merged Data File (MDF) format. The file format is designed for high temporal and spatial resolution data for direct comparison between observations and model output to assess parameterized processes under various conditions. A broad overview of the MDF format is provided along with supporting use-cases defined by the research community, and present a set of free, open-source, computational tools for creating and utilizing this standardized format. Two free open source Python packages are discussed: 1) “The MDF toolkit", a data processing library for the creation of standardized datasets, and 2) "MDF visualization", a set of Python codes in notebook format that accelerate model evaluation and climate process research utilizing the MDF format. The benefits of such tools that may help unite diverse groups of researchers through a common data-format language are also discussed.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Johanna Tjernström, Michael Gallagher, Jareth Holt, Gunilla Svensson, Matthew D. Shupe, Jonathan J. Day, Lara Ferrighi, Siri Jodha Khalsa, Leslie M. Hartten, Ewan O'Connor, Zen Mariani, and Øystein Godøy

Status: final response (author comments only)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-2088', Anonymous Referee #1, 30 Oct 2024
  • RC2: 'Comment on egusphere-2024-2088', Anonymous Referee #2, 05 Dec 2024
  • AC1: 'Comment on egusphere-2024-2088', Johanna Tjernström, 05 Jan 2025
Johanna Tjernström, Michael Gallagher, Jareth Holt, Gunilla Svensson, Matthew D. Shupe, Jonathan J. Day, Lara Ferrighi, Siri Jodha Khalsa, Leslie M. Hartten, Ewan O'Connor, Zen Mariani, and Øystein Godøy
Johanna Tjernström, Michael Gallagher, Jareth Holt, Gunilla Svensson, Matthew D. Shupe, Jonathan J. Day, Lara Ferrighi, Siri Jodha Khalsa, Leslie M. Hartten, Ewan O'Connor, Zen Mariani, and Øystein Godøy

Viewed

Total article views: 329 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
214 49 66 329 7 6
  • HTML: 214
  • PDF: 49
  • XML: 66
  • Total: 329
  • BibTeX: 7
  • EndNote: 6
Views and downloads (calculated since 14 Oct 2024)
Cumulative views and downloads (calculated since 14 Oct 2024)

Viewed (geographical distribution)

Total article views: 310 (including HTML, PDF, and XML) Thereof 310 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 14 Jan 2025
Download
Short summary
The value of numerical weather predictions can be enhanced in several ways, one is to improve the representations of small-scale processes in models. To understand what needs to be improved, the model results need to be evaluated. Following standardized principles, a file format has been defined to be as similar as possible for both observational and model data. Python packages and toolkits are presented as a community resource in the production of the files and evaluation analysis.