Preprints
https://doi.org/10.5194/egusphere-2024-3533
https://doi.org/10.5194/egusphere-2024-3533
16 Jan 2025
 | 16 Jan 2025
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Optimizing output operations in high-resolution climate models through dynamic scheduling

Dong Wang and Xiaomeng Huang

Abstract. This study presents a new approach to improve the efficiency of data output in high-resolution climate models. The method begins by forwarding data to processes with lighter workloads or finishing their tasks earlier, allowing these units to serve as temporary storage. Following this, the processes create multiple smaller communication groups to reorganize the data and then use an I/O aggregation approach to enable efficient parallel writing. A dedicated control process dynamically manages these phases based on the status of each process. To further refine the I/O strategies, we collect performance data from the target machine to build a simulated environment. A reinforcement learning agent is deployed in this environment to identify and test better parameter configurations. Experiments conducted on two models, GOMO1.0 and LICOM3, show that this method increases output efficiency by factors of 1.54 and 13.1, respectively, compared to the commonly used PnetCDF and MPI-IO. These results suggest that this approach can significantly reduce the overhead associated with data output, providing a promising solution for enhancing the performance of climate models.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Dong Wang and Xiaomeng Huang

Status: open (until 13 Mar 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Dong Wang and Xiaomeng Huang
Dong Wang and Xiaomeng Huang

Viewed

Total article views: 21 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
17 2 2 21 0 0
  • HTML: 17
  • PDF: 2
  • XML: 2
  • Total: 21
  • BibTeX: 0
  • EndNote: 0
Views and downloads (calculated since 16 Jan 2025)
Cumulative views and downloads (calculated since 16 Jan 2025)

Viewed (geographical distribution)

Total article views: 21 (including HTML, PDF, and XML) Thereof 21 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 17 Jan 2025
Download
Short summary
This study presents a method to enhance data output efficiency in high-resolution climate models by redistributing workloads and allowing lighter tasks to temporarily store data. We use smaller communication groups and I/O aggregation for efficient data writing. A reinforcement learning agent optimizes the approach based on performance data from two models, suggesting a promising strategy to reduce data output overhead and improve model performance.