Preprints
https://doi.org/10.5194/egusphere-2024-3533
https://doi.org/10.5194/egusphere-2024-3533
16 Jan 2025
 | 16 Jan 2025

Optimizing output operations in high-resolution climate models through dynamic scheduling

Dong Wang and Xiaomeng Huang

Abstract. This study presents a new approach to improve the efficiency of data output in high-resolution climate models. The method begins by forwarding data to processes with lighter workloads or finishing their tasks earlier, allowing these units to serve as temporary storage. Following this, the processes create multiple smaller communication groups to reorganize the data and then use an I/O aggregation approach to enable efficient parallel writing. A dedicated control process dynamically manages these phases based on the status of each process. To further refine the I/O strategies, we collect performance data from the target machine to build a simulated environment. A reinforcement learning agent is deployed in this environment to identify and test better parameter configurations. Experiments conducted on two models, GOMO1.0 and LICOM3, show that this method increases output efficiency by factors of 1.54 and 13.1, respectively, compared to the commonly used PnetCDF and MPI-IO. These results suggest that this approach can significantly reduce the overhead associated with data output, providing a promising solution for enhancing the performance of climate models.

Competing interests: The author Xiaomeng Huang is the member of the editorial board of journal GMD.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Dong Wang and Xiaomeng Huang

Status: final response (author comments only)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
  • RC1: 'Comment on egusphere-2024-3533', Anonymous Referee #1, 27 Feb 2025
    • AC1: 'Reply on RC1', Dong Wang, 13 Mar 2025
  • RC2: 'Comment on egusphere-2024-3533', Anonymous Referee #2, 16 Mar 2025
    • AC2: 'Reply on RC2', Dong Wang, 19 Mar 2025
Dong Wang and Xiaomeng Huang
Dong Wang and Xiaomeng Huang

Viewed

Total article views: 216 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
139 66 11 216 8 7
  • HTML: 139
  • PDF: 66
  • XML: 11
  • Total: 216
  • BibTeX: 8
  • EndNote: 7
Views and downloads (calculated since 16 Jan 2025)
Cumulative views and downloads (calculated since 16 Jan 2025)

Viewed (geographical distribution)

Total article views: 221 (including HTML, PDF, and XML) Thereof 221 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 22 May 2025
Download
Short summary
This study presents a method to enhance data output efficiency in high-resolution climate models by redistributing workloads and allowing lighter tasks to temporarily store data. We use smaller communication groups and I/O aggregation for efficient data writing. A reinforcement learning agent optimizes the approach based on performance data from two models, suggesting a promising strategy to reduce data output overhead and improve model performance.
Share