Optimizing output operations in high-resolution climate models through dynamic scheduling

Wang, Dong; Huang, Xiaomeng

doi:https://doi.org/10.5194/egusphere-2024-3533

Dong Wang and Xiaomeng Huang

Abstract. This study presents a new approach to improve the efficiency of data output in high-resolution climate models. The method begins by forwarding data to processes with lighter workloads or finishing their tasks earlier, allowing these units to serve as temporary storage. Following this, the processes create multiple smaller communication groups to reorganize the data and then use an I/O aggregation approach to enable efficient parallel writing. A dedicated control process dynamically manages these phases based on the status of each process. To further refine the I/O strategies, we collect performance data from the target machine to build a simulated environment. A reinforcement learning agent is deployed in this environment to identify and test better parameter configurations. Experiments conducted on two models, GOMO1.0 and LICOM3, show that this method increases output efficiency by factors of 1.54 and 13.1, respectively, compared to the commonly used PnetCDF and MPI-IO. These results suggest that this approach can significantly reduce the overhead associated with data output, providing a promising solution for enhancing the performance of climate models.

Received: 12 Nov 2024 – Discussion started: 16 Jan 2025

Competing interests: The author Xiaomeng Huang is the member of the editorial board of journal GMD.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Country	#	Views	%
United States of America	1	73	44
undefined	2	14	8
China	3	12	7
Germany	4	8	4
France	5	8	4


Total:	0
HTML:	0
PDF:	0
XML:	0

Optimizing output operations in high-resolution climate models through dynamic scheduling

Viewed

Viewed (geographical distribution)