Preprints
https://doi.org/10.5194/egusphere-2026-262
https://doi.org/10.5194/egusphere-2026-262
30 Jan 2026
 | 30 Jan 2026
Status: this preprint is open for discussion and under review for Ocean Science (OS).

An interpretable machine learning for marine heatwave prediction for the south China sea

Peihao Yang and Guodong Ye

Abstract. A primary challenge of machine learning to predict marine heatwave (MHW) for the south China sea (SCS) is the limited availability of observational data for model training. To address this issue, this study explores the viability of leveraging multi-member ensemble simulations from the Coupled Model Intercomparison Project Phase 6 (CMIP6), to construct an extensive, physically consistent training dataset for various machine learning models. After training on multiple CMIP6 ensemble members, the constructed models are evaluated for their predictive capacity regarding MHW in the SCS. The results also show that these machine learning-based methods can perform comparably to the existing dynamic models in terms of prediction performance, and in some cases even outperform the latter. Furthermore, by incorporating machine learning interpretability techniques, the key physical processes can also be elucidated from these predictions. That is to say, the new method is not a traditional "black box", but rather an effective tool that can possess certain physical transparency and scientific interpretability.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Peihao Yang and Guodong Ye

Status: open (until 27 Mar 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Peihao Yang and Guodong Ye
Peihao Yang and Guodong Ye
Metrics will be available soon.
Latest update: 30 Jan 2026
Download
Short summary
Because long records are scarce, we trained models using a large set of global climate simulations, combining sea surface temperature with air temperature, air pressure, and winds. Checked against observations in the South China Sea, accuracy is highest when predicting nearer in time and can be similar to traditional physics models. Sea surface temperature drives short term predictions, while wind changes matter more for longer term outlooks, supporting better warnings of extreme ocean warming.
Share