the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
NN4CAST: An end-to-end deep learning application for seasonal climate forecasts
Abstract. Predicting climate variables at seasonal time scales is crucial for climate services. At these time scales, the most important driver of the atmospheric variability patterns is the ocean, which can trigger local perturbations that could affect the climate of remote areas through different teleconnection mechanisms. Dynamical models are not always effective in overcoming the challenges posed by complex climate processes and may show a low signal-to-noise ratio in the response of some phenomena (such as ENSO). Recently, statistical approaches, which focus on the relationships between predictor and predictand fields, have gained popularity due to advances in computing capabilities and the availability of extensive meteorological datasets. This is particularly evidenced in the development of short-range weather prediction data-driven models. However, seasonal prediction remains challenging, especially in regions where there are complex non-linear remote interactions of non-stationary and seasonal dependent signals from different sources of predictability of the Earth system. For this reason, we have developed a non-linear modelling tool, named as Neural Network foreCAST (NN4CAST). It is a deep learning model library which provides a versatile tool for operational prediction and non-linear statistical analysis, which can enhance seasonal forecast accuracy and understanding of the underlying dynamics. It is demonstrated how NN4CAST is able to provide accurate seasonal predictions in regions where the atmospheric response to the ocean is mostly linear (i.e., tropics) as well as in remote areas through atmospheric teleconnections, where the signal is highly non-linear (i.e., North Atlantic). It is in those remote regions where the dynamical models fail to capture the climate signal, and where the deep learning models may be more useful for a seasonal data-driven or hybrid prediction system. Therefore, the application of this model could provide accurate and reliable forecasts that can be useful for the different strategic societal sectors such as marine ecosystems, health or energy, which require predictions on these time scales.
- Preprint
(2102 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 27 Nov 2024)
-
RC1: 'Comment on egusphere-2024-2897', Anonymous Referee #1, 24 Oct 2024
reply
This is an interesting application for building sub-seasonal models but I have several concerns that would be good to address before it is published.
Major Corrections
1. It is not clear to me what the focus of this paper. Is it to present this new framework and the model you have trained is just an example of an application that could be done with the new framework? Or is the idea to present this new model? If the former, do you have plans to extend this into short lead time weather forecasting? It seems to me that most of what you have developed e.g. hyperparameter tuning and XAI could be useful here. If the latter then I think it would be good to have a better description of the model.
2. As you mention in the introduction, sub-seasonal forecasting is very uncertain. I think for this framework to have a significant impact, it would need to be able to include a way to quantify uncertainty. For example, allowing multiple initial conditions, injections of Gaussian noise or the generation of ensembles.
3. It is unclear to me who provides the model? Are there example models provided in the repository? Or can the user prepare their own models and what format should they be in? Torch/tensorflow?
4. For the model you present, I am not convinced that cross-validation is appropriate across an annual timescale. Is the idea to make the model robust against climate change? We know that ERA5 is also worse pre-1979 because of the lack of satellite observations.
Minor
1. I think in the discussion around line 65, it would be good to mention Neural GCM as an example of an effective hybrid model.
2. I am not sure Lines 73-74 follow. You say the models are largely linear and then you say that this is important for non-linear relationships?
3. I think large parts of Section 2 could be removed. The basic theory of neural networks does not need to be included in a paper.
Citation: https://doi.org/10.5194/egusphere-2024-2897-RC1 -
CEC1: 'Comment on egusphere-2024-2897: No compliance with the policy of the journal', Juan Antonio Añel, 29 Oct 2024
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlYou have archived your code on GitHub (https://github.com/Victorgf00/nn4cast/tree/main). However, GitHub is not a suitable repository for scientific publication. GitHub itself instructs authors to use other long-term archival and publishing alternatives, such as Zenodo. Therefore, please publish your code in one of the appropriate repositories and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible, as we can not accept manuscripts in Discussions that do not comply with our policy. Therefore, the current situation with your manuscript is irregular.
In this way, if you do not fix this problem, we will have to reject your manuscript for publication in our journal.
Also, you must include the modified 'Code and Data Availability' section in a potentially reviewed manuscript, the DOI of the code (and another DOI for the dataset if necessary).
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2024-2897-CEC1 -
AC1: 'Reply on CEC1', Víctor Galván Fraile, 30 Oct 2024
reply
Dear Dr. Añel,
Thank you for your feedback and for bringing the "Code and Data Policy" requirements to our attention. We apologize for the oversight in our initial submission.
We have now archived our code in compliance with the journal's requirements by depositing it in Zenodo. Please find the DOI for the code below:
DOI: 10.5281/zenodo.14011998
We have also updated the 'Code and Data Availability' section in our manuscript accordingly. The revised manuscript with the modified section will be submitted promptly.
Thank you once again for the guidance, and please let us know if there is anything further needed to facilitate the review process.
Sincerely,
Víctor Galván (on behalf of the authors team)Citation: https://doi.org/10.5194/egusphere-2024-2897-AC1 -
CEC2: 'Reply on AC1', Juan Antonio Añel, 30 Oct 2024
reply
Dear authors,
Many thanks for addressing this issue so quickly. We can consider now the current version of your manuscript in compliance with the code policy of our journal.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2024-2897-CEC2
-
CEC2: 'Reply on AC1', Juan Antonio Añel, 30 Oct 2024
reply
-
AC1: 'Reply on CEC1', Víctor Galván Fraile, 30 Oct 2024
reply
Data sets
NN4CAST_manual Víctor Galván Fraile et al. https://doi.org/10.5281/zenodo.13768761
Model code and software
NN4CAST Víctor Galván Fraile https://github.com/Victorgf00/nn4cast/tree/main
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
310 | 45 | 11 | 366 | 2 | 2 |
- HTML: 310
- PDF: 45
- XML: 11
- Total: 366
- BibTeX: 2
- EndNote: 2
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1