GPTCast: a weather language model for precipitation nowcasting
Abstract. This work introduces GPTCast, a generative deep-learning method for ensemble nowcast of radar-based precipitation, inspired by advancements in large language models (LLMs). We employ a GPT model as a forecaster to learn spatiotemporal precipitation dynamics using tokenized radar images. The tokenizer is based on a Quantized Variational Autoencoder featuring a novel reconstruction loss tailored for the skewed distribution of precipitation that promotes faithful reconstruction of high rainfall rates. The approach produces realistic ensemble forecasts and provides probabilistic outputs with accurate uncertainty estimation. The model is trained without resorting to randomness, all variability is learned solely from the data and exposed by model at inference for ensemble generation. We train and test GPTCast using a 6-year radar dataset over the Emilia-Romagna region in Northern Italy, showing superior results compared to state-of-the-art ensemble extrapolation methods.
Status: open (until 15 Dec 2024)
-
CEC1: 'Comment on egusphere-2024-3002', Juan Antonio Añel, 30 Oct 2024
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlYou have archived part of your code on GitHub. However, GitHub is not a suitable repository for scientific publication. GitHub itself instructs authors to use other long-term archival and publishing alternatives, such as Zenodo. Therefore, the current situation with your manuscript is irregular. Please, move this GitHub posted code to one of the appropriate repositories and reply to this comment with the relevant information for it (link and a permanent identifier for it (e.g. DOI)) as soon as possible, as we can not accept manuscripts in Discussions that do not comply with our policy.
Please, note that if you do not fix this problem, we could have to reject your manuscript for publication in our journal.
Also, you must include the modified 'Code and Data Availability' section in a potentially reviewed manuscript, the new link and DOI of the code.
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2024-3002-CEC1 -
EC1: 'Reply on CEC1', David Topping, 30 Oct 2024
reply
Dear Juan
The data and code assets are archived on zenodo, as listed in the paper assets tab. An interactive notebook, however, is hosted on Github so I agree this should be included as part of the discussion phase and was intended to do so. Howver the Copernicus email instructions [6/10/24] state that 'Please note, you will not be able to ask for revisions since the preprint has already been posted ', which is rather confusing. Given the tone of your email, could you please clarify at what stage the authors should do this?
Thanks
DaveCitation: https://doi.org/10.5194/egusphere-2024-3002-EC1 -
CEC2: 'Reply on EC1', Juan Antonio Añel, 31 Oct 2024
reply
Dear Dave, Dear authors,
The automatic email from the system means that it is not going to be possible to make changes to the current version in Discussions after the Topical Editor has agreed on publishing it there. However, it is possible to ask for revisions, like in any part of the review process.
Therefore, comments can be made here replying to my request, and new information posted, like the new acceptable repository needed that I request. Actually, this should be done as soon as possible, as any manuscript in Discussions that does not comply with the policy of the journal is a potential risk of wasting time from reviewers and editors. For example, if editors and reviewers perform all the review process and authors finally do not comply with the policy. This could seem unfortunate, but happens sometimes.
Finally, if reviewers recommend a new round of reviews or publication, and you decide to invite a new version of the manuscript or accept this one, during the process, in a new submitted version of the manuscript, the authors can include the information that they posted here in Discussions replying to my comment.
I hope this clarify the situation.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2024-3002-CEC2 -
AC1: 'Reply on CEC2', Gabriele Franch, 01 Nov 2024
reply
Dear Juan, Dear Dave,
The code asset we submitted on Zenodo contains the full archived copy of the Github repository, including both the code and the interactive notebooks: https://zenodo.org/records/13832526
We added the Github link as a convenience source for the interactive computing environment. since it does not require to extract the archive and allows to visualize the notebooks via web, but we understand that this may be a source of confusion for the reviewers since we are listing two different sources for the same asset.
We apologize for the confusion that this may have caused. Please disregard the github link and refer only to the asset archived on Zenodo as a reference for both the code and the interactive notebooks.
We hope this clarifies the situation.
Gabriele Franch
Citation: https://doi.org/10.5194/egusphere-2024-3002-AC1
-
AC1: 'Reply on CEC2', Gabriele Franch, 01 Nov 2024
reply
-
CEC2: 'Reply on EC1', Juan Antonio Añel, 31 Oct 2024
reply
-
EC1: 'Reply on CEC1', David Topping, 30 Oct 2024
reply
Data sets
Dataset for "GPTCast: a weather language model for precipitation nowcasting" Gabriele Franch, Elena Tomasi, Chaira Cardinali, Virginia Poli, Pier Paolo Alberoni, and Marco Cristoforetti https://doi.org/10.5281/zenodo.13692016
Model code and software
Code for "GPTCast: a weather language model for precipitation nowcasting" Gabriele Franch, Elena Tomasi, and Marco Cristoforetti https://doi.org/10.5281/zenodo.13832526
Interactive computing environment
Jupyter Notebooks for "GPTCast: a weather language model for precipitation nowcasting" Gabriele Franch, Elena Tomasi, and Marco Cristoforetti https://github.com/DSIP-FBK/GPTCast/tree/main/notebooks
Viewed
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
123 | 0 | 0 | 123 | 0 | 0 |
- HTML: 123
- PDF: 0
- XML: 0
- Total: 123
- BibTeX: 0
- EndNote: 0
Viewed (geographical distribution)
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1