the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
The CMIP6-downscaled CORDEX-Southeast Asia (SEA) ensemble: evaluation and benchmarking for megacities of SEA
Abstract. A 21-member ensemble of regional climate simulations has been produced for Southeast Asia (SEA) by dynamically downscaling Coupled Model Intercomparison Project Phase 6 (CMIP6) Global Climate Models (GCMs) under the World Climate Research Programme’s Coordinated Regional Climate Downscaling Experiment (CORDEX). The ensemble was generated by several modelling institutes using three regional climate models (RCMs) with eight distinct model configurations, resulting in a total of 62 simulations spanning the historical period and multiple future emissions scenarios. Model performance for mean, daily maximum/minimum temperature, and precipitation was evaluated against multiple observations at annual, seasonal, and daily time scales over SEA and its two subregions: Mainland and Maritime Continent (MC). Despite large observational uncertainties in precipitation intensity, the CMIP6 CORDEX-SEA ensemble captures the spatial and seasonal rainfall distribution reasonably well but tends to substantially overestimate observed rainfall. Wet biases, evident in about two-thirds of the models, are regionally and seasonally heterogeneous and larger over monsoon-dominated regions and seasons (e.g., MC during November–April and the Mainland during May–October). All RCMs showed widespread, statistically significant cold biases in daily mean temperature, which were largest during boreal winter, over the Mainland, and in simulations that have significant wet biases. These cold biases primarily arise from the models’ underestimation of daily maximum temperature. The MC remains a challenging region since models struggle to accurately capture the spatial variability of rainfall and the internal variability of temperature. A standardised benchmarking framework was applied to precipitation and temperature, which ultimately identified 15 historical simulations that met our a priori model performance expectations. Analysing the range of future projections and model independence shows that simulations from the same RCM family exhibit similar bias structures, highlighting the importance of RCM setup and the selection of statistically independent models. From this process, eight simulations spanning three RCM configurations were selected for further kilometre-scale dynamical downscaling over megacities of SEA.
- Preprint
(4230 KB) - Metadata XML
-
Supplement
(4837 KB) - BibTeX
- EndNote
Status: open (until 13 May 2026)
-
CEC1: 'Comment on egusphere-2026-1325 - No compliance with the policy of the journal', Juan Antonio Añel, 28 Mar 2026
reply
-
AC1: 'Reply on CEC1" Revised Code and Data availability"', Phuong Loan Nguyen, 29 Mar 2026
reply
Dear Chief Editor,
Thank you very much for your careful assessment of our manuscript and for highlighting the issues related to compliance with the GMD's Code and Data Policy. We sincerely appreciate your detailed guidance and fully understand the importance of ensuring transparency, reproducibility, and long-term accessibility of code and data.
In response to your comment, we have taken substantial steps to bring our manuscript into compliance with the GMD Code and Data Policy. The Code and Data Availability section has been fully revised, and the required repositories and persistent identifiers have now been provided as the following:
Code and Data Availability
Code for benchmarking the CMIP6-downscaled CORDEX-Southeast Asia (SEA) performance (Isphording, 2024) is available at https://doi.org/10.5281/zenodo.8365065 (Isphording, 2023).
Model source codes used in this study are available as follows:
- RegCM4 is available at https://zenodo.org/records/4603556 (Coppola et al., 2021)
- RegCM5 is available at https://zenodo.org/records/17348623 (Giorgi et al., 2024)
- CCAM-2307 source code is available at https://doi.org/10.5281/zenodo.19303856 (Nguyen et al., 2026)
- HadGEM3-RA07 configuration and model information are available through https://doi.org/10.5194/gmd-16-1713-2023 (Bush et al., 2023)
The supporting dataset for this study is archived in Zenodo:
Nguyen, P. L., Alexander, L., Ngo-Duc, T., Cruz, F. A., Santisirisomboon, J., Juneng, L., Permana, D. S., Chung, J. X., Dado, J. M., McGregor, J. L., Redmond, G., Po, W., Tangang, F., Phan-Van, T., Truong, S. C. H., Thatcher, M., Trinh-Tuan, L., Ma'rufah, U., Tibay, J., White, S. (2026). Dataset for "The CMIP6-downscaled CORDEX-Southeast Asia (SEA) ensemble: evaluation and benchmarking for megacities of SEA" (Version v1). Zenodo. https://doi.org/10.5281/zenodo.19279601
This repository contains representative subsets of CMIP6-downscaled CORDEX-SEA simulations and observational datasets used in the analysis, which are sufficient to reproduce the evaluation workflow and figures presented in the manuscript.
Full CMIP6-downscaled CORDEX-SEA simulation data are being made available through the Earth System Grid Federation (ESGF) and the Australian National Computational Infrastructure (NCI) and are planned to be publicly released in 2026 due to their large data volume.
The original observational and reanalysis datasets used in this study are publicly available from the following sources:
- APHRODITE version V1101R1 and V1101XR (Yatagai et al., 2012): https://www.chikyu.ac.jp/precip/english/index.html
- SACA&D (Van den Besselaar et al., 2017): https://sacad.bmkg.go.id/
- CRU TS4.08 (Harris et al., 2020): https://crudata.uea.ac.uk/cru/data/hrg/
- ERA5 (Hersbach et al., 2020): https://doi.org/10.24381/cds.bd0915c6 (Hersbach et al., 2023)
- Berkeley Earth Surface Temperature (BEST) (Rohde and Hausfather et al., 2020): https://berkeleyearth.org/data/
- CHIRPS-v2, REGEN_ALL, and GPCC_FDD_2022 from the FROGs database (Roca et al., 2019): https://frogs.ipsl.fr/
Note that this part, along with full citation for the main DOI links included in the section "Code and data availability" will be added to the "References" section of the manuscript in the next revision.
We hope that this revision will address the concerns raised and bring the manuscript into compliance with the GMD Code and Data Policy.
Thank you once again for your guidance, and we are happy to make any further adjustments if required.
Kind regards,
Phuong Loan Nguyen
on behalf of all co-authorsCitation: https://doi.org/10.5194/egusphere-2026-1325-AC1 -
CEC2: 'Reply on AC1', Juan Antonio Añel, 30 Mar 2026
reply
Dear authors,
Many thanks for your quick reply. Unfortunately, you have not addressed the issues with the data. You continue to say that the output of the simulations will be available along 2026, which we can not accept. In such a case, the sensible thing to do regarding your manuscript is to stall the review process, and wait until the data is public. Actually, the good practice would have been not submit the manuscript until the data are available. As I mentioned in my previous comment, If the size of the simulations do not make possible to store them elsewhere, then we would expect that you state it, providing the total size of the data, and that you share a repository that contains at least a sensible amount of sample files containing the necessary variables.
Also, for the "original observational and reanalysis datasets" you continue to cite webpages that are not repositories, and we can not accept. You should store the observational and reanalysis data that you have used in an appropriate repository, unless the term and conditions of such datasets do not allow it.
Therefore, please, reply to this comment addressing the mentioned issues.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2026-1325-CEC2 -
AC2: 'Reply on CEC2', Phuong Loan Nguyen, 31 Mar 2026
reply
Dear Chief Editor,
Thank you very much for your patience and for your clarification. As requested, we have updated the Code and Data Availability section of the manuscript to fully address the concerns raised in your previous comment.
In particular, we have now archived the observational datasets used in our analysis (APHRODITE, REGEN_ALL, GPCC_FDD, CHIRPSv2, SCADA, BEST, ERA5) on Zenodo: https://doi.org/10.5281/zenodo.19334323. We have also uploaded the simulation data, including all variables used in the analysis (pr, tas, tasmax, tasmin), to Zenodo at https://doi.org/10.5281/zenodo.19334179. The Code and Data Availability section has been revised accordingly and is provided below for your review.
Code and Data Availability
Code for evaluation and benchmarking of the CMIP6-downscaled CORDEX-Southeast Asia (SEA) ensemble is available at https://doi.org/10.5281/zenodo.8365065 (Isphording, 2023).
Model source codes used in this study are available as follows:
- RegCM4: https://zenodo.org/records/4603556 (Coppola et al., 2021)
- RegCM5: https://zenodo.org/records/17348623 (Giorgi et al., 2024)
- CCAM-2307: https://doi.org/10.5281/zenodo.19303856 (Nguyen et al., 2026a)
- HadGEM3-RA07: https://doi.org/10.5194/gmd-16-1713-2023 (Bush et al., 2023)
All simulations and variables used in this study are archived in repositories as follows:
- Simulation data: https://doi.org/10.5281/zenodo.19334179 (Nguyen et al., 2026b)
- Reference datasets: https://doi.org/10.5281/zenodo.19334323 (Nguyen et al., 2026c)
We hope that these updates fully meet GMD’s requirements for transparency, reproducibility, and compliance with the journal’s Code and Data Policy. Please let us know if any further adjustments or additional information is required. We would be happy to make any further revisions if needed.
Thank you again for your guidance and support in ensuring that the manuscript complies with GMD’s Code and Data Policy.
Best regards,
Phuong Loan Nguyen
On behalf of all authorsCitation: https://doi.org/10.5194/egusphere-2026-1325-AC2 -
CEC3: 'Reply on AC2', Juan Antonio Añel, 01 Apr 2026
reply
Dear authors,
Thanks for addressing this issue so quickly. I have checked the repositories and we can consider now the current version of your manuscript in compliance with the code policy of the journal.
Juan A. Añel
Geosci. Model Dev. Executive Editor
Citation: https://doi.org/10.5194/egusphere-2026-1325-CEC3
-
AC2: 'Reply on CEC2', Phuong Loan Nguyen, 31 Mar 2026
reply
-
AC1: 'Reply on CEC1" Revised Code and Data availability"', Phuong Loan Nguyen, 29 Mar 2026
reply
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 147 | 62 | 9 | 218 | 18 | 9 | 8 |
- HTML: 147
- PDF: 62
- XML: 9
- Total: 218
- Supplement: 18
- BibTeX: 9
- EndNote: 8
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.html
The main problem with your manuscript is that the data corresponding to the simulations described in your manuscript are not available. It must be clear that manuscripts can only be submitted to Geosci. Model Dev. and accepted for Discussions and peer review after all the code and data is available publicly. Also, from a strict point of view, we can not accept the ESGF as a trusted repository for long-term archival, and you state that your data will be hosted in it. Meanwhile we can grant you an exception in this case to this rule due to the fact that probably the size of your simulations can not be hosted adequately elsewhere, at least, you should store representative data of the full model outputs in a repository we can accept according to the policy of the journal, and do it before submitting the manuscript.
Also, for your work you have used many models, and do not provide a repository containing the code of each one of them, which our policy requires.
Additionally, in the "Code and data availability" section you cite several sites for other datasets, which are not long-term repositories we can accept because they do not comply with GMD’s requirements for a persistent data archive:
* They do not appear to have a published policy for data preservation over many years or decades (some flexibility exists over the precise length of preservation, but the policy must exist).
* They do not appear to have a published mechanism for preventing authors from unilaterally removing material. Archives must have a policy which makes removal of materials only possible in exceptional circumstances and subject to an independent curatorial decision,
* They do not appear to issue a persistent identifier such as a DOI or Handle for each precise dataset.
If we have missed a published policy which does in fact address this matter satisfactorily, please post a response linking to it. If you have any questions about this issue, please post them in a reply.
The GMD review and publication process depends on reviewers and community commentators being able to access, during the discussion phase, the code and data on which a manuscript depends, and on ensuring the provenance of replicability of the published papers for years after their publication. Please, therefore, publish your code and data in one of the appropriate repositories and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible. We cannot have manuscripts under discussion that do not comply with our policy.
The 'Code and Data Availability’ section must also be modified to cite the new repository locations, and corresponding references added to the bibliography.
I understand that the work that this involves in the case of your manuscript could be substantial. However, I must note that if you do not reply to this comment and perform reasonable efforts to bring your manuscript in compliance with the policy of the journal fixing the above mentioned issues, we cannot continue with the peer-review process or accept your manuscript for publication in GMD.
Juan A. Añel
Geosci. Model Dev. Executive Editor