the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
The Process and Value of Reprogramming a Legacy Global Hydrological Model
Abstract. Global hydrological models (GHMs) improve our understanding of water flows and storage on the continents and have undergone significant advancements in process representation over the past four decades. However, as research questions and GHMs become increasingly complex, maintaining and enhancing existing model codes efficiently has become challenging. Issues such as non-modular design, inconsistent variable naming, insufficient documentation, lack of automated software testing suites and containerization hinder the sustainability of GHM research software as well as the reproducibility of study results obtained with the help of GHMs. Although some GHMs have been reprogrammed to address these challenges, publications focus on evaluating model performance and do not describe the reprogramming process. To support the reprogramming of large geoscientific research software, we present in detail how the GHM WaterGAP was reprogrammed into sustainable research software. Following an agile project management approach, the software was rewritten from scratch in Python with a modular Model-View-Controller architecture, including development practices such as open-source licensing, version control, unit testing, linting, containerization, consistent and meaningful variable naming, and comprehensive in-code and external documentation. Due to the switch from C/C++ in the legacy code to Python, execution time doubled. Our evaluation of the reprogrammed WaterGAP code against software sustainability criteria and FAIR4RS principles indicates that the reprogramming substantially improved the software usability, maintainability, and extensibility, making the reprogrammed WaterGAP software much more sustainable than its predecessor. The new WaterGAP software can be easily understood, applied and enhanced by novice and experienced modelers and is suited for collaborative code development across diverse teams and locations, fostering the establishment of a community GHM.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(1313 KB)
-
Supplement
(4833 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1313 KB) - Metadata XML
-
Supplement
(4833 KB) - BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2025-1096', Rolf Hut, 13 May 2025
See the attached document for my thoughts.
-
AC1: 'Reply on RC1', Emmanuel Nyenah, 08 Jul 2025
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1096/egusphere-2025-1096-AC1-supplement.pdf
-
AC1: 'Reply on RC1', Emmanuel Nyenah, 08 Jul 2025
-
RC2: 'Comment on egusphere-2025-1096', Anonymous Referee #2, 28 May 2025
This paper presents the process and evaluation of re-programming of the legacy WaterGAP model into the new Python platform. While this study is of interest to the hydrological modeling community, the presentation of this manuscript needs to be substantively revised to highlight the scientific merit of the current study. In its current structure, I am afraid the manuscript does not meet the standard of scientific publication and cannot be accepted.
Specific comments
- My major concern of this manuscript is its structure and organization. The manuscript contains sections with too many (10) main sections and levels of subsections. In addition, some subsections contain overlapping information (across different main sections). This tedious structure makes the general reading of the manuscript very difficult. In addition, Section 5.1 and 5.2 contain subsections with identical titles (Architecture and new features), which should be avoided or simply merged together to reduce the level of subsections. In particular, Section 5.2.1 contains a single sentence with a reference to the Supplement and a GitHub link, which is completely unnecessary to make a separate subsection.
- In general, the authors should either avoid using abbreviations in section titles, or spell out their full names. Examples including FAIR4RS (Section 3.1.3 and 6 title), WGHM (5.1), GWSWUSE (5.2).
- Section 3.1.1 “Indicators for best practices in software engineering”: this section seemingly contains information that overlaps with Section 4.3.1 “Best practices for code development”, such as document and version control and automation. Please consider re-structure these parts to make the presentation more precise by, e.g. merging redundant information.
- Figure 2: the right panel of the figure contains major user stories that are purely text. Such information can be better presented using a table or in-text description rather than a graphic presentation (figure).
- Section 4.3 title: get rid of the period after “Code development”.
- Section 4.3.2: narrative in this section contains many hyperlinks to GitHub repositories, which impedes the flow of the text and makes reading difficult. Authors may consider documenting these links in supplementary information (SI) and making reference to SI.
- Table 2: A graphic presentation (e.g. bar plots) is recommended to better show the difference between the outputs of the reprogrammed and legacy software.
- Sections 9 does not contain much in-depth discussion, but rather recap of what has been presented in previous sections. Consider either merging this into concluding remarks (Section 10).
Citation: https://doi.org/10.5194/egusphere-2025-1096-RC2 -
AC2: 'Reply on RC2', Emmanuel Nyenah, 08 Jul 2025
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1096/egusphere-2025-1096-AC2-supplement.pdf
-
RC3: 'Comment on egusphere-2025-1096', Anonymous Referee #3, 11 Jun 2025
Reviewing of the manuscript ‘The Process and Value of Reprogramming a Legacy Global Hydrological Model’ by Emmanuel Nyenah et al. (2025) submitted to Geoscientific Model Development (GMD).
This work focuses on the model development and updates to a legacy global hydrological model, after reprogramming (e.g., different structures and languages, from C/C++ to python), aiming to make this software more sustainable and accessible to model users. The reviewer considers that the authors have done sufficient efforts to develop the numerical model, while it is struggle to me is that whether this kind of article type or writing style is acceptable to the journal GMD. Several years ago, a popular storm surge modelling community suggests how to consider the modelling work aside from publication. While it is now in consistence that modeling work is as important as publications, my concern is that whether this style could be published in a scientific journal.
For a journal, the novelty is very important, and previous limitations should also be pointed out and summarized. Regarding this manuscript, one major concern to me is the structure, which includes overly 10 parts, and should be reduced significantly. This leads to a presentation of the work, instead of proposing an assumption, and solving it in a scientific way. Secondly, it should be clear what is the advantages of this new model, e.g., computational efficiency or stability? The reviewer considers that the model updates should first consider the accuracy, e.g., whether the new model improves the model simulation results. So, several case studies or model applications should be included, and model-to-data comparison of hydrological variables should be conducted and listed.
Here are additional detailed comments and suggestions:
- L124-125: please provide reasons why this work only concerns the reprogramming of GWSWUSE and WGHM models operating at the 30 arc-minute resolution. Why do try it based on 5-arc-minute resolution?
- Avoid using abbreviations in the subsection title, e.g., FAIR4RS, WGHM, GWSWUSE etc.
- Some subsections are too short, and some even start with the subsection title without any contents, e.g., 5.1, 5.2 etc.
- In Table 2, what do the values in the bracket mean?
- Figure 5, the percentage is used for the survey results, so the reviewer suggests including the sample size or the total sample number.
Citation: https://doi.org/10.5194/egusphere-2025-1096-RC3 -
AC3: 'Reply on RC3', Emmanuel Nyenah, 08 Jul 2025
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1096/egusphere-2025-1096-AC3-supplement.pdf
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2025-1096', Rolf Hut, 13 May 2025
See the attached document for my thoughts.
-
AC1: 'Reply on RC1', Emmanuel Nyenah, 08 Jul 2025
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1096/egusphere-2025-1096-AC1-supplement.pdf
-
AC1: 'Reply on RC1', Emmanuel Nyenah, 08 Jul 2025
-
RC2: 'Comment on egusphere-2025-1096', Anonymous Referee #2, 28 May 2025
This paper presents the process and evaluation of re-programming of the legacy WaterGAP model into the new Python platform. While this study is of interest to the hydrological modeling community, the presentation of this manuscript needs to be substantively revised to highlight the scientific merit of the current study. In its current structure, I am afraid the manuscript does not meet the standard of scientific publication and cannot be accepted.
Specific comments
- My major concern of this manuscript is its structure and organization. The manuscript contains sections with too many (10) main sections and levels of subsections. In addition, some subsections contain overlapping information (across different main sections). This tedious structure makes the general reading of the manuscript very difficult. In addition, Section 5.1 and 5.2 contain subsections with identical titles (Architecture and new features), which should be avoided or simply merged together to reduce the level of subsections. In particular, Section 5.2.1 contains a single sentence with a reference to the Supplement and a GitHub link, which is completely unnecessary to make a separate subsection.
- In general, the authors should either avoid using abbreviations in section titles, or spell out their full names. Examples including FAIR4RS (Section 3.1.3 and 6 title), WGHM (5.1), GWSWUSE (5.2).
- Section 3.1.1 “Indicators for best practices in software engineering”: this section seemingly contains information that overlaps with Section 4.3.1 “Best practices for code development”, such as document and version control and automation. Please consider re-structure these parts to make the presentation more precise by, e.g. merging redundant information.
- Figure 2: the right panel of the figure contains major user stories that are purely text. Such information can be better presented using a table or in-text description rather than a graphic presentation (figure).
- Section 4.3 title: get rid of the period after “Code development”.
- Section 4.3.2: narrative in this section contains many hyperlinks to GitHub repositories, which impedes the flow of the text and makes reading difficult. Authors may consider documenting these links in supplementary information (SI) and making reference to SI.
- Table 2: A graphic presentation (e.g. bar plots) is recommended to better show the difference between the outputs of the reprogrammed and legacy software.
- Sections 9 does not contain much in-depth discussion, but rather recap of what has been presented in previous sections. Consider either merging this into concluding remarks (Section 10).
Citation: https://doi.org/10.5194/egusphere-2025-1096-RC2 -
AC2: 'Reply on RC2', Emmanuel Nyenah, 08 Jul 2025
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1096/egusphere-2025-1096-AC2-supplement.pdf
-
RC3: 'Comment on egusphere-2025-1096', Anonymous Referee #3, 11 Jun 2025
Reviewing of the manuscript ‘The Process and Value of Reprogramming a Legacy Global Hydrological Model’ by Emmanuel Nyenah et al. (2025) submitted to Geoscientific Model Development (GMD).
This work focuses on the model development and updates to a legacy global hydrological model, after reprogramming (e.g., different structures and languages, from C/C++ to python), aiming to make this software more sustainable and accessible to model users. The reviewer considers that the authors have done sufficient efforts to develop the numerical model, while it is struggle to me is that whether this kind of article type or writing style is acceptable to the journal GMD. Several years ago, a popular storm surge modelling community suggests how to consider the modelling work aside from publication. While it is now in consistence that modeling work is as important as publications, my concern is that whether this style could be published in a scientific journal.
For a journal, the novelty is very important, and previous limitations should also be pointed out and summarized. Regarding this manuscript, one major concern to me is the structure, which includes overly 10 parts, and should be reduced significantly. This leads to a presentation of the work, instead of proposing an assumption, and solving it in a scientific way. Secondly, it should be clear what is the advantages of this new model, e.g., computational efficiency or stability? The reviewer considers that the model updates should first consider the accuracy, e.g., whether the new model improves the model simulation results. So, several case studies or model applications should be included, and model-to-data comparison of hydrological variables should be conducted and listed.
Here are additional detailed comments and suggestions:
- L124-125: please provide reasons why this work only concerns the reprogramming of GWSWUSE and WGHM models operating at the 30 arc-minute resolution. Why do try it based on 5-arc-minute resolution?
- Avoid using abbreviations in the subsection title, e.g., FAIR4RS, WGHM, GWSWUSE etc.
- Some subsections are too short, and some even start with the subsection title without any contents, e.g., 5.1, 5.2 etc.
- In Table 2, what do the values in the bracket mean?
- Figure 5, the percentage is used for the survey results, so the reviewer suggests including the sample size or the total sample number.
Citation: https://doi.org/10.5194/egusphere-2025-1096-RC3 -
AC3: 'Reply on RC3', Emmanuel Nyenah, 08 Jul 2025
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-1096/egusphere-2025-1096-AC3-supplement.pdf
Peer review completion










Journal article(s) based on this preprint
Data sets
March 7, 2025 (v1)DatasetOpen The Process and Value of Reprogramming a Legacy Global Hydrological Model Emmanuel Nyenah, Petra Döll, Martina Flörke, Leon Mühlenbruch, Lasse Nissen, and Robert Reinecke https://doi.org/10.5281/zenodo.14988257
Model code and software
Reprogrammed version of the WaterGAP V2.2e and Groundwater Surface Water Use (GWSWUSE) linking model. Emmanuel Nyenah, Petra Döll, Martina Flörke, Leon Mühlenbruch, Lasse Nissen, and Robert Reinecke https://doi.org/10.5281/zenodo.14988011
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
796 | 103 | 19 | 918 | 50 | 14 | 40 |
- HTML: 796
- PDF: 103
- XML: 19
- Total: 918
- Supplement: 50
- BibTeX: 14
- EndNote: 40
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Emmanuel Nyenah
Petra Döll
Martina Flörke
Leon Mühlenbruch
Lasse Nissen
Robert Reinecke
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1313 KB) - Metadata XML
-
Supplement
(4833 KB) - BibTeX
- EndNote
- Final revised paper