the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
GraphIDW: Incorporating spatial autocorrelation in satellite–gauge precipitation merging using graph neural networks over a tropical region
Abstract. Ground-based rain gauges remain the benchmark for accurate precipitation measurement; however, their sparse spatial distribution limits the representation of rainfall heterogeneity. Satellite-based Precipitation Products (SPPs) provide consistent spatial coverage but are often affected by retrieval errors and regional biases, restricting their direct use in local-scale hydrological applications. To overcome these limitations, Precipitation Data Merging (PDM) techniques integrating gauge and satellite observations have gained prominence. This study introduces a novel Machine Learning (ML) framework, GraphIDW, which combines Graph Neural Networks (GNNs) with Inverse Distance Weighting (IDW) interpolation to explicitly incorporate spatial autocorrelation into the merging process, addressing a major limitation of traditional ML-based PDM approaches. The framework was evaluated across the Wet Zone of Sri Lanka from 2001 to 2015 using two state-of-the-art SPPs (IMERG and CHIRPS) together with ground observations. IMERG data (0.1°) were first downscaled to 0.05° using CHIRPS, after which the downscaled product was merged with gauge observations through GraphIDW. A total of 60 gauges (70 %) were used for training and 28 (30 %) for validation. Results show that GraphIDW outperforms conventional ML algorithms, including Random Forest, Artificial Neural Network, Support Vector Regression, and XGBoost. It achieved the highest probability of detection (0.97) and reduced root mean square error (RMSE) and mean absolute error (MAE) by 13 %–41 % and 9 %–36 %, respectively, compared with the original SPPs. The results demonstrate that explicitly accounting for spatial dependence through graph-based learning significantly improves precipitation estimation, particularly in regions characterized by strong spatial heterogeneity. By embedding spatial autocorrelation directly into the merging process, GraphIDW provides a robust and computationally efficient framework for generating high-resolution rainfall datasets that are better suited for hydrological analysis in complex climatic and topographic settings.
- Preprint
(2892 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 19 Apr 2026)
- RC1: 'Comment on egusphere-2025-6551', Anonymous Referee #1, 11 Mar 2026 reply
-
RC2: 'Comment on egusphere-2025-6551', Anonymous Referee #2, 15 Apr 2026
reply
This paper compares a few different machine learning (ML) approaches to merging available station information with satellite precipitation estimates to better capture the spatial and temporal patterns of rainfall over a region of Sri Lanka. Overall, I think the paper is well-written and the study is well-done. The findings support that blending the stations using any of the ML-based approaches improves the rainfall estimates when compared to stations that were withheld from the blending routine, and that GraphIDW slightly outperforms the other techniques for this region. With that in mind, I'm suggesting minor revisions to the manuscript, which I think will improve the overall value of the paper.
One thing I think is important when employing machine learning techniques is to justify why these techniques are needed when studying/addressing the issue identified here. I think the paragraph starting at line 51 does a nice job of addressing some of these improvements and different techniques that have been imposed. My guess is that with the relatively dense station network available for this paper’s study region, an intelligent IDW approach might be all that is required to achieve very similar results as the ML techniques used in the paper. Some of this seems to be born out later in the paper when it is shown that performance is about the same when using only 40% of the available stations compared to 70% that was used in the study.
My biggest criticism of the paper might be in the formulation of the “IMERG-daily,0.05” product put forth in equation 1. If I’m understanding this equation correctly, the fraction on the right side of the equation results in each day’s proportion of the monthly total, calculated independently for every 0.1-degree pixel. This daily proportion is then multiplied by the monthly total at the corresponding CHIRPS 0.05-degree pixel. This results in more of a CHIRPS-daily product than an IMERG-daily product, because the sum of the daily values will result in the CHIRPS monthly total. If there are big differences between IMERG and CHIRPS, imagine a monthly IMERG total of 50mm while CHIRPS is 250mm, the result of equation 1 is going to result in a data product that is going to be more like the 250mm rainfall total.
As I was thinking about a better formulation of equation 1, I couldn’t figure out how best to use the higher spatial resolution of CHIRPS combined with the better temporal resolution of IMERG. The thing to do may be to look at the fraction of each 0.05-degree CHIRPS monthly pixel relative to a 0.1-degree version of CHIRPS, and use that proportion to downscale the IMERG monthly totals to the 0.05-degree value, and then use that in the equation 1. While I’m not sure the paper needs to be changed, please address this in the response to the reviewer.
The other downside of the current formulation is that CHIRPS has station data already included in the product, especially before 2007 (if using CHIRPS version 3) or so. As a result you may be normalizing your rainfall estimate to a product that has station data included, and then using that as the “baseline” product in your analysis. Another alternative would be to use CHIRP (no “S”) which has the same spatial resolution and a very similar mean, but is made only using satellite data and without the addition of stations. However, none of this changes the subsequent analysis or methods, but it might mean that your “IMERG” baseline would have some different values in the skill assessments.
I really like the residual interpolation step that is included in GraphIDW. I wonder how much improved the satellite-based estimate would be with the residual IDW added at the end? Similarly, how would the other ML techniques fare with that extra “corrective” step? Given the slim differences in skill metrics across the different ML techniques, I could imagine the residual IDW improving some of the other approaches such that their skill metrics were better than GraphIDW. I would suggest exploring this, and potentially including it in the results.
Conversely, I think you could look at the estimate from just using the GNN technique, without the IDW, to look at the improvement over the IMERG alone. Then, compare that with the final GraphIDW output, and it would give you an estimate of the improvement of each of these components to the overall estimate. If you wanted, that GNN-only estimate could be compared to the other ML techniques to get an idea for how the IDW might help other estimates as well.
This study is using a relatively large number of stations compared to some regions that are more reliant on satellite rainfall estimates. That is very useful for producing such an accurate result, but it may be that the results are different in a significantly more data sparse region. You touch on this a little bit in the discussion, and I think it would be interesting to see how few stations are needed before there is a notable drop in estimate performance.
If you are looking to cut any of the material to make the paper more concise, I might suggest section 4.4. The evaluation included here is brief and the graphics are not particularly compelling. Certainly it is important to show that the technique can capture extreme events, but maybe a little more explanation of the value would be beneficial, because as it is there isn’t quite enough to fully appreciate what is being shown in the maps/barplots.
Overall, I thought the paper is very well-written. The description of the GraphIDW technique, other ML techniques, and the evaluation tests are very clear and easy to follow. I think the results of the different evaluation tests were clearly presented, and the graphics were helpful in displaying the relative merits of each of the techniques. I do think revisiting the formulation of the satellite estimate, and showing the value of the IDW step will dramatically improve the value of the paper and the persuasiveness of the GraphIDW technique.
Citation: https://doi.org/10.5194/egusphere-2025-6551-RC2
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 271 | 113 | 19 | 403 | 42 | 42 |
- HTML: 271
- PDF: 113
- XML: 19
- Total: 403
- BibTeX: 42
- EndNote: 42
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
This manuscript presents a novel study applying graph-based machine learning methods to precipitation estimation. The use of Graph Neural Networks (GNNs) is becoming increasingly popular in the Earth sciences, particularly for problems involving non-Euclidean data structures. In this regard, the study addresses an important topic and has the potential to contribute to the growing research exploring graph-based approaches in the spatial mapping of precipitation.
Overall, the paper is well structured and generally easy to follow, with a clear presentation of the study objectives and methodology. However, several major issues related to the methodology, evaluation framework, and clarity of some sections should be addressed before the manuscript can be considered for further review. These comments are outlined in the section below. Additional minor comments, including grammar, typographical corrections, and reference-related issues (like missing references), will be provided in a subsequent review round after the major concerns have been addressed.
Minor Comments
Major Comments