the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Measurement Report: Quantifying the Trade-off Between Station Number and Spatial Layout in Sparse GNSS Networks for Calibrating All-Weather FY-4A Precipitable Water Vapor
Abstract. Integrating satellite-derived precipitable water vapor (PWV) provides data with high spatiotemporal resolution, which is crucial for monitoring and forecasting extreme weather. However, current fusion and calibration methods typically relies on dense GNSS networks, hindering application in data-sparse regions. It remains unclear whether improving calibration under sparse conditions depends more on increasing station numbers or optimizing their spatial placement. To address this, we developed a machine learning-based calibration framework for FY-4A all-weather PWV and conducted controlled experiments across China. Our key finding is that for a fixed station budget, a spatially random layout consistently outperforms clustered or geographically biased distributions, reducing RMSE by up to 27 %. While increasing station density improves spatial generalization, with RMSE at independent stations dropping from 3.24 mm to 2.28 mm and bias converging near zero, performance gains saturate beyond approximately 120–160 stations. Spatially, errors under sparse, non-uniform networks concentrate in regions with strong humidity gradients or complex terrain; a uniform layout distributes errors more evenly. Temporally, all calibrated models capture seasonal cycles, with residual errors peaking in summer due to convective activity. This study demonstrates that in sparse network design, maximizing spatial coverage uniformity is more critical than simply adding stations. We thus provide a transferable framework and a quantitative principle for generating reliable satellite PWV products where GNSS observations are limited.
- Preprint
(2360 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 19 Jun 2026)
- RC1: 'Comment on egusphere-2026-1149', Anonymous Referee #3, 19 May 2026 reply
-
RC2: 'Comment on egusphere-2026-1149', Anonymous Referee #4, 19 May 2026
reply
This paper addresses the key issue that current fusion and calibration methods for satellite-derived precipitable water vapor (PWV) products rely excessively on dense GNSS networks. Taking FY-4A all-weather PWV as the research object, a two-stage (reconstruction + calibration) machine learning framework based on random forest (RF) is constructed. The study quantifies the impact of the number and spatial layout of training stations on model accuracy, spatial generalization ability, and temporal stability across mainland China. An actionable conclusion is drawn: “given a fixed station budget, the importance of a spatially uniform random layout outweighs simply increasing the number of stations,” and a performance saturation threshold of approximately 120–160 stations is identified. Overall, the scientific problem is clearly defined and of practical value, the data foundation is solid, and the experimental design is rigorous. In summary, the research content is valuable, the methodology is generally reasonable, and the conclusions have certain general significance. However, further refinement is needed in conceptual rigor and writing quality.
- The geometric definitions of the three layouts in the spatial arrangement experiment (does “surrounding” refer to surrounding the validation station or the study area?) are unclear. Whether the number of stations is strictly identical across the three layouts is not specified. The reconstruction stage directly cites Wang et al. (2026) without providing necessary accuracy information, affecting the interpretability of the calibration stage results.
- There are errors in figure citation (e.g., line 253: “Figs. 2 and 3” should be “Figs. 3 and 4”), and some English expressions are not sufficiently accurate.
- In line 166, the authors state: “Detailed validation of the reconstruction has been reported in previous studies (Wang et al. 2026) and is not repeated here.” However, reconstruction accuracy directly affects the input quality of the calibration stage and consequently all downstream conclusions. It is recommended to at least report the overall RMSE/Bias/R² of the reconstructed FY-4A PWV relative to GNSS PWV in a table.
- The current table mixes hyperparameters for different station counts (30–244) with three spatial layouts in two columns, resulting in poor readability. It is recommended to split this into two separate tables or adopt a clearer multi-column structure.
- Line 192 mentions using Bayesian optimization with 5 iterations for hyperparameter search, but the search space (e.g., range of tree numbers, maximum depth, minimum samples per leaf node) is not specified. Five iterations are generally insufficient for Bayesian optimization. Please justify this choice or extend the iteration count and report hyperparameter convergence.
- Figure 4 uses bar charts to display Bias and RMSE for 16 independent validation stations and 6 models, resulting in extremely high information density. Moreover, the negative bias bars of the raw FY-4A data obscure the details of the calibrated models. Suggestions: (i) Plot the raw FY-4A results separately or use a logarithmic scale; (ii) Add a cross-reference between station numbers and geographic locations (can be annotated in Fig. 1b); (iii) Supplement the standard deviation across the 16 stations for each model to characterize spatial consistency.
- Table 2 shows that RF_120 achieves better Bias (0.02 mm) and RMSE (2.36 mm) than RF_244 (-0.14 mm, 2.37 mm). This phenomenon contradicts the intuition that “more training samples lead to better performance.” A discussion of this phenomenon is recommended at the end of Section 4.1.
- The future outlook in lines 436–438 is too general. It is recommended to add specific directions, such as: (i) transfer learning validation in truly sparse regions (e.g., Tibetan Plateau, Sahara, ocean islands); (ii) comparison with satellite–radiosonde data assimilation methods.
- In the abstract, “hindering application” should be “hindering its application”; “relies” should be “rely.”
- The manuscript mixes terms such as “training station number,” “training sample size,” and “station density.” It is recommended to unify the terminology.
- The titles of Figures 5, 6, and 7 are too brief and lack key information. It is recommended to clearly state in each title what each subfigure (a, b, c, etc.) corresponds to, so that the figures can be understood independently.
- In Table 1, expressions such as “200/Surrounding” do not conform to table formatting standards. It is recommended to rename the “Number of Station” column to “Configuration” and change “200/Surrounding” to a more standard format like “200 (Surrounding).”
Citation: https://doi.org/10.5194/egusphere-2026-1149-RC2 -
RC3: 'Comment on egusphere-2026-1149', Anonymous Referee #2, 24 May 2026
reply
This manuscript focuses on the calibration of FY-4A all-weather precipitable water vapor (PWV) products under sparse GNSS network conditions, and systematically analyzes the effects of training station number and spatial layout on model accuracy, spatial generalization capability, and temporal stability. The research topic is of practical significance, and the experimental design is relatively clear. The results indicate that, under limited GNSS station availability, spatial representativeness and coverage uniformity play an important role in improving model generalization. Overall, the manuscript is well structured, the data sources and validation strategy are reasonable, and the conclusions have potential application value. It is recommended that the manuscript be accepted after minor revisions concerning methodological details, interpretation of results, figure presentation, and language/formatting issues.
Comment 1: Further highlight the core innovation of the manuscript
The authors are encouraged to further highlight the core innovation of the manuscript. The current manuscript provides a relatively complete description of the research background and experimental results, but the main contributions of the study are somewhat scattered. It is suggested that the authors add a more focused summary of the contributions at the end of the Introduction. Specifically, the authors should clearly state that this study is not only intended to construct a GNSS-constrained FY-4A PWV calibration model, but more importantly, to quantitatively evaluate, through controlled experiments, the effects of GNSS training station number and spatial layout on model accuracy, spatial generalization capability, and temporal stability. The proposed station configuration principle, namely prioritizing spatial representativeness and coverage uniformity under limited station availability, should also be emphasized.
Comment 2: Clarify the resolution harmonization and scale consistency of auxiliary variables
In Section 2.2, some variables were subjected to resolution harmonization and scale transformation. The authors are advised to provide further details in the data processing or methodology section regarding how variables such as DEM and NDVI were spatially matched to the FY-4A PWV grid. Adding these explanations would help improve the transparency of the data processing workflow and enhance the rationality and reproducibility of the model input design.
Comment 3: Provide more details on the construction of the three spatial layouts
The authors are encouraged to provide more details on how the three station spatial layouts were constructed. Section 3.3 mentions three station distribution patterns: random, surrounding, and clustered. However, it is still unclear how these layouts were generated. For example, it should be clarified whether the random layout was generated through repeated random sampling and whether the averaged results were reported, how the spatial clustering range of the clustered layout was determined, and whether the same number of training stations was strictly maintained among the three layouts. The authors are advised to supplement these methodological details and explain how the corresponding validation stations were determined, in order to enhance the reproducibility of the experiments and the reliability of the conclusions.
Comment 4: Improve the presentation of figures, equations, and language
The overall structure of the manuscript is clear, and the figures, tables, and equations generally support the methodological description and experimental analysis. However, there are still some minor issues related to figure presentation, equation formatting, and formatting consistency. The authors are advised to further check and revise these aspects to improve the readability, standardization, and overall consistency of the manuscript.
(1) Some figures contain a large amount of information, especially multi-panel result figures. It is suggested that the authors further standardize the size and layout of the figures. For example, the aspect ratios of the subplots in Figures 6 and 7 could be appropriately adjusted to improve the standardization and consistency of the manuscript.
(2) In some figures, the text overlaps with graphical elements. The authors are advised to further check and standardize the font size, legends, color bars, axis labels, and subplot numbering, so that comparisons among different station numbers, model results, and spatial layouts can be presented more clearly.
(3) The authors should carefully check whether the equation formatting and variable descriptions are complete. For example, the explanation of the variables in Equation (4) should be further supplemented, and the symbol formatting in Equations (7)–(10) should be kept consistent. This would help avoid ambiguity when readers interpret the evaluation metrics.
Citation: https://doi.org/10.5194/egusphere-2026-1149-RC3
Data sets
Calibration model for FY-4A PWV based on different GNSS station network Y. Ma https://doi.org/10.5281/zenodo.18751647
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 272 | 22 | 15 | 309 | 18 | 22 |
- HTML: 272
- PDF: 22
- XML: 15
- Total: 309
- BibTeX: 18
- EndNote: 22
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Overall Assessment
This paper investigates the GNSS calibration of the FY-4A all-weather (or all-sky) precipitable water vapor (PWV) product. It systematically conducts controlled experiments focusing on two variables—“number of training sites” and “spatial layout”—within mainland China, based on a random forest framework. The paper’s core contribution—explicitly proposing the quantitative principle of “layout over density” under sparse GNSS network conditions and indicating that performance tends to saturate when the number of sites reaches approximately 120–160 stations—demonstrates strong practical significance for engineering applications and methodological value. It provides useful insights for calibrating satellite-based PWV products in data-sparse regions (such as the Tibetan Plateau, Africa, and oceanic islands). The paper has a complete structure, logically clear experimental design, and its conclusions are generally supported by the figures and tables. The cited literature covers the main advances in MODIS/FY series PWV calibration over the past five years. However, the manuscript requires important revisions in the following areas before it reaches a publishable standard.
Specific Comments