the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
A neural-network based forward operator for the assimilation of microwave satellite observations with LDAS-Monde
Abstract. The knowledge of land-surface variables (LSVs) is essential for an accurate description of the carbon cycle and, hence, for deriving the fraction of anthropogenic CO2 emissions contributing to global warming. For a better representation of LSVs like the leaf-area index (LAI), we assimilate the observations of the satellite microwave sensors SMAP and AMSR2 with our land-data assimilation system (LDAS). Dedicated observation operators using a neural network (NN) are developed to enable a direct assimilation of the measured brightness temperatures. Direct assimilation is not yet an established method due to the incomplete representation of physical emission processes and the associated computational constraints of physical observation operators. We derive an optimal set of predictors for both instruments resulting in a good match between model equivalent and the observations in the testing period, with correlations up to 0.87/0.93 and a total RMSE of 8.2/3.6 K for SMAP/AMSR2, respectively. The implementation of the derived weights into the LDAS is straightforward and is found to lead to a reasonably good performance of the assimilation system, with a stronger improvement of the departures for SMAP than for AMSR2, which can be attributed to the characteristics of the microwave bands of the observations. For the best assimilation experiments, the verification against LAI observations shows an improvement compared to the open loop on a global scale. For different global cropland regions especially prone to droughts, AMSR2 outperforms SMAP in most cases. Nevertheless, the seasonal and subseasonal variability is still not well represented though due to unsolved issues in our model.
- Preprint
(6989 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2026-838', Anonymous Referee #1, 01 Apr 2026
-
RC2: 'Comment on egusphere-2026-838', Anonymous Referee #2, 10 Apr 2026
This manuscript presents a neural-network-based observation operator to enable the direct assimilation of passive microwave brightness temperatures from SMAP (L-band) and AMSR2 (X-band) within LDAS-Monde, with the aim of improving soil moisture and leaf area index (LAI) estimates. The approach links model states and auxiliary predictors to observed TB, and evaluates the impact of assimilating each sensor (separately) over cropland-dominated regions at the global scale. Results show reductions in TB departures and some improvements in LAI—more consistent at the global scale than regionally—while the impact on soil moisture and the physical consistency of the framework are less clear. In addition, I found it difficult to reconcile the relatively weak sensitivity of the NN to LAI (compared to SWI) with the magnitude of the resulting LAI increments, which raises further questions about how information is being propagated through the system.
Main concerns (summary):
I have some reservations about the physical consistency of the observation operator as currently presented. The use of observed LAI, along with static predictors (e.g. latitude/longitude) and even forcing variables within the NN, makes it less clear that this is a clean mapping from model state to observation. In particular, it raises questions about how the operator remains dynamically consistent with the model state, and what exactly the NN is learning (physical relationships vs. geographical or climatological patterns). Clarifying the role of these predictors, and why they are required, would strengthen confidence in the approach.
More generally, I found parts of the methodology difficult to follow, with some key elements either not clearly described or introduced relatively late. In particular, it is not always clear whether observed or modeled LAI is used within the operator, how anomalies are defined, what exactly is included in the control vector, and how the assimilation timing is implemented. These are all important for interpretation and reproducibility, and would benefit from a clearer and more explicit description earlier in the manuscript.
Finally, I am not fully convinced by the experimental design and its interpretation. The rationale for assimilating the different sensors separately is not entirely clear, and it would be useful to understand whether their information content is complementary when used together. Similarly, the role of static and forcing predictors, and the apparent mismatch between NN sensitivities and the resulting analysis increments, are not fully explained. Addressing these points would help clarify how the system is behaving and how robust the conclusions are.
Overall, I find the topic timely and the approach promising, and I appreciate the effort involved in developing and implementing the NN-based operator within LDAS-Monde. However, the concerns outlined above—particularly regarding the physical consistency of the observation operator, the clarity of the methodological description, and aspects of the experimental design—make it difficult to fully interpret and trust the results as currently presented. Addressing these points would significantly strengthen the manuscript.
Major comments:
L-1 "A neural-network based forward operator for the assimilation of microwave satellite observations with LDAS-Monde" The title is somewhat generic and does not fully reflect the scope of the study. In particular, the focus on LAI and soil moisture is not evident, despite being central to the results. It would be helpful to revise the title to better highlight the scientific objectives and outcomes of the work, rather than focusing primarily on the methodological aspect.
L1 Abstract. The abstract would benefit from a more balanced description of the results and limitations. In particular, the improvements appear more consistent for LAI than for soil moisture, and vary depending on spatial scale, which is not clearly reflected. It would also be helpful to more explicitly acknowledge key aspects of the experimental setup (e.g. cropland focus), as these define the scope of the conclusions.
L23 “observing in near-real time and providing gridded data” Not sure what “gridded” adds here — this could be clarified or removed.
L53 “assimilating land-surface variables using SMAP and AMSR2 observations” maybe “assimilating land-surface variables using both SMAP and AMSR2 observations”?
L67 “interpolated linearly onto the model grid” This seems a little basic — should this not be part of the observation operator (ie its model states that are interpolated, not observations), rather than just preprocessing? Could be clarified.
L67 What is the resolution of the model grid?
L95 “converted into anomalies by subtracting the mean from the actual value for each data set and scaling with the standard deviation.” Mean over what exactly (grid cell? time period?) — this should be specified.
L106 “The LAI is computed once per day.” So LAI is a fully prognostic variable?
L106 “interaction between CO2 and photosynthesis with regard to stomatal respiration and plant growth” “stomatal respiration” seems like an unusual description — could this be clarified?
L122 “control variable xj (here LAI and soil moistures of layers 2-7, i.e., 0.01- 1 m depth)” It would help to state more explicitly which model variables are included in the control vector, and why they were chosen. Should surface/soil temperature also be included?
L133 “croplands (Corchia et al., 2023; Shan et al., 2024), we use exclusively grid cells that are dominated by crops.” The focus on croplands should be introduced earlier, as it has important implications for the interpretation of the results.
L160 and Table 1 “variable model fields as predictors” It would be helpful to clarify how this list of predictors was selected.
L172 “In addition to the predictors that have already been employed in similar studies, we achieved a significant improvement of the model equivalent when using forcing data.” Clarify exactly what you mean by "forcing data" here.
L175 I’m not sure about having latitude and longitude as static predictors. What do they really represent?
L185 “strong benefit seen from adding the forcing variables” I assume this refers to meteorological forcing used to drive the model? These seem unusual to include in an observation operator, as they break the mapping from model state to observation, and would benefit from further justification.
L213 “observation operator described in Sect. 4” The observation operator is not described clearly enough here — in particular, it is not clear whether observed or modeled LAI is used. It seems the NN is trained with observed LAI?
L218 “LDAS-Monde is set up to work on a 24 h assimilation window and the observations are assimilated at a given hour. To account for the diurnal variability of the observations, we ran four separate assimilation cycles using 6-hour time bins around each of 0300UTC, 0900UTC, 1500UTC, and 2100UTC, which correspond to the model output times in our setup.” Don’t really understand what you have done here – clarify.
L263 Figure 8 is difficult to read in its current form — in particular, SWI2 is indecipherable.
L315 “static predictors are essential for our data sets” This would benefit from further elaboration — in particular, why a coordinate-independent NN is not sufficient in this case. Why is a coordinate-independent NN seemingly not possible with this approach?
Minor Comments:
The manuscript would benefit from some minor improvements in terminology consistency (e.g. “forward operator” vs. “observation operator”, TB vs brightness temperature), clarification of certain terms (e.g. “model equivalent”), and overall readability (sentence length, figure referencing)
Citation: https://doi.org/10.5194/egusphere-2026-838-RC2
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 79 | 49 | 17 | 145 | 10 | 19 |
- HTML: 79
- PDF: 49
- XML: 17
- Total: 145
- BibTeX: 10
- EndNote: 19
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
General comments:
This study proposes observation operators to enable the assimilation of SMAP 1.4 GHz vertically polarized signals and AMSR2 10.65 GHz vertically polarized signals, with the objective of improving LAI and soil moisture retrievals within LDAS-Monde. The analysis focuses on cropland areas at the global scale. To facilitate the direct assimilation of passive microwave brightness temperatures (TB), the authors develop an observation operator based on neural networks, linking geophysical and static predictors to the observed TB.
The implementation of these forward operators within LDAS demonstrates improvements in the departures, particularly for SMAP observations, likely due to the higher variability of TB at L-band compared to X-band. Regarding LAI, the results indicate a positive impact at the global scale. However, at regional scales, the results appear more mixed and depend on the evaluation metric, with improvements mainly observed in terms of bias.
I fully agree with the authors on the importance of such studies aiming at the direct assimilation of observations, and I support the general methodology presented here. I also recognize the amount of work from designing the observation operator to experiments within an assimilation system.
However, I have two major concerns: 1) I find it limiting that only vertical polarization is exploited, while horizontal polarization is also available and contains valuable information. Although I understand the challenges associated with developing observation operators for H-polarization, it would have been beneficial to include results for this polarization, even if their performance is inferior to that of V-polarization. Presenting such results, at least for observation operator, would provide a more complete picture. Scientific progress also relies on documenting difficulties and limitations, which can guide future developments toward assimilating a broader range of observations; 2) The manuscript is sometimes difficult to follow. Certain information appears to be missing or is introduced only later in the text, which affects the overall clarity and understanding especially for reader which is not exactly in the field. More detailed comments are provided in the specific remarks below. I tried to separate them between major and minor even though some might in the middle.
Specific comments:
Major
Additionally, the readability of the figure could be improved. The ordering of the experiment numbers seems shuffled, making it even more difficult to relate each case to its corresponding input parameters. As a minor suggestion, inverting the color scale (e.g., blue for lower values) could improve interpretability.
Minor
Technical corrections:
References:
[1] Choudhury, B. J., & Tucker, C. J. (1987). Monitoring global vegetation using Nimbus-7 37 GHz Data Some empirical relations. International Journal of Remote Sensing, 8(7), 1085-1090.
[2] Becker, F., & Choudhury, B. J. (1988). Relative sensitivity of normalized difference vegetation index (NDVI) and microwave polarization difference index (MPDI) for vegetation and desertification monitoring. Remote sensing of environment, 24(2), 297-311.
[3] Owe, M., de Jeu, R., & Walker, J. (2002). A methodology for surface soil moisture and vegetation optical depth retrieval using the microwave polarization difference index. IEEE Transactions on Geoscience and Remote Sensing, 39(8), 1643-1654.
[4] Jones, M. O., Jones, L. A., Kimball, J. S., & McDonald, K. C. (2011). Satellite passive microwave remote sensing for monitoring global land surface phenology. Remote Sensing of Environment, 115(4), 1102-1114.
[5] Munchak, S. J., Ringerud, S., Brucker, L., You, Y., De Gelis, I., & Prigent, C. (2020). An active–passive microwave land surface database from GPM. IEEE Transactions on Geoscience and Remote Sensing, 58(9), 6224-6242.
[6] Prigent, C., Jimenez, C., Santoro, M., Cartus, O., & Favrichon, S. (2026). Assessing the combination of passive and active microwave satellite observations (1.4 to 36 GHz) to estimate above ground biomass (AGB) globally. Science of Remote Sensing, 100386.