All-Sky Temperature and Humidity Retrieval from the MWRI-RM Onboard the FY-3G Satellite

Liu, Minghua; Han, Wei; Yang, Yunfan; Sun, Haofei; Yin, Ruoying

doi:10.5194/egusphere-2025-680

Preprints

https://doi.org/10.5194/egusphere-2025-680

Preprints

17 Mar 2025

| 17 Mar 2025

All-Sky Temperature and Humidity Retrieval from the MWRI-RM Onboard the FY-3G Satellite

Minghua Liu, Wei Han, Yunfan Yang, Haofei Sun, and Ruoying Yin

Abstract. To investigate the application of deep learning in satellite remote sensing, this study employs brightness temperature observations from the remapped Micro-Wave Radiation Imager-Rainfall Mission (MWRI-RM) onboard the Fengyun-3G (FY-3G) satellite as input data, while temperature and humidity profiles (ranging from 1000 hPa to 100 hPa) obtained from ERA5 reanalysis data are used as label data. An Advanced Residual Convolutional Neural Network (AR-CNN) model was developed to retrieve atmospheric temperature and humidity profile data. The results show that: (1) The retrieval of temperature profiles has a root mean square error (RMSE) of approximately 1.24 K, and the RMSE for humidity profile retrieval is 12.98 %. (2) A comparison between predicted and labeled samples reveals consistent results for temperature retrieval but inconsistencies in high-humidity regions, indicating that further refinement of the model is needed in these areas. (3) Gradient backpropagation and perturbation experiments demonstrate that channels near 118 GHz are critical for retrieving upper-level temperatures, and those near 183 GHz mainly affect mid-to-lower atmospheric temperature retrieval. For humidity, channels near 183 GHz are essential for detecting mid-to-lower water vapor, and the 118 GHz oxygen absorption channel is indispensable for upper-level humidity retrieval. This suggests that the model possesses a certain degree of interpretability and stability.

Received: 15 Feb 2025 – Discussion started: 17 Mar 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 8585 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (8585 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

25 Mar 2026

All-sky temperature and humidity retrieval from the MWRI-RM onboard the FY-3G satellite

Minghua Liu, Wei Han, Yunfan Yang, Haofei Sun, and Ruoying Yin

Atmos. Meas. Tech., 19, 2061–2077, https://doi.org/10.5194/amt-19-2061-2026,https://doi.org/10.5194/amt-19-2061-2026, 2026

Short summary

Minghua Liu, Wei Han, Yunfan Yang, Haofei Sun, and Ruoying Yin

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-680', Anonymous Referee #1, 10 Apr 2025
The paper used machine learning and microwave channels to retrieve temperature and humidity profiles. However, the paper did not provide enough evidence in the analysis to support the arguments. There is nothing new in the paper in retrieving temperature and humidity profiles using machine learning. The paper should be thoroughly edited, and the authors should provide more evidence.
The arrangement of the paragraph is chaotic. It seems put a bunch of unedited paragraphs together.

Not enough paragraph to support of using AR-CNN architecture. What if using other machine learning architecture? The authors did not show the performance in the retrieval algorithm validation.

A lot of the arguments mentioned in the results are based on the speculation. No other future examination. No discussion to the appendix figures.

What is the purpose of making an ERA5 temperature and humidity estimator? At best, the model is to get ERA5 temperature and humidity profiles without improving the biases and uncertainties inherent in ERA5.
Citation: https://doi.org/10.5194/egusphere-2025-680-RC1
- AC1:
  'Reply on RC1', Wei Han, 23 Apr 2025
  Thank you for your insightful feedback. We appreciate your concerns and address them as follows:
  Novelty and Evidence for AR-CNN
  
  Our study focuses on evaluating the contributions of the 118 GHz and 183 GHz microwave channels to temperature and humidity profile retrievals, leveraging the enhanced capabilities of the FY-3G/MWRI-RM. While machine learning (ML) has been applied to atmospheric retrievals, the novelty lies in the Advanced Residual CNN (AR-CNN) architecture, which integrates residual blocks and adaptive pooling to better capture spatial features from 26-channel microwave observations. To validate AR-CNN’s superiority, we compared it with MLP and standard CNN models (not explicitly tabulated in the original manuscript). The AR-CNN achieved a lower RMSE (1.24 K for temperature, 12.98% for humidity) compared to MLP (1.52 K, 15.3%) and CNN (1.38 K, 14.1%), demonstrating its effectiveness in handling high-dimensional, nonlinear satellite data.
  
  Structure and Methodology Clarification
  
  Regarding the arrangement of the paragraph, we apologize for any confusion caused by the unedited paragraphs. We will thoroughly edit the manuscript to ensure a clear and logical structure. We will also add more paragraphs to support the use of the AR-CNN architecture and provide a detailed discussion of the results.
  
  Physical Interpretability and Appendix Figures
  
  The results (Sections 4.1–4.2) are grounded in gradient backpropagation and perturbation experiments, which quantitatively link 118 GHz and 183 GHz channels to their respective atmospheric layers (e.g., 118 GHz for upper-level retrievals, 183 GHz for mid-to-lower layers). These align with Jacobian analyses (Figs. 5, 9) and confirm the model’s interpretability. The appendix figures (e.g., global maps in Fig. A1–A2) visually validate spatial consistency between retrievals and ERA5, particularly in regions with high humidity gradients or dynamic processes. Future revisions will explicitly reference these figures in the discussion.
  
  Purpose of ERA5-Based Retrieval
  
  The goal is not to replicate ERA5 but to explore satellite-driven retrievals with higher spatiotemporal resolution. ERA5 serves as a reanalysis benchmark due to its global coverage and assimilation of multi-source data. While ERA5 uncertainties exist, our model reduces systematic biases and provides independent retrievals for regions with sparse radiosonde data.
  
  Citation: https://doi.org/10.5194/egusphere-2025-680-AC1
RC2:
'Comment on egusphere-2025-680', Anonymous Referee #2, 29 Apr 2025

This paper presents a algorithm for use on the microwave imager instrument aboard FY-3G to retrieve profiles of temperature and humidity. The algorithm is a convolutional neural network (CNN) that was trained with ERA5 data, and because it is in the microwave part of the spectrum it can retrieve all sky conditions. Neural network retrievals are actively being improved in the literature, especially multidimensional algorithms such as convolutional neural networks, so this paper in its ideas is suitable for publication in AMT.
Broader feedback:
From what I can discern, the validation here is strictly whether the CNN is able to reproduce the input data based on the sample learning period. While it is good to utilize this as a check of the algorithm behavior, there should be a validation performed with observations from alternative sensors/algorithms or ground-based observations (including ground-launched radiosondes).
Why was relative humidity (RH) chosen over water vapor mixing ratio or specific humidity? While I am not arguing that RH is invalid to retrieve, RH is not an absolute quantity and is dependent on temperature (which entangles retrieval of waver vapor and temperature). RH makes it more difficult to compare results against other literature in the microwave remote sensing community. Retrieving a water vapor value specifically would make the two values more distinct and provide better illumination of what channels are sensitive to which physical values.
The indication that 183 GHz provides higher information content for temperature sounding than all but one 118 GHz channel is a surprise to me (this statement is based on my interpretation of Figure 5 along with corresponding text). One citation is given for the reasoning behind this but is unfortunately in another language and inaccessible to me. Also, L239-242 states water vapor attenuates the signal (and cites a paper describing an infrared retrieval paper), which seems contrary to the stated conclusion. Detailed information about this analysis methodology and better tying the conclusions to relevant principles should be provided.
The methodology explanation does not line up with explanations in the results sections. Up until section 4.2, I thought the retrieval was only over ocean (viz. L137) given that the training database was constructed from ocean only profiles and the validation dataset is part of the training database. The appendix plots also show only over ocean data. Yet Section 4.2 describes land cover characteristics when discussing the humidity retrieval performance. More detailed information about the database construction (perhaps a spatial map of where the database is compiled from) should be provided and the text clarified to line up with the methods.
Figures 3 (c-h) and 7 (c-h) are not very illustrative because of the density of points. Rather than shading by absolute error, a two-dimensional histogram should be performed and shading indicate the counts in the bins. Additionally, the line plotted on each subplot has no label -- whether it is a least squares fit or a 1-to-1 slope, that should be indicated in the figures. And speaking of least squares fit, are any fitting statistics available for the scatter plots?
In surveying the literature, much of the historical work mentioned was on infrared retrievals. More citations on microwave regime development should be included beyond why the regime is important for all-sky retrievals.
Below is more specific copyediting feedback (which can be rejected where deemed), but overall the wording should be refined to focus on the scientific conclusions drawn from the results at hand without extra pomp or overbroad explanations.
The introduction section was difficult to parse. It is a three page-long paragraph. Consider splitting it up into thematic paragraphs. (This may be simply because of the template used.)
L47-50: Is it the case that the spatiotemporal resolution is higher indeed for microwave instruments, or that the spatial sampling and therefore temporal resolution is higher given all-sky conditions sensing?
L55+: I would not personally exclude neural networks when defining conventional satellite retrievals as done in the separation. While neural networks are still being developed and improved, they are not a super novel concept. In example, L85 cites a neural network retrieval from 15 years ago.
L79-81: This sentence has too much puffery.
Table 1 should be properly cited with a note to the references section; the current listing in the title is inadequate.
L135-139: The first sentence of this paragraph should be reworded to indicate that brightness temperatures from all channels are used--or whatever methodology is appropriate. The static values should be further described.
L142-L144: There is some redundancy in these two sentences about interpolation.
Figure 2: Font size should be increased on within-figure text.
L185-186: This wording is awkward.
L187-188: I would argue this is more a matter of instrument sensitivity vertically than model performance.
L198-200: I would rephrase or better convey the argument here.
L208-219: This is a very broad set of statements that, while touch on the fundamental uncertainties and error sources in remote sensing, are not fully tied into the results at hand. Further, how would you plan to account for this or mitigate this in the future?
L228-231: This is a very abrupt statement as an off-hand comment in parentheses along with the introduction sentence. This should be expanded into a paragraph of its own methodology as I do not understand what has occurred. What is model.conv1? What spatial dimensions?
L231-L246: This paragraph oscillates between highly detailed and very broad. I would reconsider how it is phrased and the statements used to weave together the points. Overall, I understand and appreciate the information content study here.
Figure 6: To directly compare all of these perturbations, the x axes should be the same on all the plots.
L272-282: Similar to L208-219, this is another very broad set of statements that do not fully tie into the results or indicate how/if you plan to address these concerns in the future.
L286-288: These sentences are partially redundant.
L290-295: Repeated phrasing. Further, I would argue that the "complex meteorological process" are the local humidity distributions undergoing alteration--not that the humidity change is an indirect consequence of some other phenomenon.
L307-329: More redundancies, especially when including explanations earlier in this section.
Figure 10: To directly compare all of these perturbations, the x axes should be the same on all the plots.

Citation: https://doi.org/10.5194/egusphere-2025-680-RC2
- AC2: 'Reply on RC2', Wei Han, 23 May 2025
  
  Overview
  This paper presents a algorithm for use on the microwave imager instrument aboard FY-3G to retrieve profiles of temperature and humidity. The algorithm is a convolutional neural network (CNN) that was trained with ERA5 data, and because it is in the microwave part of the spectrum it can retrieve all sky conditions. Neural network retrievals are actively being improved in the literature, especially multidimensional algorithms such as convolutional neural networks, so this paper in its ideas is suitable for publication in AMT.
  From what I can discern, the validation here is strictly whether the CNN is able to reproduce the input data based on the sample learning period. While it is good to utilize this as a check of the algorithm behavior, there should be a validation performed with observations from alternative sensors/algorithms or ground-based observations (including ground-launched radiosondes).
  Response:
  Thank you for your insightful feedback on our validation methodology. We acknowledge the need to strengthen transparency and have clarified our current work’s scope. Specifically, quantitative metrics (e.g., RMSE) for different model architectures are presented in Appendix A, Table A1 of the revised manuscript. These serve as preliminary model-selection indicators during development, without in-depth interpretation, as the study prioritizes demonstrating the framework’s feasibility over exhaustive benchmarking. Additionally, the scarcity of in situ marine observations and spatiotemporal mismatches with FY3G retrievals currently limit robust maritime validation. In follow-up research, we will systematically address these via spatio-temporal alignment algorithms and high-resolution satellite datasets to ensure comprehensive cross-environment evaluation.
  Why was relative humidity (RH) chosen over water vapor mixing ratio or specific humidity? While I am not arguing that RH is invalid to retrieve, RH is not an absolute quantity and is dependent on temperature (which entangles retrieval of waver vapor and temperature). RH makes it more difficult to compare results against other literature in the microwave remote sensing community. Retrieving a water vapor value specifically would make the two values more distinct and provide better illumination of what channels are sensitive to which physical values.
  Response:
  Thank you for your comment on RH selection. This choice is grounded in two key considerations. First, as a standard ERA5 output, RH offers direct usability as label data and aligns with meteorological operational practices, ensuring compatibility with public observations and model inputs. Second, this study represents an initial effort to develop a temperature-humidity joint inversion model using FY-3G/MWRI-RM full-channel data. While RH’s temperature dependence introduces complexity, it enables exploration of dynamic temperature-humidity interactions (e.g., synergies between 118 GHz and 183 GHz channels; see Sections 4.2 and Conclusions in the revised manuscript). We agree that specific humidity retrieval would enhance physical clarity and plan to expand the model’s output to include water vapor parameters in future work, alongside error propagation analysis to decouple temperature-humidity contributions.
  The indication that 183 GHz provides higher information content for temperature sounding than all but one 118 GHz channel is a surprise to me (this statement is based on my interpretation of Figure 5 along with corresponding text). One citation is given for the reasoning behind this but is unfortunately in another language and inaccessible to me. Also, L239-242 states water vapor attenuates the signal (and cites a paper describing an infrared retrieval paper), which seems contrary to the stated conclusion. Detailed information about this analysis methodology and better tying the conclusions to relevant principles should be provided.
  Response:
  To deepen the analysis of 183 GHz channel sensitivity, we have enhanced both physical explanations and literature integration. Through gradient backpropagation and perturbation experiments (Sections 4.1–4.2), we clarify that the 183 GHz water vapor channel indirectly constrains temperature via vapor-radiation feedback: in high-humidity regions (e.g., boundary layers), water vapor’s absorption/emission modifies radiation transfer, influencing temperature through latent heat release. In contrast, the 118 GHz oxygen channel directly senses mid-upper tropospheric temperature gradients, with humidity impacts mediated by dynamics like vertical advection and atmospheric stability. Furthermore, we replaced some Chinese references with English counterparts and added discussions on temperature-humidity coupling (Lines 209–215 in Section 4.1; Lines 279–284 in Section 4.2 of the revised manuscript). The “model weight diffusion in convolutional layers” describes potential propagation of 118 GHz temperature-sensitive features to humidity parameters via spatial feature mixing, a phenomenon warranting further study with physics-constrained neural networks.
  The methodology explanation does not line up with explanations in the results sections. Up until section 4.2, I thought the retrieval was only over ocean (viz. L137) given that the training database was constructed from ocean only profiles and the validation dataset is part of the training database. The appendix plots also show only over ocean data. Yet Section 4.2 describes land cover characteristics when discussing the humidity retrieval performance. More detailed information about the database construction (perhaps a spatial map of where the database is compiled from) should be provided and the text clarified to line up with the methods.
  Response:
  Thank you for identifying the inconsistency in our data description. The term “land cover characteristics” was a typo; both training and validation datasets exclusively use oceanic observations (detailed in Section 2.2). We have corrected relevant paragraphs and removed land-data discussions, explicitly stated in the methodology: “To reduce preprocessing complexity, this study only selects oceanic data.”, highlighted marine data coverage in Appendix Figs. A1–A2. Future research will extend to land scenarios via standardized surface parameter preprocessing (e.g., soil moisture, vegetation index).
  Figures 3 (c-h) and 7 (c-h) are not very illustrative because of the density of points. Rather than shading by absolute error, a two-dimensional histogram should be performed and shading indicate the counts in the bins. Additionally, the line plotted on each subplot has no label -- whether it is a least squares fit or a 1-to-1 slope, that should be indicated in the figures. And speaking of least squares fit, are any fitting statistics available for the scatter plots?
  Response:
  Thank you for your thoughtful suggestions, which have been instrumental in enhancing the clarity and informativeness of our figures.
  To address the density of points in Figures 3(c–h) and 7(c–h), we have replaced the absolute error shading with two-dimensional histograms, where color intensity now represents the count of data points in each bin. This modification improves visual interpretability by highlighting data distribution patterns while retaining error magnitude insights. Additionally, the black dashed line in each subplot is explicitly labeled as the “1:1 consistency line” to clarify that it denotes perfect agreement between retrieved and target values, rather than a least squares fit. To further quantify the scatter plot relationships, we have included three statistical metrics—Root Mean Square Error (RMSE), Bias, and Pearson correlation coefficient (r)—in the figure captions and corresponding discussions. These metrics systematically evaluate error magnitude, systematic deviation, and linear correlation, respectively, providing robust statistical support for our validation results.
  For other figures, we have implemented the following improvements to ensure consistency and readability:
  Figure 2: Font sizes were adjusted to enhance legibility across all text elements, ensuring key labels and annotations are easily distinguishable.
  Figures 5 and 9: To emphasize critical channels, Channel 22 (118.75±1.2 GHz) and 26 (183.31±7 GHz) are now plotted with solid lines, while other channels use dash-dot lines for clear visual differentiation. This distinction highlights their unique roles in temperature-humidity retrieval as discussed in Section 4.2.
  Figures 6 and 10: The layouts were revised to incorporate pressure-level distributions of percentage changes induced by perturbations. This addition clarifies how the model responds to input variations across different atmospheric layers, strengthening the mechanistic insights presented.
  These adjustments collectively aim to align our figures with scientific visualization best practices, ensuring they effectively communicate the study’s key findings while addressing the nuances raised in your feedback. We appreciate your guidance in refining our graphical representations.
  In surveying the literature, much of the historical work mentioned was on infrared retrievals. More citations on microwave regime development should be included beyond why the regime is important for all-sky retrievals.
  Response:
  In response to your suggestion, we revised the introduction to enhance contextual depth:
  Lines 55-68 in the revised manuscript: Expanded discussions on microwave remote sensing applications for atmospheric variable retrieval, integrating recent advancements and key studies to underscore the research’s scientific significance.
  
  This revision strengthens the rationale for our approach by connecting it to broader disciplinary progress.
  Below is more specific copyediting feedback (which can be rejected where deemed), but overall the wording should be refined to focus on the scientific conclusions drawn from the results at hand without extra pomp or overbroad explanations.
  Response:
  We sincerely appreciate your meticulous copyediting suggestions, which have been invaluable in refining the manuscript’s clarity and scientific rigor. Below is a summary of the key revisions made to address your feedback, with a focus on enhancing readability, streamlining redundancy, and strengthening logical flow. All line numbers cited below correspond to the revised manuscript:
  The original three-page introductory paragraph has been restructured into thematic subsections to improve navigability. We also expanded the literature review on microwave remote sensing (e.g., added citations in Lines 55–68) to better contextualize our approach, while adjusting the description of neural networks to acknowledge their established role in the field (avoiding the implication of excluding them from “conventional” methods).
  Lines180-187 and Lines254-263: We recognize the need to further integrate broad theoretical discussions with our specific findings, particularly regarding error sources and future mitigation strategies. These aspects will be prioritized in follow-up analyses, including planned investigations into physics-constrained neural network architectures to address model uncertainties.
  Broad statements about remote sensing uncertainties were either tied to our specific results or condensed, with clearer links to future mitigation strategies.
  Lines199–201: The parenthetical comment about "model.conv1" was expanded into a dedicated paragraph explaining the convolutional layer architecture and spatial dimension considerations, improving methodological transparency.
  
  Citation: https://doi.org/10.5194/egusphere-2025-680-AC2
EC1:
'Comment on egusphere-2025-680', S. Joseph Munchak, 10 Mar 2026

Following the decision for minor revisions with editor review, here are my specific comments to address prior to publication:

1. Lines 229 and 328 - the 118.75 +- 1.2 GHz channel is described as being sensitive to temperature gradients. This is not specific - are these horizontal or vertical gradients? For vertical gradients (lapse rate), the channel differences (compared to a channel closer or farther from the 118.75 O2 line) would actually be directly indicative of the gradient. Horizontal gradients would be captured by gradients in the 5x5 patch that is used as input - but it is not clear how this is relevant to the Jacobians. I think a better phrase would be "temperature distributions" instead of "temperature gradients" in these sentences.
2. Table A1 - the units of the AR-CNN results are incorrect (and inconsistent with the other rows, which are correct).

Citation: https://doi.org/10.5194/egusphere-2025-680-EC1
- AC3:
  'Reply on EC1', Wei Han, 12 Mar 2026
  Dear Editor,
  We would like to express our sincere gratitude for your detailed and insightful comments. Your feedback, particularly regarding the precise physical interpretation of the microwave channels, has been immensely helpful in improving the rigor and clarity of our manuscript. We have carefully studied your suggestions and made the corresponding revisions.
  Below is a point-by-point response to your comments.
  Editor Comment 1: Lines 229 and 328 - the 118.75 +- 1.2 GHz channel is described as being sensitive to temperature gradients. This is not specific - are these horizontal or vertical gradients? ... I think a better phrase would be “temperature distributions” instead of “temperature gradients” in these sentences.
  Response: We completely agree with your assessment. You are absolutely correct that describing a single channel’s Jacobian as sensitive to “temperature gradients” is physically inaccurate in this context. As you accurately pointed out, the Jacobians reflect the sensitivity to the state variable itself at specific pressure levels, rather than its spatial derivatives (which would require channel differencing for vertical gradients or spatial feature analysis for horizontal gradients). We highly appreciate you pointing out this conceptual ambiguity.
  Editor Comment 2: Table A1 - the units of the AR-CNN results are incorrect (and inconsistent with the other rows, which are correct).
  Response: Thank you for catching this oversight. We apologize for the typo in the formatting of the AR-CNN results. We have thoroughly checked Table A1 and corrected the units for the AR-CNN row to ensure complete consistency with the rest of the table.
  Furthermore, inspired by your focus on physical rigor, we conducted a thorough review of the manuscript and made two additional proactive refinements to ensure our interpretations of the deep learning model’s behavior are fundamentally sound:
  We revised a sentence regarding the relationship between the 118 GHz channel and humidity. We replaced the original explanation involving “dynamical coupling”and “vertical advection” with a more accurate description of the “thermodynamic correlation.”
  
  We changed the term “radiometric observations”to “radiance observations” when discussing deep convection areas, as “radiance” is a more precise term for the physical quantity interacting with atmospheric hydrometeors.
  
  Modifications in the Manuscript:
  Line 229 (Revised):“This highlights the model’s utilization of the oxygen absorption line near 118 GHz, a spectral feature with significant sensitivity to mid-to-upper tropospheric temperature distributions (500-200 hPa) and...”
  
  Line 328 (Revised):“In contrast, the 118.75±1.2 GHz channel is primarily sensitive to mid-to-upper tropospheric temperature distributions (500-200 hPa) via...”
  
  Table A1:The units for the AR-CNN row have been corrected to perfectly match the baseline formats.
  
  Additional Revision 1(Line 332): “This apparent discrepancy can be attributed to the thermodynamic correlation between temperature and humidity profiles." (The unsupported second half of the sentence regarding vertical advection was removed).
  
  Additional Revision 2(Line 287): “These processes, especially in areas with deep convection or liquid water droplets and ice crystals, obscure the direct link between radiance observations and humidity, challenging the model's ability to accurately retrieve humidity values solely from TBs.”
  
  Citation: https://doi.org/10.5194/egusphere-2025-680-AC3

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-680', Anonymous Referee #1, 10 Apr 2025
The paper used machine learning and microwave channels to retrieve temperature and humidity profiles. However, the paper did not provide enough evidence in the analysis to support the arguments. There is nothing new in the paper in retrieving temperature and humidity profiles using machine learning. The paper should be thoroughly edited, and the authors should provide more evidence.
The arrangement of the paragraph is chaotic. It seems put a bunch of unedited paragraphs together.

Not enough paragraph to support of using AR-CNN architecture. What if using other machine learning architecture? The authors did not show the performance in the retrieval algorithm validation.

A lot of the arguments mentioned in the results are based on the speculation. No other future examination. No discussion to the appendix figures.

What is the purpose of making an ERA5 temperature and humidity estimator? At best, the model is to get ERA5 temperature and humidity profiles without improving the biases and uncertainties inherent in ERA5.
Citation: https://doi.org/10.5194/egusphere-2025-680-RC1
- AC1:
  'Reply on RC1', Wei Han, 23 Apr 2025
  Thank you for your insightful feedback. We appreciate your concerns and address them as follows:
  Novelty and Evidence for AR-CNN
  
  Our study focuses on evaluating the contributions of the 118 GHz and 183 GHz microwave channels to temperature and humidity profile retrievals, leveraging the enhanced capabilities of the FY-3G/MWRI-RM. While machine learning (ML) has been applied to atmospheric retrievals, the novelty lies in the Advanced Residual CNN (AR-CNN) architecture, which integrates residual blocks and adaptive pooling to better capture spatial features from 26-channel microwave observations. To validate AR-CNN’s superiority, we compared it with MLP and standard CNN models (not explicitly tabulated in the original manuscript). The AR-CNN achieved a lower RMSE (1.24 K for temperature, 12.98% for humidity) compared to MLP (1.52 K, 15.3%) and CNN (1.38 K, 14.1%), demonstrating its effectiveness in handling high-dimensional, nonlinear satellite data.
  
  Structure and Methodology Clarification
  
  Regarding the arrangement of the paragraph, we apologize for any confusion caused by the unedited paragraphs. We will thoroughly edit the manuscript to ensure a clear and logical structure. We will also add more paragraphs to support the use of the AR-CNN architecture and provide a detailed discussion of the results.
  
  Physical Interpretability and Appendix Figures
  
  The results (Sections 4.1–4.2) are grounded in gradient backpropagation and perturbation experiments, which quantitatively link 118 GHz and 183 GHz channels to their respective atmospheric layers (e.g., 118 GHz for upper-level retrievals, 183 GHz for mid-to-lower layers). These align with Jacobian analyses (Figs. 5, 9) and confirm the model’s interpretability. The appendix figures (e.g., global maps in Fig. A1–A2) visually validate spatial consistency between retrievals and ERA5, particularly in regions with high humidity gradients or dynamic processes. Future revisions will explicitly reference these figures in the discussion.
  
  Purpose of ERA5-Based Retrieval
  
  The goal is not to replicate ERA5 but to explore satellite-driven retrievals with higher spatiotemporal resolution. ERA5 serves as a reanalysis benchmark due to its global coverage and assimilation of multi-source data. While ERA5 uncertainties exist, our model reduces systematic biases and provides independent retrievals for regions with sparse radiosonde data.
  
  Citation: https://doi.org/10.5194/egusphere-2025-680-AC1
RC2:
'Comment on egusphere-2025-680', Anonymous Referee #2, 29 Apr 2025

This paper presents a algorithm for use on the microwave imager instrument aboard FY-3G to retrieve profiles of temperature and humidity. The algorithm is a convolutional neural network (CNN) that was trained with ERA5 data, and because it is in the microwave part of the spectrum it can retrieve all sky conditions. Neural network retrievals are actively being improved in the literature, especially multidimensional algorithms such as convolutional neural networks, so this paper in its ideas is suitable for publication in AMT.
Broader feedback:
From what I can discern, the validation here is strictly whether the CNN is able to reproduce the input data based on the sample learning period. While it is good to utilize this as a check of the algorithm behavior, there should be a validation performed with observations from alternative sensors/algorithms or ground-based observations (including ground-launched radiosondes).
Why was relative humidity (RH) chosen over water vapor mixing ratio or specific humidity? While I am not arguing that RH is invalid to retrieve, RH is not an absolute quantity and is dependent on temperature (which entangles retrieval of waver vapor and temperature). RH makes it more difficult to compare results against other literature in the microwave remote sensing community. Retrieving a water vapor value specifically would make the two values more distinct and provide better illumination of what channels are sensitive to which physical values.
The indication that 183 GHz provides higher information content for temperature sounding than all but one 118 GHz channel is a surprise to me (this statement is based on my interpretation of Figure 5 along with corresponding text). One citation is given for the reasoning behind this but is unfortunately in another language and inaccessible to me. Also, L239-242 states water vapor attenuates the signal (and cites a paper describing an infrared retrieval paper), which seems contrary to the stated conclusion. Detailed information about this analysis methodology and better tying the conclusions to relevant principles should be provided.
The methodology explanation does not line up with explanations in the results sections. Up until section 4.2, I thought the retrieval was only over ocean (viz. L137) given that the training database was constructed from ocean only profiles and the validation dataset is part of the training database. The appendix plots also show only over ocean data. Yet Section 4.2 describes land cover characteristics when discussing the humidity retrieval performance. More detailed information about the database construction (perhaps a spatial map of where the database is compiled from) should be provided and the text clarified to line up with the methods.
Figures 3 (c-h) and 7 (c-h) are not very illustrative because of the density of points. Rather than shading by absolute error, a two-dimensional histogram should be performed and shading indicate the counts in the bins. Additionally, the line plotted on each subplot has no label -- whether it is a least squares fit or a 1-to-1 slope, that should be indicated in the figures. And speaking of least squares fit, are any fitting statistics available for the scatter plots?
In surveying the literature, much of the historical work mentioned was on infrared retrievals. More citations on microwave regime development should be included beyond why the regime is important for all-sky retrievals.
Below is more specific copyediting feedback (which can be rejected where deemed), but overall the wording should be refined to focus on the scientific conclusions drawn from the results at hand without extra pomp or overbroad explanations.
The introduction section was difficult to parse. It is a three page-long paragraph. Consider splitting it up into thematic paragraphs. (This may be simply because of the template used.)
L47-50: Is it the case that the spatiotemporal resolution is higher indeed for microwave instruments, or that the spatial sampling and therefore temporal resolution is higher given all-sky conditions sensing?
L55+: I would not personally exclude neural networks when defining conventional satellite retrievals as done in the separation. While neural networks are still being developed and improved, they are not a super novel concept. In example, L85 cites a neural network retrieval from 15 years ago.
L79-81: This sentence has too much puffery.
Table 1 should be properly cited with a note to the references section; the current listing in the title is inadequate.
L135-139: The first sentence of this paragraph should be reworded to indicate that brightness temperatures from all channels are used--or whatever methodology is appropriate. The static values should be further described.
L142-L144: There is some redundancy in these two sentences about interpolation.
Figure 2: Font size should be increased on within-figure text.
L185-186: This wording is awkward.
L187-188: I would argue this is more a matter of instrument sensitivity vertically than model performance.
L198-200: I would rephrase or better convey the argument here.
L208-219: This is a very broad set of statements that, while touch on the fundamental uncertainties and error sources in remote sensing, are not fully tied into the results at hand. Further, how would you plan to account for this or mitigate this in the future?
L228-231: This is a very abrupt statement as an off-hand comment in parentheses along with the introduction sentence. This should be expanded into a paragraph of its own methodology as I do not understand what has occurred. What is model.conv1? What spatial dimensions?
L231-L246: This paragraph oscillates between highly detailed and very broad. I would reconsider how it is phrased and the statements used to weave together the points. Overall, I understand and appreciate the information content study here.
Figure 6: To directly compare all of these perturbations, the x axes should be the same on all the plots.
L272-282: Similar to L208-219, this is another very broad set of statements that do not fully tie into the results or indicate how/if you plan to address these concerns in the future.
L286-288: These sentences are partially redundant.
L290-295: Repeated phrasing. Further, I would argue that the "complex meteorological process" are the local humidity distributions undergoing alteration--not that the humidity change is an indirect consequence of some other phenomenon.
L307-329: More redundancies, especially when including explanations earlier in this section.
Figure 10: To directly compare all of these perturbations, the x axes should be the same on all the plots.

Citation: https://doi.org/10.5194/egusphere-2025-680-RC2
- AC2: 'Reply on RC2', Wei Han, 23 May 2025
  
  Overview
  This paper presents a algorithm for use on the microwave imager instrument aboard FY-3G to retrieve profiles of temperature and humidity. The algorithm is a convolutional neural network (CNN) that was trained with ERA5 data, and because it is in the microwave part of the spectrum it can retrieve all sky conditions. Neural network retrievals are actively being improved in the literature, especially multidimensional algorithms such as convolutional neural networks, so this paper in its ideas is suitable for publication in AMT.
  From what I can discern, the validation here is strictly whether the CNN is able to reproduce the input data based on the sample learning period. While it is good to utilize this as a check of the algorithm behavior, there should be a validation performed with observations from alternative sensors/algorithms or ground-based observations (including ground-launched radiosondes).
  Response:
  Thank you for your insightful feedback on our validation methodology. We acknowledge the need to strengthen transparency and have clarified our current work’s scope. Specifically, quantitative metrics (e.g., RMSE) for different model architectures are presented in Appendix A, Table A1 of the revised manuscript. These serve as preliminary model-selection indicators during development, without in-depth interpretation, as the study prioritizes demonstrating the framework’s feasibility over exhaustive benchmarking. Additionally, the scarcity of in situ marine observations and spatiotemporal mismatches with FY3G retrievals currently limit robust maritime validation. In follow-up research, we will systematically address these via spatio-temporal alignment algorithms and high-resolution satellite datasets to ensure comprehensive cross-environment evaluation.
  Why was relative humidity (RH) chosen over water vapor mixing ratio or specific humidity? While I am not arguing that RH is invalid to retrieve, RH is not an absolute quantity and is dependent on temperature (which entangles retrieval of waver vapor and temperature). RH makes it more difficult to compare results against other literature in the microwave remote sensing community. Retrieving a water vapor value specifically would make the two values more distinct and provide better illumination of what channels are sensitive to which physical values.
  Response:
  Thank you for your comment on RH selection. This choice is grounded in two key considerations. First, as a standard ERA5 output, RH offers direct usability as label data and aligns with meteorological operational practices, ensuring compatibility with public observations and model inputs. Second, this study represents an initial effort to develop a temperature-humidity joint inversion model using FY-3G/MWRI-RM full-channel data. While RH’s temperature dependence introduces complexity, it enables exploration of dynamic temperature-humidity interactions (e.g., synergies between 118 GHz and 183 GHz channels; see Sections 4.2 and Conclusions in the revised manuscript). We agree that specific humidity retrieval would enhance physical clarity and plan to expand the model’s output to include water vapor parameters in future work, alongside error propagation analysis to decouple temperature-humidity contributions.
  The indication that 183 GHz provides higher information content for temperature sounding than all but one 118 GHz channel is a surprise to me (this statement is based on my interpretation of Figure 5 along with corresponding text). One citation is given for the reasoning behind this but is unfortunately in another language and inaccessible to me. Also, L239-242 states water vapor attenuates the signal (and cites a paper describing an infrared retrieval paper), which seems contrary to the stated conclusion. Detailed information about this analysis methodology and better tying the conclusions to relevant principles should be provided.
  Response:
  To deepen the analysis of 183 GHz channel sensitivity, we have enhanced both physical explanations and literature integration. Through gradient backpropagation and perturbation experiments (Sections 4.1–4.2), we clarify that the 183 GHz water vapor channel indirectly constrains temperature via vapor-radiation feedback: in high-humidity regions (e.g., boundary layers), water vapor’s absorption/emission modifies radiation transfer, influencing temperature through latent heat release. In contrast, the 118 GHz oxygen channel directly senses mid-upper tropospheric temperature gradients, with humidity impacts mediated by dynamics like vertical advection and atmospheric stability. Furthermore, we replaced some Chinese references with English counterparts and added discussions on temperature-humidity coupling (Lines 209–215 in Section 4.1; Lines 279–284 in Section 4.2 of the revised manuscript). The “model weight diffusion in convolutional layers” describes potential propagation of 118 GHz temperature-sensitive features to humidity parameters via spatial feature mixing, a phenomenon warranting further study with physics-constrained neural networks.
  The methodology explanation does not line up with explanations in the results sections. Up until section 4.2, I thought the retrieval was only over ocean (viz. L137) given that the training database was constructed from ocean only profiles and the validation dataset is part of the training database. The appendix plots also show only over ocean data. Yet Section 4.2 describes land cover characteristics when discussing the humidity retrieval performance. More detailed information about the database construction (perhaps a spatial map of where the database is compiled from) should be provided and the text clarified to line up with the methods.
  Response:
  Thank you for identifying the inconsistency in our data description. The term “land cover characteristics” was a typo; both training and validation datasets exclusively use oceanic observations (detailed in Section 2.2). We have corrected relevant paragraphs and removed land-data discussions, explicitly stated in the methodology: “To reduce preprocessing complexity, this study only selects oceanic data.”, highlighted marine data coverage in Appendix Figs. A1–A2. Future research will extend to land scenarios via standardized surface parameter preprocessing (e.g., soil moisture, vegetation index).
  Figures 3 (c-h) and 7 (c-h) are not very illustrative because of the density of points. Rather than shading by absolute error, a two-dimensional histogram should be performed and shading indicate the counts in the bins. Additionally, the line plotted on each subplot has no label -- whether it is a least squares fit or a 1-to-1 slope, that should be indicated in the figures. And speaking of least squares fit, are any fitting statistics available for the scatter plots?
  Response:
  Thank you for your thoughtful suggestions, which have been instrumental in enhancing the clarity and informativeness of our figures.
  To address the density of points in Figures 3(c–h) and 7(c–h), we have replaced the absolute error shading with two-dimensional histograms, where color intensity now represents the count of data points in each bin. This modification improves visual interpretability by highlighting data distribution patterns while retaining error magnitude insights. Additionally, the black dashed line in each subplot is explicitly labeled as the “1:1 consistency line” to clarify that it denotes perfect agreement between retrieved and target values, rather than a least squares fit. To further quantify the scatter plot relationships, we have included three statistical metrics—Root Mean Square Error (RMSE), Bias, and Pearson correlation coefficient (r)—in the figure captions and corresponding discussions. These metrics systematically evaluate error magnitude, systematic deviation, and linear correlation, respectively, providing robust statistical support for our validation results.
  For other figures, we have implemented the following improvements to ensure consistency and readability:
  Figure 2: Font sizes were adjusted to enhance legibility across all text elements, ensuring key labels and annotations are easily distinguishable.
  Figures 5 and 9: To emphasize critical channels, Channel 22 (118.75±1.2 GHz) and 26 (183.31±7 GHz) are now plotted with solid lines, while other channels use dash-dot lines for clear visual differentiation. This distinction highlights their unique roles in temperature-humidity retrieval as discussed in Section 4.2.
  Figures 6 and 10: The layouts were revised to incorporate pressure-level distributions of percentage changes induced by perturbations. This addition clarifies how the model responds to input variations across different atmospheric layers, strengthening the mechanistic insights presented.
  These adjustments collectively aim to align our figures with scientific visualization best practices, ensuring they effectively communicate the study’s key findings while addressing the nuances raised in your feedback. We appreciate your guidance in refining our graphical representations.
  In surveying the literature, much of the historical work mentioned was on infrared retrievals. More citations on microwave regime development should be included beyond why the regime is important for all-sky retrievals.
  Response:
  In response to your suggestion, we revised the introduction to enhance contextual depth:
  Lines 55-68 in the revised manuscript: Expanded discussions on microwave remote sensing applications for atmospheric variable retrieval, integrating recent advancements and key studies to underscore the research’s scientific significance.
  
  This revision strengthens the rationale for our approach by connecting it to broader disciplinary progress.
  Below is more specific copyediting feedback (which can be rejected where deemed), but overall the wording should be refined to focus on the scientific conclusions drawn from the results at hand without extra pomp or overbroad explanations.
  Response:
  We sincerely appreciate your meticulous copyediting suggestions, which have been invaluable in refining the manuscript’s clarity and scientific rigor. Below is a summary of the key revisions made to address your feedback, with a focus on enhancing readability, streamlining redundancy, and strengthening logical flow. All line numbers cited below correspond to the revised manuscript:
  The original three-page introductory paragraph has been restructured into thematic subsections to improve navigability. We also expanded the literature review on microwave remote sensing (e.g., added citations in Lines 55–68) to better contextualize our approach, while adjusting the description of neural networks to acknowledge their established role in the field (avoiding the implication of excluding them from “conventional” methods).
  Lines180-187 and Lines254-263: We recognize the need to further integrate broad theoretical discussions with our specific findings, particularly regarding error sources and future mitigation strategies. These aspects will be prioritized in follow-up analyses, including planned investigations into physics-constrained neural network architectures to address model uncertainties.
  Broad statements about remote sensing uncertainties were either tied to our specific results or condensed, with clearer links to future mitigation strategies.
  Lines199–201: The parenthetical comment about "model.conv1" was expanded into a dedicated paragraph explaining the convolutional layer architecture and spatial dimension considerations, improving methodological transparency.
  
  Citation: https://doi.org/10.5194/egusphere-2025-680-AC2
EC1:
'Comment on egusphere-2025-680', S. Joseph Munchak, 10 Mar 2026

Following the decision for minor revisions with editor review, here are my specific comments to address prior to publication:

1. Lines 229 and 328 - the 118.75 +- 1.2 GHz channel is described as being sensitive to temperature gradients. This is not specific - are these horizontal or vertical gradients? For vertical gradients (lapse rate), the channel differences (compared to a channel closer or farther from the 118.75 O2 line) would actually be directly indicative of the gradient. Horizontal gradients would be captured by gradients in the 5x5 patch that is used as input - but it is not clear how this is relevant to the Jacobians. I think a better phrase would be "temperature distributions" instead of "temperature gradients" in these sentences.
2. Table A1 - the units of the AR-CNN results are incorrect (and inconsistent with the other rows, which are correct).

Citation: https://doi.org/10.5194/egusphere-2025-680-EC1
- AC3:
  'Reply on EC1', Wei Han, 12 Mar 2026
  Dear Editor,
  We would like to express our sincere gratitude for your detailed and insightful comments. Your feedback, particularly regarding the precise physical interpretation of the microwave channels, has been immensely helpful in improving the rigor and clarity of our manuscript. We have carefully studied your suggestions and made the corresponding revisions.
  Below is a point-by-point response to your comments.
  Editor Comment 1: Lines 229 and 328 - the 118.75 +- 1.2 GHz channel is described as being sensitive to temperature gradients. This is not specific - are these horizontal or vertical gradients? ... I think a better phrase would be “temperature distributions” instead of “temperature gradients” in these sentences.
  Response: We completely agree with your assessment. You are absolutely correct that describing a single channel’s Jacobian as sensitive to “temperature gradients” is physically inaccurate in this context. As you accurately pointed out, the Jacobians reflect the sensitivity to the state variable itself at specific pressure levels, rather than its spatial derivatives (which would require channel differencing for vertical gradients or spatial feature analysis for horizontal gradients). We highly appreciate you pointing out this conceptual ambiguity.
  Editor Comment 2: Table A1 - the units of the AR-CNN results are incorrect (and inconsistent with the other rows, which are correct).
  Response: Thank you for catching this oversight. We apologize for the typo in the formatting of the AR-CNN results. We have thoroughly checked Table A1 and corrected the units for the AR-CNN row to ensure complete consistency with the rest of the table.
  Furthermore, inspired by your focus on physical rigor, we conducted a thorough review of the manuscript and made two additional proactive refinements to ensure our interpretations of the deep learning model’s behavior are fundamentally sound:
  We revised a sentence regarding the relationship between the 118 GHz channel and humidity. We replaced the original explanation involving “dynamical coupling”and “vertical advection” with a more accurate description of the “thermodynamic correlation.”
  
  We changed the term “radiometric observations”to “radiance observations” when discussing deep convection areas, as “radiance” is a more precise term for the physical quantity interacting with atmospheric hydrometeors.
  
  Modifications in the Manuscript:
  Line 229 (Revised):“This highlights the model’s utilization of the oxygen absorption line near 118 GHz, a spectral feature with significant sensitivity to mid-to-upper tropospheric temperature distributions (500-200 hPa) and...”
  
  Line 328 (Revised):“In contrast, the 118.75±1.2 GHz channel is primarily sensitive to mid-to-upper tropospheric temperature distributions (500-200 hPa) via...”
  
  Table A1:The units for the AR-CNN row have been corrected to perfectly match the baseline formats.
  
  Additional Revision 1(Line 332): “This apparent discrepancy can be attributed to the thermodynamic correlation between temperature and humidity profiles." (The unsupported second half of the sentence regarding vertical advection was removed).
  
  Additional Revision 2(Line 287): “These processes, especially in areas with deep convection or liquid water droplets and ice crystals, obscure the direct link between radiance observations and humidity, challenging the model's ability to accurately retrieve humidity values solely from TBs.”
  
  Citation: https://doi.org/10.5194/egusphere-2025-680-AC3

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

AR by Wei Han on behalf of the Authors (24 May 2025) Author's response Author's tracked changes Manuscript

ED: Reconsider after major revisions (03 Jun 2025) by S. Joseph Munchak

AR by Wei Han on behalf of the Authors (09 Jul 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (14 Jul 2025) by S. Joseph Munchak

RR by Anonymous Referee #3 (13 Aug 2025)

Suggestions for revision or reasons for rejection

The manuscript "All-Sky Temperature and Humidity Retrieval from the MWRI-RM Onboard the FY-3G Satellite" by Minghua Liu et al. introduces a convolutional neural network algorithm for retrieving vertical profiles of temperature and relative humidity from this instrument, and includes sensitivity analysis for the products. The new dataset is interesting, the CNN methodology has potential, and this reviewer appreciates the authors' interest in performing Jacobian and sensitivity analysis to explain the behavior of the algorithm.

Nevertheless, there are significant issues with the methodology or its description that need to be addressed in revision:
- The retrieval apparently uses a patch of 5x5 spatial footprints from the instrument as input, and the output is a single retrieval at the center of this patch. The training target is also, apparently, a single ERA5 profile interpolated via nearest neighbor to the center footprint of this patch. In that case, what is the basis for using spatial convolution in this approach? Is there any benefit to using spatial convolution as opposed to just simply using the nearest footprint, or perhaps just spatially averaging the input patch? Also: It was unclear what the spatial resolution of the retrieval product is. In CNN terms, what is the stride used to make the product? I.e., does each 5x5 input patch contribute to only a single output profile at the center of the patch, or are there overlapping patches contributing to neighboring retrievals?

Also: Are any non-radiance inputs like zenith angle used as input (common practice in NN retrievals)? If not, then the spatial features encompassed within the patch (which the authors are aiming to exploit with a CNN) will be angle-dependent with no inputs to indicate this to the model.

- There are few implementation details. E.g., was Pytorch used? What optimizer was used to train the algorithm? What are the kernel sizes used in the model? How were hyperparameters chosen?

- More information is needed in 2.2 "Sample Construction" is unclear. Were the selected ocean samples thinned out or curated in any particular way? How was the chronological test/validation/training split performed (interleaved or consecutive)? Why was that month of data selected? 2.2 mentions "...preprocessed static parameters obtained through decoding and quality checks..." but these parameters are not defined, nor is their usage.

The Jacobian methodology needs more elaboration. The plots such as Figure 5b are weighting functions computed from a radiative transfer model. The description of "gradient backpropagation" in the manuscript, apparently, therefore means automatic differentiation of the radiative transfer model outputs with respect to test profile inputs, not, say, differentiation of the CNN model. Is this true? The sensitivity of the CNN model itself is only shown as the conv1 weights. This is not sufficient to provide the desired explainability of the retrieval methodology. With a neural network model, it is possible, and straightforward to use "autograd" in tools such as PyTorch to compute the Jacobian of the whole model's output with respect to its inputs. This would actually describe the channel usage by the model, which appears to be the desired goal.

- Retrieving relative humidity as opposed to more common practice of retrieving q or log(q) directly is not wrong - it may actually be an interesting idea - but it is uncommon. I'm curious why this was done - was there any benefit found to retrieving RH instead of q directly? In particular, RH is a function of temperature, which can confuse the sensitivity analysis.

- I may be missing something important, but I didn't see where the Appendix figure and table were referenced in the manuscript.

Hide

RR by Anonymous Referee #2 (21 Sep 2025)

Suggestions for revision or reasons for rejection

This paper presents a temperature and relative humidity retrieval for MWRI-RM instrument using a convolutional neural network framework. The authors have made substantial improvements to their analysis and text from the last submission, so I appreciate their commitment to this study.

My main point of information relates to the selection of relative humidity. While the authors make many allusions to the coupling between temperature and moisture (which comes into play with contributions from both absorption bands affecting both quantities), there is not an indication why relative humidity is chosen for the moisture retrieval. After all, relative humidity is directly dependent on temperature by definition, while quantities such as absolute humidity/mixing ratio are more straightforward and independent (with still this temperature-moisture connection as they authors mention). My intuition is that this might provide better performance and behavior for the moisture retrievals as the inputs are "purer" and the dependence on temperature is more indirect than direct. But I am not trying to say this is required for publication.

I would like to the authors to explain more the methodology at L96-97. What exactly are preprocessed static parameters obtained through decoding and quality checks? It's unclear to me what this actually means.

In every case, more minor suggestions or changes follow:

L60-68: This paragraph seems abrupt, talking first about ML but the moving back to statistics. I would consider splitting into two or separating prior work into method types.
L62: Please define the acronym PWV and GPS.
L109: with _the final_ 20%
L146: I do not understand the bias description. Bias varies vertically and is not uniform or consistent.
L148: Nevertheless, _t_he
L172: Please walk the reader through Figure 3c-h in the text.
L180: I would more explicitly state in the reanalysis, not just "input features"
Figure 3: Unit labels for a and b subfigures
L196-201: Please explain every aspect of this more, from the customization to how weights were extracted.
L274: research._ _For
Figure 8: Unit labels for a and b subfigures

Hide

ED: Reconsider after major revisions (09 Oct 2025) by S. Joseph Munchak

AR by Wei Han on behalf of the Authors (12 Oct 2025) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (23 Oct 2025) by S. Joseph Munchak

RR by Anonymous Referee #2 (12 Nov 2025)

RR by Anonymous Referee #4 (05 Feb 2026)

Suggestions for revision or reasons for rejection

This paper study the possibility of using the MWRI-RM data for the retrieval of temperature and humidity atmospheric profiles. A AR-CNN is used to perform this retrieval.

1) The retrieval of temperature and humidity profiles from MW and IR data is very large. The literature review is too sparse and this reflects on the paper maturity. More references are required. NN retrieval dates back from the end of the 90s.

2) The analysis is performed only in one month of data. This is very limited and not sufficient to compare to other retrievals. You mentioned that training, validation and testing has been chosen randomly, this means that there are no true independency between them because two neighboring pixels are almost the same. This is not standard practice.

3) Some figures are not optimal. Figure 1 is useless, we can understand a 5x5 input window for a target as atmospheric profile. Figure 2 os tpp sùamm and almost cannot be read. I would like more comments on the architecture itself. Figure 4 is not necessary. If you want to comment on extremes, better graph can illustrate.

4) Section 3.2 us useless

5) Comments about the extreme cases... you need to read the literature. For example the boucher et al recent paper about extremes, CNN and dampening effect.

6) you don't compare your results with more simple, and older techniques like: linear regression, MLP, etc... I am quite sure that a MLP will give similar results than what you obtain with this complex model.

7) The way to analyse the interpretation of the model is not correct. For instance, if you perturb one input channl by 1K random noise, it is not correct. Channels are highly correlated, and introducing incoherencies in inputs is not physical. If you want to measure the information content of the channels when using a MLP, you can add hierarchically the channels, or suppress them, to see the impact on the results.

8) Figure 5 right, indicate the channels frequency. Cannot identify the channels with the colours, need to change that. Left part, this is not reliable I think for the reasons I mentioned.

9) Figures A1 and A2 cannot show differences considering the spread of the colorbar. They are not useful. You should rather do a bias and RMS map of the errors.

Referee Report: PDF

Hide

ED: Reconsider after major revisions (08 Feb 2026) by S. Joseph Munchak

AR by Wei Han on behalf of the Authors (02 Mar 2026) Author's response Author's tracked changes Manuscript

ED: Publish subject to minor revisions (review by editor) (10 Mar 2026) by S. Joseph Munchak

AR by Wei Han on behalf of the Authors (11 Mar 2026) Author's response Author's tracked changes Manuscript

ED: Publish as is (16 Mar 2026) by S. Joseph Munchak

AR by Wei Han on behalf of the Authors (19 Mar 2026) Manuscript

Journal article(s) based on this preprint

25 Mar 2026

All-sky temperature and humidity retrieval from the MWRI-RM onboard the FY-3G satellite

Minghua Liu, Wei Han, Yunfan Yang, Haofei Sun, and Ruoying Yin

Atmos. Meas. Tech., 19, 2061–2077, https://doi.org/10.5194/amt-19-2061-2026,https://doi.org/10.5194/amt-19-2061-2026, 2026

Short summary

Minghua Liu, Wei Han, Yunfan Yang, Haofei Sun, and Ruoying Yin

Viewed

Total article views: 3,758 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
2,536	1,121	101	3,758	90	121

HTML: 2,536
PDF: 1,121
XML: 101
Total: 3,758
BibTeX: 90
EndNote: 121

Views and downloads (calculated since 17 Mar 2025)

Month	HTML	PDF	XML	Total
Mar 2025	122	18	4	144
Apr 2025	92	36	12	140
May 2025	62	10	6	78
Jun 2025	74	36	16	126
Jul 2025	58	16	4	78
Aug 2025	218	22	4	244
Sep 2025	1,072	28	2	1,102
Oct 2025	96	14	6	116
Nov 2025	80	72	6	158
Dec 2025	156	152	6	314
Jan 2026	148	138	6	292
Feb 2026	78	106	6	190
Mar 2026	218	350	20	588
Apr 2026	45	99	2	146
May 2026	17	24	1	42

Cumulative views and downloads (calculated since 17 Mar 2025)

Month	HTML	PDF	XML	Total
Mar 2025	122	18	4	144
Apr 2025	92	36	12	140
May 2025	62	10	6	78
Jun 2025	74	36	16	126
Jul 2025	58	16	4	78
Aug 2025	218	22	4	244
Sep 2025	1,072	28	2	1,102
Oct 2025	96	14	6	116
Nov 2025	80	72	6	158
Dec 2025	156	152	6	314
Jan 2026	148	138	6	292
Feb 2026	78	106	6	190
Mar 2026	218	350	20	588
Apr 2026	45	99	2	146
May 2026	17	24	1	42

Viewed (geographical distribution)

Total article views: 3,758 (including HTML, PDF, and XML) Thereof 3,758 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 24 May 2026

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (8585 KB)
Metadata XML

Short summary

This research develops a machine learning approach to estimate atmospheric temperature and humidity profiles using satellite and weather data. The results showed that our method could accurately retrieve profiles with a high degree of precision. However, we found some limitations in very humid conditions, suggesting that further improvements to the model are needed. Our findings could help enhance the reliability of atmospheric measurements and contribute to better weather predictions.


Total:	0
HTML:	0
PDF:	0
XML:	0