the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Research on deep learning-based missing echo restoration method for weather radar mosaic data
Abstract. Radar mosaic data represent a critical and widely utilized resource in weather forecasting. Nevertheless, the frequent occurrence of regional radar echo gaps, caused by factors including radar hardware malfunctions, data delivery delays, and software processing errors—each contributing to substantial spatial uncertainty in the missing areas—significantly constrains its quantitative application. To address this issue, we propose BiConvLSTM-UNet, a sequence reconstruction model designed to restore missing radar echoes. The model operates without relying on a missing-value mask during both training and inference, learning the inherent spatiotemporal variation patterns of radar echoes to reconstruct complete sequences. A post-processing procedure is implemented to minimize the impact of reconstruction on areas without missing data. Furthermore, multiple missing scenarios are synthetically generated to improve the model’s robustness and repair performance across diverse missing-data conditions. Comparative assessments against traditional and other deep learning approaches demonstrate the superior inpainting performance of the proposed BiConvLSTM-UNet across multiple missing-data scenarios. The method introduces minimal artifact to non-missing regions, and subsequent post-processing further diminishes reconstruction errors. Moreover, the model sustains consistent performance across varying missing data lengths and continuity patterns, indicating robust generalization capabilities. Consequently, the BiConvLSTM-UNet is more adept at addressing the intricate and varied scenarios of incomplete radar mosaic data encountered in practical applications.
- Preprint
(3177 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 16 Jun 2026)
- RC1: 'Comment on egusphere-2026-1422', Anonymous Referee #1, 25 May 2026 reply
-
AC1: 'Comment on egusphere-2026-1422', Muyun Du, 06 Jun 2026
reply
Thank you to the reviewer for taking the time to read the manuscript and providing valuable suggestions and comments. Your insights are crucial for the further improvement of this work. Regarding each of the issues, the authors have provided the following responses. If there are any unclear points in the answers or other questions, please feel free to offer your comments. We will continue to correct and improve. Thank you!
- As described in Abstract, “Moreover, the model sustains consistent performance across varying missing data lengths and continuity patterns, indicating robust generalization capabilities.”, the statement of “consistent performance” is based on the experimental results in section 4.4. However, the numerical values are not the same. How should this be understood?
Reply: The emphasis here on maintaining consistency is on the robustness of the model. Specifically, relative stability is demonstrated by the gradual and moderate decline in performance as the length of missing data increases and the continuity of missing segments deepens—yet all metrics remain consistently above a high baseline, with no abrupt or catastrophic degradation. Furthermore, no matter what kind of missing situation is encountered (such as an increase in the length of missing data or a deeper degree of continuity), the model can effectively reconstruct the missing echoes, rather than completely failing in certain cases.
- In Section 2.2, why is it necessary to separately establish an independent validation set to evaluate the model’s ability in reconstructing the missing echoes?
Reply: Since this paper adopts an unmasked learning framework, the model will simultaneously reconstruct both the missing areas and the non-missing areas during the training process. Therefore, in the conventional validation process, the overall evaluation metrics are prone to be influenced by a large number of non-missing areas. The non-missing areas usually have high quality, and their reconstruction errors are relatively small. Thus, the overall metrics cannot fully reflect the model’s ability to restore the truly missing areas. Based on this, this paper sets up an independent validation set and focuses on evaluating the missing areas separately to more accurately measure the model’s performance in recovering missing echoes. At the same time, the results of this independent validation can also be used to assist in parameter selection and model optimization during the model training process, thereby improving the robustness of model.
- Why wasn’t the optimization of various hyperparameters carried out in the validation set during the period of March to May 2022?
Reply: Since the overall evaluation metrics were adopted in the validation set from March to May in 2022, and the non-missing areas accounted for a large proportion of the whole sample, the overall metrics were prone to be dominated by the non-missing areas, and thus could not fully reflect the model’s actual recovery ability for the missing areas. Therefore, this paper additionally established an independent validation mechanism for missing areas to further evaluate the model’s recovery performance in the missing areas and assist in the optimization of relevant hyperparameters.
- In the method of constructing the loss dataset, if the radar echoes within the selected area are set to zero using rectangles, will this cause the model to be more inclined to repair the rectangular missing areas?
Reply: This study does not rely on explicit missing mask during training and inference. This means that when the model is learning, it does not know whether the zero values in the input are real no radar echoes or missing rectangular areas. It treats all zero values uniformly, that is, by inferring their most likely values by capturing the intrinsic spatiotemporal evolution patterns and structural features of the radar echo sequence. If the model develops a preference for repairing rectangles, it first needs to be able to distinguish and identify the rectangular boundaries, which is precisely the information that this study deliberately avoids allowing the model to learn through mask-free training.
- Why was PixelShuffle used in the model design?
Reply: PixelShuffle can not only reduce the artifacts caused by upsampling, but also more effectively restore the spatial boundaries and high-frequency structures of radar echoes, while balancing the reconstruction quality and computational efficiency. Specifically, 1. It reduces the problem of checkerboard artifacts caused by the uneven coverage of the convolution kernel during deconvolution. 2. It can better preserve spatial boundaries and fine-scale structures by enabling feature learning in the low-resolution feature space and then restoring the spatial resolution through a channel-to-space rearrangement operation. This is more conducive to maintaining the continuity of echo boundaries and strong convective structures. 3. Compared with interpolation-based methods, PixelShuffle enables learnable feature reconstruction while maintaining higher computational efficiency.
- The weighted L1 loss in this paper adopts an artificial discrete weight design based on the reflectance threshold. Could you please explain the basis for setting these weights? Additionally, is this discontinuous segmented weight likely to cause unstable gradients during the optimization process and subsequently lead to a higher FAR in the experiments?
Reply: The design of the weighted L1 loss is mainly aimed at addressing the severe imbalance issue of strong and weak radar echoes in the samples. Since although the strong echo areas account for a relatively small proportion in the samples, they have higher importance in meteorological services, this paper increases the loss weight of the strong echo areas to guide the model to pay more attention to the recovery ability of the strong convective structure and the high reflectivity core area. The current weight design is indeed an empirical solution; however, the ablation experiment results show that this design can effectively improve the CSI and POD under strong echo conditions, indicating its effectiveness in enhancing the recovery ability of strong radar echoes. Moreover, we performed an ablation study comparing the standard L1 loss against the weighted L1 loss. The results consistently demonstrated that the standard L1 loss performs worse than the weighted L1 loss in terms of metrics such as PSNR, SSIM, and CSI.
- Why not adopt the model learning with missing mask instead of using mask-free learning?
Reply: In the actual application of radar mosaic data, the positions, shapes and temporal continuity of the missing radar echoes usually exhibit strong randomness and irregularity. Moreover, high-quality and accurate missing mask is often difficult to obtain stably in many cases. If the model overly relies on explicit mask information, it is prone to developing a dependency on specific missing patterns, thereby reducing its generalization ability for unknown missing patterns. At the same time, the traditional mask-based restoration methods essentially belong to local completion under the condition of known missing positions. The model needs to know in advance which areas have missing data. This study aims to explore a more general spatiotemporal sequence reconstruction framework, enabling the model to rely solely on the spatiotemporal evolution laws of the radar echoes, autonomously identify and restore abnormal areas, thereby reducing its reliance on prior missing information. Of course, we also recognize that mask-free learning still has certain limitations. For example, in large-scale weak echo or long-duration weak precipitation scenarios, there may be some confusion between the real low-value areas and the missing areas. Therefore, in future work, we will further explore related research such as uncertainty-aware modeling, attention-guided spatiotemporal restoration, etc.
- Will post-processing result in discontinuity at the boundaries between the missing and non-missing regions?
Reply: Since the model adopts an overall reconstruction strategy, the missing areas and the non-missing areas establish temporal and spatial consistency during the restoration process. Therefore, in the post-processing step, only replacing the original values of the non-missing areas will not cause obvious boundary discontinuities. Moreover, the visualization results in the experiment show that the model can maintain good spatial continuity and structural coordination near the missing boundaries.
- In the non-missing area impact analysis experiment of section 4.3, the statement “Collectively, these results demonstrate that the post-processing strategy effectively preserves structural fidelity in non-missing regions while maintaining robust reconstruction capability in missing regions.” is not very appropriate. From the results, it only relates to the non-missing areas and does not address the missing areas.
Reply: Thank you for your reminder. The description here is indeed not precise enough. The sentence should be revised as follows: “Collectively, these results demonstrate that the post-processing strategy effectively preserves structural fidelity in non-missing regions while not affecting the reconstruction results in the missing areas, thereby improving the quality of the final restoration outcome.”
- This paper mainly verified the model’s adaptability to different lengths of missing data and changes in temporal continuity. But do these missing patterns overlap with the missing distribution during the training stage? Could the model’s generalization ability be derived from the memory of the statistical characteristics of the missing patterns rather than truly learning the spatiotemporal evolution laws?
Reply: The generalization experiments in this paper mainly focus on the model’s adaptability to different missing lengths and changes in temporal continuity. Although the training stage employed random missing simulation, the testing stage further designed different lengths and different temporal continuities of missing echoes to minimize the direct correspondence between the training and testing missing patterns. Moreover, due to the use of the mask-free learning framework, the model cannot directly rely on fixed missing positions or shapes to complete the restoration. Therefore, its restoration ability mainly stems from the learning of the spatiotemporal evolution laws of radar echoes rather than simple memorization of specific missing patterns. The experimental results show that even in cases of longer continuous missing, the CSI and POD of the model still show a certain decline, but overall remain relatively stable, indicating that the model possesses certain spatiotemporal dynamic modeling capabilities and robustness in missing pattern recovery.
Citation: https://doi.org/10.5194/egusphere-2026-1422-AC1 -
RC2: 'Comment on egusphere-2026-1422', Anonymous Referee #2, 08 Jun 2026
reply
The manuscript proposes BiConvLSTM-UNet, a deep-learning model for restoring missing echoes in weather radar mosaic sequences. The main claim is that many existing deep-learning infilling methods rely on fixed or explicitly provided missing-value masks, while the proposed method reconstructs missing radar echoes without using such a mask during training or inference. The model combines a U-Net architecture with bidirectional ConvLSTM units and is trained with a weighted L1 and MS-SSIM loss. The authors also introduce synthetic missing-data scenarios and a post-processing step to preserve non-missing regions. The topic is relevant, but several methodological points require clarification.
Major comments
-
The data only cover March to September. If the method is intended only for the rainy season, this should be stated clearly. Otherwise, the authors should test whether the model generalizes to other months, including winter or dry-season cases, to rule out overfitting to seasonal echo patterns.
-
The paper should explain how complete radar samples were selected before artificial gaps were added. Were samples with real missing sectors removed? How was completeness checked?
-
Values below 10 dBZ are set to 0, and missing regions are also encoded as 0. This makes true no-echo pixels indistinguishable from missing pixels. The authors should test whether the model introduces false echoes in valid zero-valued regions, especially because FAR is relatively high.
-
The post-processing step naturally improves RMSE and MAE in non-missing regions because observed values are copied back. This does not prove that the final fields are physically consistent. The authors should evaluate possible discontinuities or non-physical structures near the boundaries of reconstructed regions.
-
The evaluation mainly uses synthetic missing regions. The authors should test the method on real missing radar patterns, at least qualitatively. Otherwise, it remains unclear how well the model generalizes to operational cases.
-
The novelty should be demonstrated more directly. Since a missing-value mask provides useful information, removing it may reduce performance. A comparison with an otherwise similar mask-conditioned model would clarify whether the mask-free approach is beneficial or only operationally convenient.
-
Pixel-wise losses can smooth sharp convective structures. The authors should analyze whether the model underestimates peak reflectivity, smooths intense cells or loses small-scale structures, especially at higher reflectivity thresholds.
Recommendation
The paper addresses an important problem and the mask-free formulation is potentially useful. However, I recommend revision. The most important issues are the ambiguity between zero and missing values, the lack of testing on real missing patterns and potentially a direct comparison with mask-conditioned approaches.
Citation: https://doi.org/10.5194/egusphere-2026-1422-RC2 -
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 244 | 88 | 16 | 348 | 21 | 25 |
- HTML: 244
- PDF: 88
- XML: 16
- Total: 348
- BibTeX: 21
- EndNote: 25
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Facing the problem of missing echo in radar mosaic data that often occurs in practical applications. A sequence reconstruction model for radar missing echo restoration is constructed in this study. Owing to the proposed reconstruction framework—which operates without reliance on missing masks—and the training strategy incorporating randomized missing patterns, the model demonstrates enhanced restoration performance and robust generalization capability. Furthermore, this approach also has the potential for operational applications. Nevertheless, there remain certain aspects that could be further refined in this study.
1. As described in Abstract, “Moreover, the model sustains consistent performance across varying missing data lengths and continuity patterns, indicating robust generalization capabilities.”, the statement of “consistent performance” is based on the experimental results in section 4.4. However, the numerical values are not the same. How should this be understood?
2.In Section 2.2, why is it necessary to separately establish an independent validation set to evaluate the model’s ability in reconstructing the missing echoes?
3. Why wasn't the optimization of various hyperparameters carried out in the validation set during the period of March to May 2022?
4. In the method of constructing the loss dataset, if the radar echoes within the selected area are set to zero using rectangles, will this cause the model to be more inclined to repair the rectangular missing areas?
5. Why was PixelShuffle used in the model design?
6. The weighted L1 loss in this paper adopts an artificial discrete weight design based on the reflectance threshold. Could you please explain the basis for setting these weights? Additionally, is this discontinuous segmented weight likely to cause unstable gradients during the optimization process and subsequently lead to a higher FAR in the experiments?
7. Why not adopt the model learning with missing mask instead of using mask-free learning?
8. Will post-processing result in discontinuity at the boundaries between the missing and non-missing regions?
9. In the non-missing area impact analysis experiment of section 4.3, the statement "Collectively, these results demonstrate that the post-processing strategy effectively preserves structural fidelity in non-missing regions while maintaining robust reconstruction capability in missing regions." is not very appropriate. From the results, it only relates to the non-missing areas and does not address the missing areas.
10. This paper mainly verified the model’s adaptability to different lengths of missing data and changes in temporal continuity. But do these missing patterns overlap with the missing distribution during the training stage? Could the model’s generalization ability be derived from the memory of the statistical characteristics of the missing patterns rather than truly learning the spatiotemporal evolution laws?