Preprints
https://doi.org/10.5194/egusphere-2025-3901
https://doi.org/10.5194/egusphere-2025-3901
15 Aug 2025
 | 15 Aug 2025
Status: this preprint is open for discussion.

SMN-AgroCLA: Comparison of Different Normalization Methods for Improving Rice Yield Prediction Accuracy Using Remote Sensing Data in Eastern China from 2008 to 2017

Li Liu, Kehan Zhang, Yang Zhang, Tao Lin, Weiwei Sun, Hang Sun, Peilin Song, Weiwei Liu, Biao Xiong, Dong Ren, and Jingfeng Huang

Abstract. Yield prediction is crucial for national food security and the formulation of trade policies. Most deep learning (DL) models rely on normalization methods to process input data, aiming to enhance the stability of model training and accelerate convergence speed. However, the importance of data preprocessing (i.e., input data normalization) in DL-based yield prediction is underemphasized. Furthermore, conventional methods fail to address distortions in feature scaling caused by extreme values, such as abnormally high precipitation, leading to increased prediction errors. In this study, we proposed a Sequential Midrange Normalization (SMN) method and integrated it with the newly designed Agricultural-CNN-LSTM-Attention (AgroCLA) model, collectively termed the SMN-AgroCLA framework, to improve rice yield prediction accuracy under extreme weather conditions. To validate the efficacy of the SMN, we compared it with four other commonly used normalization methods and conducted yield prediction experiments across six different DL models, by using Moderate Resolution Imaging Spectroradiometer, Global Precipitation Measurement and other multi-source remote sensing data of the Eastern China from 2008–2017. The results shown that SMN method consistently outperformed superior yield prediction performance even in years affected by extreme meteorological disasters (e.g., 2015), achieving an R² of 0.815, which was 17.3 % higher than the next best method, ZSN (Z-Score Normalization). Based on SMN, the accuracy and generalization of all models were optimized, with the AgroCLA achieved the highest accuracy (with R²=0.841). Additionally, the model's performance peaked around the flowering stage (around mid-August, R² =0.859), which is two months ahead of the harvest season. This study demonstrates the critical role of data normalization in deep learning-based yield prediction and offers a practical solution to mitigate the threat of increasing extreme meteorological disasters to food security.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Li Liu, Kehan Zhang, Yang Zhang, Tao Lin, Weiwei Sun, Hang Sun, Peilin Song, Weiwei Liu, Biao Xiong, Dong Ren, and Jingfeng Huang

Status: open

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Li Liu, Kehan Zhang, Yang Zhang, Tao Lin, Weiwei Sun, Hang Sun, Peilin Song, Weiwei Liu, Biao Xiong, Dong Ren, and Jingfeng Huang
Li Liu, Kehan Zhang, Yang Zhang, Tao Lin, Weiwei Sun, Hang Sun, Peilin Song, Weiwei Liu, Biao Xiong, Dong Ren, and Jingfeng Huang

Viewed

Total article views: 229 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
202 24 3 229 3 1
  • HTML: 202
  • PDF: 24
  • XML: 3
  • Total: 229
  • BibTeX: 3
  • EndNote: 1
Views and downloads (calculated since 15 Aug 2025)
Cumulative views and downloads (calculated since 15 Aug 2025)

Viewed (geographical distribution)

Total article views: 217 (including HTML, PDF, and XML) Thereof 217 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 13 Sep 2025
Download
Short summary
Appropriate normalization methods can mitigate feature scaling distortion caused by extreme values, which is crucial for improving the accuracy of rice yield prediction. Based on data from East China from 2008 to 2017, this study input five data normalization methods (ZSN, MN, MMN, CIN, and SMN) into six models (CNN, LSTM, SSTNN, ALSTM, DDCN, and AgroCLA).
Share