Preprints
https://doi.org/10.5194/egusphere-2025-4613
https://doi.org/10.5194/egusphere-2025-4613
19 Nov 2025
 | 19 Nov 2025
Status: this preprint is open for discussion.

SimTA: A dual-polarization SAR time series rice mapping model based on deep feature-level fusion and spatio-temporal attention

Li Liu, Jiaxuan Liang, Dong Ren, and Jingfeng Huang

Abstract. Accurate large-scale crop mapping is critical for yield prediction, agricultural disaster monitoring, and global food security. Synthetic Aperture Radar (SAR), with its all-weather, day-and-night imaging capability, plays a vital role in remote sensing based crop mapping studies. However, most existing studies fuse VV and VH polarization channels at the data level, overlooking channels' differences in signal-to-noise characteristics and temporal dynamics, which results in rice feature redundancy or conflicts, particularly at rice field edges and in heterogeneous regions, thereby increasing misclassifications error. To address these challenges, this study proposes a novel Spatiotemporal Attention Model (SimTA) for rice mapping. (1) A VV-VH feature-level fusion scheme is designed, integrated with a Content-Guided Attention (CGA) fusion method which effectively exploits the complementary information of the dual-polarized SAR data for achieving deep spatiotemporal dynamics fusion. (2) A Central Difference Convolution Spatial Extraction Conv (CDCSE Conv) Block is designed, enhancing sensitivity to edge variations of rice field by combining standard and central difference convolutions. (3) To achieve efficient spatiotemporal feature integration across SAR time series, a Temporal-Spatial Attention (TSA) Block is developed, utilizing large-kernel convolutions for spatial feature extraction and a squeeze-and-excitation mechanism for capturing long-range temporal dependencies of rice time series. Extensive experiments were conducted by comparing SimTA with different models under five fusion schemes. Results demonstrate that feature-level fusion consistently outperforms other schemes, with SimTA achieving the best performance: OA = 91.1 %, F1 Score = 90.9 %, and mIoU = 86.2 %. Compared to the baseline SimVP, SimTA improves F1 Score and mIoU by 0.8 % and 2.1 %, respectively. The CGA enhanced feature-level fusion further boosts SimTA's performance to OA = 91.5 % and F1 = 91.4 %. SimTA bridges the gap between existing VV-VH deep fusion schemes and modern spatiotemporal modeling demands, offering a more accurate and generalizable approach for large-scale rice mapping. 

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Li Liu, Jiaxuan Liang, Dong Ren, and Jingfeng Huang

Status: open

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Li Liu, Jiaxuan Liang, Dong Ren, and Jingfeng Huang
Li Liu, Jiaxuan Liang, Dong Ren, and Jingfeng Huang

Viewed

Total article views: 122 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
67 47 8 122 4 4
  • HTML: 67
  • PDF: 47
  • XML: 8
  • Total: 122
  • BibTeX: 4
  • EndNote: 4
Views and downloads (calculated since 19 Nov 2025)
Cumulative views and downloads (calculated since 19 Nov 2025)

Viewed (geographical distribution)

Total article views: 122 (including HTML, PDF, and XML) Thereof 122 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 01 Dec 2025
Download
Short summary
(1) A VV-VH feature-level fusion scheme is designed, integrated with a Content-Guided Attention (CGA) fusion method. (2) A Central Difference Convolution Spatial Extraction Conv (CDCSE Conv) Block of SimTA is designed for effectively enhancing model's sensitivity. (3) A Temporal-Spatial Attention (TSA) Block of SimTA.
Share