Preprints
https://doi.org/10.5194/egusphere-2025-112
https://doi.org/10.5194/egusphere-2025-112
12 Mar 2025
 | 12 Mar 2025
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

Assessment of gap-filling techniques applied to satellite phytoplankton composition products for the Atlantic Ocean

Ehsan Mehdipour, Hongyan Xi, Alexander Barth, Aida Alvera-Azcárate, Adalbert Wilhelm, and Astrid Bracher

Abstract. Phytoplankton are vital to marine biogeochemical cycles and form the base of the marine food web. Comprehensive datasets offering a spatiotemporal perspective on phytoplankton composition are essential for assessing the impacts of climate change on marine ecosystems. Phytoplankton functional types (PFTs) classify phytoplankton based on their biogeochemical functions, enabling assessments of nutrient cycling, primary productivity, and ecosystem structure. However, satellite-derived ocean colour products like PFTs chlorophyll-a (Chla) concentrations are challenged by limited temporal and spatial coverage due to the exclusion of data collected under non-optimal observing conditions such as strong sun glint, clouds, thick aerosols, straylight, and large viewing angles or due to the specific sensor configuration and sensor malfunction. This highlights the importance of gap-filling techniques for producing consistent datasets, which are currently missing for operational data sets. This study evaluates two robust gap-filling methods for satellite observations: Data Interpolating Empirical Orthogonal Functions (DINEOF) and Data Interpolating Convolutional Auto Encoder (DINCAE). These methods were applied to Sentinel 3A/B OLCI-derived Chla concentration products in several regions of the Atlantic Ocean over three years of data, including total chlorophyll-a (TChla) and Chla concentration of five major PFTs, namely diatoms, dinoflagellates, haptophytes, green algae, and prokaryotic phytoplankton. The reconstructed datasets were assessed using test dataset evaluation and validated with in situ measurements collected during the transatlantic RV Polarstern expedition PS113 in 2018. The test dataset evaluation indicates that DINCAE outperforms DINEOF, particularly in capturing transient-scale features. DINCAE achieves an average root-mean-square-logarithmic-error (RMSLE) in cross-validation that is 66 % lower for TChla and 16 % lower for PFTs compared to DINEOF. However, external validation using in situ measurements indicates better performance for DINEOF than DINCAE, with improved regression metrics for PFTs, including a 12.5 % better slope, 13.6 % better intercept, and 68 % higher coefficient of determination (R²). The gap-filled datasets exhibit slightly reduced but still robust accuracy compared to the original satellite data while preserving statistical trends, improving spatial structure restoration, and increasing matchup data for validation. It is concluded that DINCAE and DINEOF each have unique strengths for gap-filling ocean colour products. DINCAE performs well in complex water bodies, effectively reproducing patterns from the original satellite product. In contrast, DINEOF shows higher overall reliability, supported by independent validation, and is better suited for larger areas due to its lower computational demands.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.
Share
Ehsan Mehdipour, Hongyan Xi, Alexander Barth, Aida Alvera-Azcárate, Adalbert Wilhelm, and Astrid Bracher

Status: open (until 07 May 2025)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Ehsan Mehdipour, Hongyan Xi, Alexander Barth, Aida Alvera-Azcárate, Adalbert Wilhelm, and Astrid Bracher

Interactive computing environment

Preprocessing, processing and post-processing scripts and environment Ehsan Mehdipour https://github.com/EhsanMehdipour/PFT_gapfilling

Ehsan Mehdipour, Hongyan Xi, Alexander Barth, Aida Alvera-Azcárate, Adalbert Wilhelm, and Astrid Bracher

Viewed

Total article views: 57 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
41 13 3 57 6 1 1
  • HTML: 41
  • PDF: 13
  • XML: 3
  • Total: 57
  • Supplement: 6
  • BibTeX: 1
  • EndNote: 1
Views and downloads (calculated since 12 Mar 2025)
Cumulative views and downloads (calculated since 12 Mar 2025)

Viewed (geographical distribution)

Total article views: 79 (including HTML, PDF, and XML) Thereof 79 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 18 Mar 2025
Download
Short summary
Phytoplankton are vital for marine ecosystems and nutrient cycling, detectable by optical satellites. Data gaps caused by clouds and other non-optimal conditions limit comprehensive analyses like trend monitoring. This study evaluated DINCAE and DINEOF gap-filling methods for reconstructing chlorophyll-a datasets, including total chlorophyll-a and five major phytoplankton groups. Both methods showed robust reconstruction capabilities, aiding pattern detection and long-term ocean colour analysis.
Share