the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Direct-sun versus Sky-Scan Pandora Formaldehyde Retrievals: Implications for OMI Validation in Tropical Southeast Asia
Abstract. Ground-based Pandora spectrometers are widely used for validating satellite formaldehyde (HCHO) retrievals; however, the influence of scanning geometry and spatiotemporal representativeness remains insufficiently quantified in tropical environments. This study evaluates Pandora Level-2 HCHO total vertical columns from five Southeast Asian stations (Bangkok, Bandung, Agam, Pontianak, and Singapore-NUS) over 2021–2025, comparing Direct-sun and Sky-scan retrievals and assessing their consistency with OMI Aura observations. HCHO distributions exhibit strong inter-site variability and pronounced skewness, with Direct-sun retrievals showing higher medians and substantially larger variance than Sky-scan observations. Mean Direct-sun HCHO columns are strongly influenced by episodic enhancements at biomass-burning-affected sites, particularly Agam, whereas Sky-scan retrievals display lower central values and reduced variability, consistent with broader atmospheric sampling and diminished sensitivity to localized plumes. Satellite–ground comparisons are conducted using nine spatiotemporal averaging configurations that vary OMI spatial footprints (nearest grid, 3 × 3, and 5 × 5) and Pandora temporal averaging. Direct-sun comparisons generally yield weak or unstable correlations (R ≈ −0.1 to 0.3) and large errors (RMSE ≈ 8–14 × 1015 molecules cm-2). In contrast, Sky-scan retrievals show systematically improved agreement, with optimized configurations achieving RMSE values of ~5 × 1015 molecules cm⁻², MAE of ~4–7 × 1015 molecules cm⁻², and moderate positive correlations (R ≈ 0.4–0.6) at several sites. Solar zenith angle–dependent analysis reveals persistent positive biases in Direct-sun retrievals (~10–20 × 1015 molecules cm-2), while Sky-scan retrievals exhibit near-zero bias at low to moderate SZAs and substantially reduced extremes. Overall, the results demonstrate that scanning geometry exerts a first-order control on Pandora–OMI consistency in the tropics, with Sky-scan observations providing a more spatially representative reference for satellite validation, although optimal configurations remain site dependent.
- Preprint
(2617 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 31 Mar 2026)
-
RC1: 'Comment on egusphere-2026-716', Anonymous Referee #1, 12 Mar 2026
reply
The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2026/egusphere-2026-716/egusphere-2026-716-RC1-supplement.pdfReplyCitation: https://doi.org/
10.5194/egusphere-2026-716-RC1 -
RC2: 'Comment on egusphere-2026-716', Anonymous Referee #2, 16 Mar 2026
reply
The authors present a comparison of ground based HCHO retrievals from 5 Pandora sites with retrievals from OMI. The authors compare both direct sun and sky scan retrievals from Pandora to OMI and use those comparisons to suggest the use of sky scans would be preferable for satellite validation in the tropics. This work presents interesting data from an understudied part of the world. Unfortunately, the manuscripts conclusions rely on an incomplete analysis and questionable experimental design. Substantial revisions are needed before this work can be published.Major issues:This paper doesn't work without a more robust intercomparison of the two ground-based datasets to support the conclusions. Comparing both to OMI and discussing the differences between each and OMI is not sufficient. The authors point out that SZA dependent uncertainties exist in OMI products, so why are the authors conducting their analysis assuming OMI is the more trustworthy observation? We use ground-based measurements to evaluate satellite-based retrievals, not the other way around. How do the retrieved columns compare to each other? What are the conditions where they diverge from each other. What are the conditions when the direct sun and sky scan agree and disagree? Are there potential explanations that might impact the utility of each for satellite validation?Sky scan retrievals are not sensitive to the whole column, one would expect that the direct sun retrieval would typically be higher since it is sensitive to the whole column. Sky scans that use a temporally local zenith reference are typically only sensitive to the lowest 2 km of the atmosphere. This doesn't necessarily imply mean direct sun retrievals are "highly sensitive to episodic enhancements" as a general rule, but they are more likely to pick up lofted plumes than a sky scan observation where the plume would impact the reference spectrum and not impact the measured slant columns in the same way as a direct sun observation.The chosen experiments for comparison don't all have utility for satellite evaluation, so it is unclear why these 9 scenarios were chosen.
- Given that formaldehyde columns generally have a strong temperature and sunlight dependence and thus vary throughout the day, I'm unclear what the utility of daily averaging is in a satellite evaluation context, where the overpass time is known. Most studies just consider the average around the overpass time .
- The daytime averaging period is given as 07-09 local, is this a typo or should the label be changed from daytime to early morning? Assuming a typo, for measurements that require sunlight, what is the utility of separate daytime and daily averages? Aren't they pretty much the same aside from some less reliable measurements in low light conditions that would typically be discarded anyway?
- Similarly, given Rayleigh scattering limits the effective horizontal pathlength of the Pandora measurements to ~20 km under clear sky conditions in this fit window, it makes little sense to average over 2 adjacent OMI pixels (5x5) for comparison as the Pandora may not even be sampling adjacent pixels let alone two over.
There doesn't appear to be sufficient data quality checks done on the Pandora data. While there is a case to be made for not relying solely on Pandora L2 QC flags (e.g. Rawat et al 2025), one should still check fit quality (RMS) and do cloud screening before comparing to satellite based measurements. For example, the statistics presented for your retrievals at Agam show unrealistically large columns with no explanation. Are these actual events or retrieval artifacts?It looks like the Pandora data are filtered when making the Figures 4 and 5, but not when calculating the statistics in Table 3. You need consistent treatment throughout the analysis.Minor PointsFigures 2 and 3: I think your analysis would be better served by correlation plots of these data rather than frequency distributions. If the authors want to present frequency distributions, all three retrievals should be present on the same axis for each site, so the reader can more easily compare the distributions.Line 241: Elevated has an ambiguous meaning when discussing atmospheric measurements, do you mean aloft or enhanced relative to background.OMI while providing a long timeseries is not really the most widely used HCHO product used these days, the community would likely find more benefit from comparisons with TROPOMI and GEMS. Spatial averaging can be utilized to deal with the Pandora path crossing multiple pixels.References:Dimitropoulou, E., Hendrick, F., Friedrich, M. M., Tack, F., Pinardi, G., Merlaud, A., et al. (2022). Horizontal distribution of tropospheric NO2 and aerosols derived by dual-scan multi-wavelength multi-axis differential optical absorption spectroscopy (MAX-DOAS) measurements in Uccle, Belgium. Atmospheric Measurement Techniques, 15(15), 4503–4529. https://doi.org/10.5194/amt-15-4503-2022Rawat, P., Crawford, J. H., Travis, K. R., Judd, L. M., Demetillo, M. A. G., Valin, L. C., et al. (2025). Maximizing the scientific application of Pandora column observations of HCHO and NO2. Atmospheric Measurement Techniques, 18(13), 2899–2917. https://doi.org/10.5194/amt-18-2899-2025Citation: https://doi.org/10.5194/egusphere-2026-716-RC2
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 152 | 43 | 13 | 208 | 12 | 25 |
- HTML: 152
- PDF: 43
- XML: 13
- Total: 208
- BibTeX: 12
- EndNote: 25
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1