Direct-sun versus Sky-Scan Pandora Formaldehyde Retrievals: Implications for OMI Validation in Tropical Southeast Asia

Arul, Santanasawry A. L. David; Chang, Jackson Hian-Wui; Wong, Yong Jie; Ooi, Maggie Chel-Gee; Liew, Juneng; Chee, Fuei Pien; Dayou, Jedol; Sentian, Justin; Aryastana, Putu; Lin, Neng-Huei

doi:10.5194/egusphere-2026-716

Preprints

https://doi.org/10.5194/egusphere-2026-716

Preprints

23 Feb 2026

| 23 Feb 2026

Status: this preprint is open for discussion and under review for Atmospheric Measurement Techniques (AMT).

Direct-sun versus Sky-Scan Pandora Formaldehyde Retrievals: Implications for OMI Validation in Tropical Southeast Asia

Santanasawry A. L. David Arul, Jackson Hian-Wui Chang, Yong Jie Wong, Maggie Chel-Gee Ooi, Juneng Liew, Fuei Pien Chee, Jedol Dayou, Justin Sentian, Putu Aryastana, and Neng-Huei Lin

Abstract. Ground-based Pandora spectrometers are widely used for validating satellite formaldehyde (HCHO) retrievals; however, the influence of scanning geometry and spatiotemporal representativeness remains insufficiently quantified in tropical environments. This study evaluates Pandora Level-2 HCHO total vertical columns from five Southeast Asian stations (Bangkok, Bandung, Agam, Pontianak, and Singapore-NUS) over 2021–2025, comparing Direct-sun and Sky-scan retrievals and assessing their consistency with OMI Aura observations. HCHO distributions exhibit strong inter-site variability and pronounced skewness, with Direct-sun retrievals showing higher medians and substantially larger variance than Sky-scan observations. Mean Direct-sun HCHO columns are strongly influenced by episodic enhancements at biomass-burning-affected sites, particularly Agam, whereas Sky-scan retrievals display lower central values and reduced variability, consistent with broader atmospheric sampling and diminished sensitivity to localized plumes. Satellite–ground comparisons are conducted using nine spatiotemporal averaging configurations that vary OMI spatial footprints (nearest grid, 3 × 3, and 5 × 5) and Pandora temporal averaging. Direct-sun comparisons generally yield weak or unstable correlations (R ≈ −0.1 to 0.3) and large errors (RMSE ≈ 8–14 × 10¹⁵ molecules cm^-2). In contrast, Sky-scan retrievals show systematically improved agreement, with optimized configurations achieving RMSE values of ~5 × 10¹⁵ molecules cm⁻², MAE of ~4–7 × 10¹⁵ molecules cm⁻², and moderate positive correlations (R ≈ 0.4–0.6) at several sites. Solar zenith angle–dependent analysis reveals persistent positive biases in Direct-sun retrievals (~10–20 × 10¹⁵ molecules cm^-2), while Sky-scan retrievals exhibit near-zero bias at low to moderate SZAs and substantially reduced extremes. Overall, the results demonstrate that scanning geometry exerts a first-order control on Pandora–OMI consistency in the tropics, with Sky-scan observations providing a more spatially representative reference for satellite validation, although optimal configurations remain site dependent.

Received: 06 Feb 2026 – Discussion started: 23 Feb 2026

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Santanasawry A. L. David Arul, Jackson Hian-Wui Chang, Yong Jie Wong, Maggie Chel-Gee Ooi, Juneng Liew, Fuei Pien Chee, Jedol Dayou, Justin Sentian, Putu Aryastana, and Neng-Huei Lin

Status: open (until 31 Mar 2026)

Post a comment Subscribe to comment alert

RC1: 'Comment on egusphere-2026-716', Anonymous Referee #1, 12 Mar 2026 reply

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2026/egusphere-2026-716/egusphere-2026-716-RC1-supplement.pdf
Reply

Citation: https://doi.org/10.5194/egusphere-2026-716-RC1
RC2:
'Comment on egusphere-2026-716', Anonymous Referee #2, 16 Mar 2026 reply
The authors present a comparison of ground based HCHO retrievals from 5 Pandora sites with retrievals from OMI. The authors compare both direct sun and sky scan retrievals from Pandora to OMI and use those comparisons to suggest the use of sky scans would be preferable for satellite validation in the tropics. This work presents interesting data from an understudied part of the world. Unfortunately, the manuscripts conclusions rely on an incomplete analysis and questionable experimental design. Substantial revisions are needed before this work can be published.

Major issues:

This paper doesn't work without a more robust intercomparison of the two ground-based datasets to support the conclusions. Comparing both to OMI and discussing the differences between each and OMI is not sufficient. The authors point out that SZA dependent uncertainties exist in OMI products, so why are the authors conducting their analysis assuming OMI is the more trustworthy observation? We use ground-based measurements to evaluate satellite-based retrievals, not the other way around. How do the retrieved columns compare to each other? What are the conditions where they diverge from each other. What are the conditions when the direct sun and sky scan agree and disagree? Are there potential explanations that might impact the utility of each for satellite validation?

Sky scan retrievals are not sensitive to the whole column, one would expect that the direct sun retrieval would typically be higher since it is sensitive to the whole column. Sky scans that use a temporally local zenith reference are typically only sensitive to the lowest 2 km of the atmosphere. This doesn't necessarily imply mean direct sun retrievals are "highly sensitive to episodic enhancements" as a general rule, but they are more likely to pick up lofted plumes than a sky scan observation where the plume would impact the reference spectrum and not impact the measured slant columns in the same way as a direct sun observation.

The chosen experiments for comparison don't all have utility for satellite evaluation, so it is unclear why these 9 scenarios were chosen.

Given that formaldehyde columns generally have a strong temperature and sunlight dependence and thus vary throughout the day, I'm unclear what the utility of daily averaging is in a satellite evaluation context, where the overpass time is known. Most studies just consider the average around the overpass time .

The daytime averaging period is given as 07-09 local, is this a typo or should the label be changed from daytime to early morning? Assuming a typo, for measurements that require sunlight, what is the utility of separate daytime and daily averages? Aren't they pretty much the same aside from some less reliable measurements in low light conditions that would typically be discarded anyway?

Similarly, given Rayleigh scattering limits the effective horizontal pathlength of the Pandora measurements to ~20 km under clear sky conditions in this fit window, it makes little sense to average over 2 adjacent OMI pixels (5x5) for comparison as the Pandora may not even be sampling adjacent pixels let alone two over.

There doesn't appear to be sufficient data quality checks done on the Pandora data. While there is a case to be made for not relying solely on Pandora L2 QC flags (e.g. Rawat et al 2025), one should still check fit quality (RMS) and do cloud screening before comparing to satellite based measurements. For example, the statistics presented for your retrievals at Agam show unrealistically large columns with no explanation. Are these actual events or retrieval artifacts?

It looks like the Pandora data are filtered when making the Figures 4 and 5, but not when calculating the statistics in Table 3. You need consistent treatment throughout the analysis.

Minor Points

Figures 2 and 3: I think your analysis would be better served by correlation plots of these data rather than frequency distributions. If the authors want to present frequency distributions, all three retrievals should be present on the same axis for each site, so the reader can more easily compare the distributions.

Line 241: Elevated has an ambiguous meaning when discussing atmospheric measurements, do you mean aloft or enhanced relative to background.

OMI while providing a long timeseries is not really the most widely used HCHO product used these days, the community would likely find more benefit from comparisons with TROPOMI and GEMS. Spatial averaging can be utilized to deal with the Pandora path crossing multiple pixels.

References:

Dimitropoulou, E., Hendrick, F., Friedrich, M. M., Tack, F., Pinardi, G., Merlaud, A., et al. (2022). Horizontal distribution of tropospheric NO2 and aerosols derived by dual-scan multi-wavelength multi-axis differential optical absorption spectroscopy (MAX-DOAS) measurements in Uccle, Belgium. Atmospheric Measurement Techniques, 15(15), 4503–4529. https://doi.org/10.5194/amt-15-4503-2022

Rawat, P., Crawford, J. H., Travis, K. R., Judd, L. M., Demetillo, M. A. G., Valin, L. C., et al. (2025). Maximizing the scientific application of Pandora column observations of HCHO and NO₂. Atmospheric Measurement Techniques, 18(13), 2899–2917. https://doi.org/10.5194/amt-18-2899-2025

Reply
Citation: https://doi.org/10.5194/egusphere-2026-716-RC2

Santanasawry A. L. David Arul, Jackson Hian-Wui Chang, Yong Jie Wong, Maggie Chel-Gee Ooi, Juneng Liew, Fuei Pien Chee, Jedol Dayou, Justin Sentian, Putu Aryastana, and Neng-Huei Lin

Viewed

Total article views: 208 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
152	43	13	208	12	25

HTML: 152
PDF: 43
XML: 13
Total: 208
BibTeX: 12
EndNote: 25

Views and downloads (calculated since 23 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	88	31	6	125
Mar 2026	64	12	7	83

Cumulative views and downloads (calculated since 23 Feb 2026)

Month	HTML	PDF	XML	Total
Feb 2026	88	31	6	125
Mar 2026	64	12	7	83

Viewed (geographical distribution)

Total article views: 210 (including HTML, PDF, and XML) Thereof 210 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 16 Mar 2026

Short summary

This study compares two ways of measuring formaldehyde in the air using ground instruments in tropical Southeast Asia and checks how well they match satellite observations. We show that one method, which looks at scattered sunlight, gives more stable and representative values than direct viewing of the sun. This improves confidence in satellite data and helps scientists better track air pollution and atmospheric change in tropical regions.


Total:	0
HTML:	0
PDF:	0
XML:	0