Unleashing the Potential of Geostationary Satellite Observations in Air Quality Forecasting Through Artificial Intelligence Techniques

Zhang, Chengxin; Niu, Xinhan; Wu, Hongyu; Ding, Zhipeng; Chan, Ka Lok; Kim, Jhoon; Wagner, Thomas; Liu, Cheng

doi:https://doi.org/10.5194/egusphere-2024-2620

Preprints

https://doi.org/10.5194/egusphere-2024-2620

Preprints

30 Aug 2024

| 30 Aug 2024

Unleashing the Potential of Geostationary Satellite Observations in Air Quality Forecasting Through Artificial Intelligence Techniques

Chengxin Zhang, Xinhan Niu, Hongyu Wu, Zhipeng Ding, Ka Lok Chan, Jhoon Kim, Thomas Wagner, and Cheng Liu

Abstract. Air quality forecasting plays a critical role in mitigating air pollution. However, current physics-based air pollution predictions encounter challenges in accuracy and spatiotemporal resolution due to limitations in the understanding of atmospheric physical mechanisms, observational constraints, and computational capacity. The world’s first geostationary satellite UV-Vis spectrometer, i.e., the Geostationary Environment Monitoring Spectrometer (GEMS), offers hourly measurements of atmospheric trace gas pollutants at high spatial resolution over East Asia. In this study, we successfully incorporate Geostationary satellite observations into a neural network model (GeoNet) to forecast full-coverage surface nitrogen dioxide (NO₂) concentrations over eastern China at 4-hour intervals for the next 24 hours. GeoNet leverages spatiotemporal series of satellite NO₂observations to capture the intricate relationships among air quality, meteorology, and emissions in both temporal and spatial domains. Evaluation against ground-based measurements demonstrates that GeoNet accurately predicts diurnal variations and spatial distribution details of next-day NO₂pollution, yielding the coefficient of determination of 0.68 and root mean square of error of 12.31 μg/m³, significantly surpassing traditional air quality model forecasts. The model’s interpretability reveals that geostationary satellite observations notably improve NO₂ forecast capability more than other input features, especially over polluted regions. Our findings demonstrate the significant potential of geostationary satellite observations in artificial intelligence-based air quality forecasting, with implications for early warning of air pollution events and human health exposure.

Received: 20 Aug 2024 – Discussion started: 30 Aug 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Preprint (PDF, 4501 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (4501 KB)

Supplement (2664 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

21 Jan 2025

Unleashing the potential of geostationary satellite observations in air quality forecasting through artificial intelligence techniques

Chengxin Zhang, Xinhan Niu, Hongyu Wu, Zhipeng Ding, Ka Lok Chan, Jhoon Kim, Thomas Wagner, and Cheng Liu

Atmos. Chem. Phys., 25, 759–770, https://doi.org/10.5194/acp-25-759-2025,https://doi.org/10.5194/acp-25-759-2025, 2025

Short summary

Chengxin Zhang, Xinhan Niu, Hongyu Wu, Zhipeng Ding, Ka Lok Chan, Jhoon Kim, Thomas Wagner, and Cheng Liu

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-2620', Anonymous Referee #1, 12 Sep 2024

Review of “Unleashing the Potential of Geostationary Satellite Observations in Air Quality Forecasting Through Artificial Intelligence Techniques” by Zhang et al.

Major Comments

This study by Zhang et al. entitled “Unleashing the Potential of Geostationary Satellite Observations in Air Quality Forecasting Through Artificial Intelligence Techniques” presents a new machine-learning framework – GeoNet – that synthesizes geostationary observations of columnar NO₂ from the Geostationary Environment Monitoring Spectrometer (GEMS) with meteorological parameters to forecast surface-level NO₂in East China. Overall, this study represents a significant advancement in surface-level pollution forecasting given its use of the unprecedented hourly data provided by GEMS. I believe that this manuscript is well-written and consistent; however, I have a few comments below.

First, if possible, it would be useful to validate the GEMS observations using ground-based spectrometers (e.g., PGN) specifically for the study region and time period. Additionally, unless I missed it, I don’t believe the time periods for model training and validation were ever stated; if this is the case they should be added to the main text. Second, when investigating feature importance, it would be useful to also identify variability in the feature importance to uncover whether some components are more stable than others in GeoNet and to identify if the significance of geostationary observations is consistent across different days and seasons. Lastly, I suggest that the authors update their analysis in Figure 4 to include the GeoNet predictions regridded to the CAMS grid to identify how much of the improvement in predictions is attributable specifically to enhancements in spatial resolution.

I have included line-specific comments below:

Minor Comments

L53-54: While I agree with this statement, it should be mentioned that for air pollution forecasting to facilitate health benefits, infrastructure needs to be created that communicate risks and appropriate responses to risks to the public.

L55: I think you can drop the second limited in this line.

L75: Maybe it would be useful to give an example or two here (i.e., TROPOMI + OMI).

L78-81: Another limitation of the polar orbiting satellites that is worth mentioning is that typically (at least in the case of TROPOMI) the satellite observes at roughly the same time of day (early afternoon) which makes it difficult to predict concentrations at other times of the day with different meteorological (boundary layer height) and photochemical conditions.

L92: It would be better to describe GEMS as having “unprecedented temporal and spatial resolution andcoverage” as ground-level monitors can observe hourly NO₂ but are limited in time and aircraft remote-sensing can observe NO₂ at sub hourly resolution but over a limited temporal coverage (usually a few days or weeks). The resolution alone isn’t necessarily unique but rather than combined spatial + temporal resolution with extended spatial and temporal coverage.

L117-120: Were you able to validate these data for the study time period / domain? If possible, it may be useful to compare GEMS to ground-based spectrometers in the study domain to get an idea of performance.

L207-208: I don’t think you need this sentence as it is already mentioned in the methods section.

Figure 3: It would be interesting to present the variance of these different components as well in a). Are these importance values pretty consistent regardless of season and day, or do they vary substantially day to day?

Figure 4: Have you assessed how much of the reductions in performance are attributable to resolution? If not, I suggest regridding the GeoNet prediction to the resolution of CAMS and comparing this “GeoNET_coarse” product to the observations to characterize how much of the improved performance is attributable to enhanced spatial resolution.

Figure 5: The colorbar in a is not labeled, and throughout the font is small (especially in the yaxis of c and d), I suggest updating to improve readability.

L338-339: I don’t believe the timeframe of this study was mentioned at all in the main text. What months / years was this prediction trained on and for what period was it validated?

Citation: https://doi.org/10.5194/egusphere-2024-2620-RC1
- AC1: 'Reply on RC1', Chengxin Zhang, 30 Oct 2024
  
  We thank the reviewer for the helpful comments. We have addressed their concerns in the response letter. Please refer to the attached PDF (also appended the revised manuscript with tracked changes).
  
  Citation: https://doi.org/10.5194/egusphere-2024-2620-AC1
RC2:
'Comment on egusphere-2024-2620', Anonymous Referee #2, 15 Oct 2024

The authors attempted to improve the short-term prediction of surface NO2 at a high spatial and temporal resolution by taking advantage of the GEMS NO2 prodcuts and a neural network model. They successfully forecasted full-coverage surface NO2 for the next 24 hours and identifed the critical role of GEMS NO2. Their results demonstrate the potential application of the GEMS products in air quality prediction.

Overall, this is an important study and the results presented here will be useful for future applications of GEMS products as well as the geostationary satellite observations. I am happy to see its publication in due course. However, before that, I still have a few concerns or suggestions for the authors.

I would sugget to move the model configuration and optimization into the main text. This will be very helpful for readers to understand the model.

In the handling of missing data, the authors tried to set them to a fill value of zero. Is it reasonable? It looks reasonable to fill values of diurnal NO₂ climatology (e.g., the seasonal mean diurnal NO2). In addition, as shown in Fig.2, it looks the three methods of handling missing data perform similarly in term of R2 and RMSE. So I don’t think it is necessary to highlight the “weakest” or “strongest” configuration.

I would also suggest to move Fig.S12 in the main text which shows the advantage of GEMS measurements.

The authors also show that the performance of GeoNet model degrades notably after t+16h. Is there any possible solution to overcome this short predicability?

As mentioned in my last comment, the authors also highlight the possible applications to other air pollutants. However, the chemistry and lifetime of other air pollutants might be very different from NO2. For example, if the GEMS tropospheric ozone measurement is useful for the prediction of surface ozone? Some more detailed discussions on the possible application to other pollutants would be very insightful.

BTW, I am not sure whether it is necessary to include NO2 in the Title of this manuscript since the authors didn’t talk too much about major air pollutants (i.e., PM2.5 and ozone)

Citation: https://doi.org/10.5194/egusphere-2024-2620-RC2
- AC2: 'Reply on RC2', Chengxin Zhang, 30 Oct 2024
  
  We thank the reviewer for the helpful comments. We have addressed their concerns in the response letter. Please refer to the attached PDF (also appended the revised manuscript with tracked changes).
  
  Citation: https://doi.org/10.5194/egusphere-2024-2620-AC2

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-2620', Anonymous Referee #1, 12 Sep 2024

Review of “Unleashing the Potential of Geostationary Satellite Observations in Air Quality Forecasting Through Artificial Intelligence Techniques” by Zhang et al.

Major Comments

This study by Zhang et al. entitled “Unleashing the Potential of Geostationary Satellite Observations in Air Quality Forecasting Through Artificial Intelligence Techniques” presents a new machine-learning framework – GeoNet – that synthesizes geostationary observations of columnar NO₂ from the Geostationary Environment Monitoring Spectrometer (GEMS) with meteorological parameters to forecast surface-level NO₂in East China. Overall, this study represents a significant advancement in surface-level pollution forecasting given its use of the unprecedented hourly data provided by GEMS. I believe that this manuscript is well-written and consistent; however, I have a few comments below.

First, if possible, it would be useful to validate the GEMS observations using ground-based spectrometers (e.g., PGN) specifically for the study region and time period. Additionally, unless I missed it, I don’t believe the time periods for model training and validation were ever stated; if this is the case they should be added to the main text. Second, when investigating feature importance, it would be useful to also identify variability in the feature importance to uncover whether some components are more stable than others in GeoNet and to identify if the significance of geostationary observations is consistent across different days and seasons. Lastly, I suggest that the authors update their analysis in Figure 4 to include the GeoNet predictions regridded to the CAMS grid to identify how much of the improvement in predictions is attributable specifically to enhancements in spatial resolution.

I have included line-specific comments below:

Minor Comments

L53-54: While I agree with this statement, it should be mentioned that for air pollution forecasting to facilitate health benefits, infrastructure needs to be created that communicate risks and appropriate responses to risks to the public.

L55: I think you can drop the second limited in this line.

L75: Maybe it would be useful to give an example or two here (i.e., TROPOMI + OMI).

L78-81: Another limitation of the polar orbiting satellites that is worth mentioning is that typically (at least in the case of TROPOMI) the satellite observes at roughly the same time of day (early afternoon) which makes it difficult to predict concentrations at other times of the day with different meteorological (boundary layer height) and photochemical conditions.

L92: It would be better to describe GEMS as having “unprecedented temporal and spatial resolution andcoverage” as ground-level monitors can observe hourly NO₂ but are limited in time and aircraft remote-sensing can observe NO₂ at sub hourly resolution but over a limited temporal coverage (usually a few days or weeks). The resolution alone isn’t necessarily unique but rather than combined spatial + temporal resolution with extended spatial and temporal coverage.

L117-120: Were you able to validate these data for the study time period / domain? If possible, it may be useful to compare GEMS to ground-based spectrometers in the study domain to get an idea of performance.

L207-208: I don’t think you need this sentence as it is already mentioned in the methods section.

Figure 3: It would be interesting to present the variance of these different components as well in a). Are these importance values pretty consistent regardless of season and day, or do they vary substantially day to day?

Figure 4: Have you assessed how much of the reductions in performance are attributable to resolution? If not, I suggest regridding the GeoNet prediction to the resolution of CAMS and comparing this “GeoNET_coarse” product to the observations to characterize how much of the improved performance is attributable to enhanced spatial resolution.

Figure 5: The colorbar in a is not labeled, and throughout the font is small (especially in the yaxis of c and d), I suggest updating to improve readability.

L338-339: I don’t believe the timeframe of this study was mentioned at all in the main text. What months / years was this prediction trained on and for what period was it validated?

Citation: https://doi.org/10.5194/egusphere-2024-2620-RC1
- AC1: 'Reply on RC1', Chengxin Zhang, 30 Oct 2024
  
  We thank the reviewer for the helpful comments. We have addressed their concerns in the response letter. Please refer to the attached PDF (also appended the revised manuscript with tracked changes).
  
  Citation: https://doi.org/10.5194/egusphere-2024-2620-AC1
RC2:
'Comment on egusphere-2024-2620', Anonymous Referee #2, 15 Oct 2024

The authors attempted to improve the short-term prediction of surface NO2 at a high spatial and temporal resolution by taking advantage of the GEMS NO2 prodcuts and a neural network model. They successfully forecasted full-coverage surface NO2 for the next 24 hours and identifed the critical role of GEMS NO2. Their results demonstrate the potential application of the GEMS products in air quality prediction.

Overall, this is an important study and the results presented here will be useful for future applications of GEMS products as well as the geostationary satellite observations. I am happy to see its publication in due course. However, before that, I still have a few concerns or suggestions for the authors.

I would sugget to move the model configuration and optimization into the main text. This will be very helpful for readers to understand the model.

In the handling of missing data, the authors tried to set them to a fill value of zero. Is it reasonable? It looks reasonable to fill values of diurnal NO₂ climatology (e.g., the seasonal mean diurnal NO2). In addition, as shown in Fig.2, it looks the three methods of handling missing data perform similarly in term of R2 and RMSE. So I don’t think it is necessary to highlight the “weakest” or “strongest” configuration.

I would also suggest to move Fig.S12 in the main text which shows the advantage of GEMS measurements.

The authors also show that the performance of GeoNet model degrades notably after t+16h. Is there any possible solution to overcome this short predicability?

As mentioned in my last comment, the authors also highlight the possible applications to other air pollutants. However, the chemistry and lifetime of other air pollutants might be very different from NO2. For example, if the GEMS tropospheric ozone measurement is useful for the prediction of surface ozone? Some more detailed discussions on the possible application to other pollutants would be very insightful.

BTW, I am not sure whether it is necessary to include NO2 in the Title of this manuscript since the authors didn’t talk too much about major air pollutants (i.e., PM2.5 and ozone)

Citation: https://doi.org/10.5194/egusphere-2024-2620-RC2
- AC2: 'Reply on RC2', Chengxin Zhang, 30 Oct 2024
  
  We thank the reviewer for the helpful comments. We have addressed their concerns in the response letter. Please refer to the attached PDF (also appended the revised manuscript with tracked changes).
  
  Citation: https://doi.org/10.5194/egusphere-2024-2620-AC2

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Chengxin Zhang on behalf of the Authors (30 Oct 2024) Author's response Author's tracked changes Manuscript

ED: Publish as is (19 Nov 2024) by Carl Percival

AR by Chengxin Zhang on behalf of the Authors (20 Nov 2024) Manuscript

Journal article(s) based on this preprint

21 Jan 2025

Unleashing the potential of geostationary satellite observations in air quality forecasting through artificial intelligence techniques

Chengxin Zhang, Xinhan Niu, Hongyu Wu, Zhipeng Ding, Ka Lok Chan, Jhoon Kim, Thomas Wagner, and Cheng Liu

Atmos. Chem. Phys., 25, 759–770, https://doi.org/10.5194/acp-25-759-2025,https://doi.org/10.5194/acp-25-759-2025, 2025

Short summary

Chengxin Zhang, Xinhan Niu, Hongyu Wu, Zhipeng Ding, Ka Lok Chan, Jhoon Kim, Thomas Wagner, and Cheng Liu

Supplement

https://doi.org/10.5194/egusphere-2024-2620-supplement

Chengxin Zhang, Xinhan Niu, Hongyu Wu, Zhipeng Ding, Ka Lok Chan, Jhoon Kim, Thomas Wagner, and Cheng Liu

Viewed

Total article views: 509 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
363	98	48	509	22	8	6

HTML: 363
PDF: 98
XML: 48
Total: 509
Supplement: 22
BibTeX: 8
EndNote: 6

Views and downloads (calculated since 30 Aug 2024)

Month	HTML	PDF	XML	Total
Aug 2024	33	4	3	40
Sep 2024	131	34	10	175
Oct 2024	103	42	5	150
Nov 2024	54	8	2	64
Dec 2024	35	7	0	42
Jan 2025	7	3	28	38

Cumulative views and downloads (calculated since 30 Aug 2024)

Month	HTML	PDF	XML	Total
Aug 2024	33	4	3	40
Sep 2024	131	34	10	175
Oct 2024	103	42	5	150
Nov 2024	54	8	2	64
Dec 2024	35	7	0	42
Jan 2025	7	3	28	38

Viewed (geographical distribution)

Total article views: 517 (including HTML, PDF, and XML) Thereof 517 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 21 Jan 2025

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (4501 KB)
Metadata XML

Short summary

This research utilizes hourly air pollution observations from the world’s first geostationary satellite to develop a spatiotemporal neural network model for full-coverage surface NO₂ pollution prediction over the next 24 hours, achieving outstanding forecasting performance and efficacy. These results highlight the profound impact of geostationary satellite observations in advancing air quality forecasting models, thereby contributing to future models for health exposure to air pollution.


Total:	0
HTML:	0
PDF:	0
XML:	0