the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Assessing the causal impact of the Chinese Spring Festival on PM2.5 air quality in Beijing-Tianjin-Hebei and surrounding region using a machine learning counterfactual modeling approach
Abstract. Acute short-term exposure to extremely high PM2.5 levels posed serious health risks. Human culture-based festival activities can significantly alter emission patterns, often leading to sharp yet understudied fluctuations in air quality. The Chinese Spring Festival (CSF), marked by large-scale family reunions and widespread use of fireworks, raises air pollution concerns. Commonly, this effect is quantified using receptor models or chemical transport models, but the relevant chemical component data and emission inventories are often lacking. This study presents a machine learning counterfactual approach to causally quantify PM2.5 changes associated with holiday activities. The results align well with traditional chemical composition-based estimates of fireworks contributions, highlighting the strong potential of using widely accessible routine monitoring data to quantify source contributions driven by specific interventions. Applied to the twenty-eight major cities in Beijing-Tianjin-Hebei and surrounding area, one of the most polluted regions in China, the approach revealed an average PM2.5 reduction of 19.0 ± 17.5 μg/m3 during the CSF holiday period in 2025, with fireworks accounting for ≥35 % of first-day severe deteriorated PM2.5 air quality and up to 89 % in Baoding. The approach offers a robust tool for evaluating holiday emissions and guiding air quality interventions.
Status: open (until 22 Dec 2025)
- RC1: 'Comment on egusphere-2025-4562', Anonymous Referee #1, 29 Oct 2025 reply
-
RC2: 'Comment on egusphere-2025-4562', Anonymous Referee #2, 05 Nov 2025
reply
The manuscript proposes a machine-learning counterfactual framework to estimate the impact of the Chinese Spring Festival (CSF) on PM2.5 in Hangzhou and the “2+26” cities. The study is timely and policy-relevant, with a clear intention to distinguish air-quality changes from emissions, and the manuscript is well organized and clearly presented. However, several aspects of causal ML practice, the temporal validation strategy, and issues of data representativeness need to be strengthened before publication.
Major Comments:
1. The manuscript frames its analysis within a causal framework, treating the Chinese Spring Festival (CSF) as a “treatment” and using the XGBoost model to predict a counterfactual business-as-usual (BAU) scenario. While this is a conceptually appropriate starting point, the current methodology does not yet meet a rigorous causal ML design. The CSF is a composite factor, bundling the effects of fireworks, altered traffic patterns, and changes in industrial/construction activity. This complexity challenges the core identification assumptions required for causal claims.
Furthermore, the analysis does not adequately address potential influence of these assumptions, such as the inconsistent overlap in covariate distributions between festival and non-festival periods. Some features, like the lunar calendar day, are inherently confounded with the treatment, violating conditional independence. The study could be characterized as a causally inspired counterfactual prediction for BAU rather than a causal estimator under verified identification conditions. Hence, the authors may wish to reconsider the title and tone down the causal claims to avoid overstatement.2. The current modeling approach, which relies on instantaneous covariates, does not account for the temporal auto-correlation inherent in air pollution. The concentration at any given time is also influenced by the emissions and meteorological conditions of previous periods. The choice of a random 80/20 split for model validation may introduce data leakage when evaluating the model performance. A blocked or rolling time-based cross-validation would be more appropriate here.
Separately, uncertainty quantification has been extensively discussed in ML-based atmospheric remote sensing, yet is not addressed in the present manuscript; providing calibrated predictive uncertainty would improve the interpretability of the results.3. The abstract opens with acute short-term health risks from extremely high PM2.5, but the regional result emphasizes an average decrease of 19.0 ± 17.5 μg/m3 over the extended holiday period. These two statements are not contradictory but currently feel weakly connected.
Besides that, Section 3.2 (Hangzhou) explicitly reports large concurrent source changes (e.g., vehicles -31%; dust +2790%), yet Section 3.5 (“2+26”) estimates fireworks’ contribution “under the assumption that emissions from other sources remained unchanged.” The authors need to address this inconsistency or provide sensitivity analysis under alternative assumptions.4. Section 2.1 requires several clarifications. First, key details for the ERA5 dataset, including its temporal/spatial resolution and a reference link, should be provided in the manuscript or SI (Text S1/Table S1). To address the potential for reanalysis data to smooth over urban-scale extremes, a brief comparison of ERA5 variables against ground-station data would strengthen the analysis. Additionally, the usage of total precipitation (TP) needs to be explained; since it is an accumulated value, please describe any transformation performed to make it suitable for an hourly model. The specific parameters or a reference for Emanuel’s saturated vapor pressure formula should be included.
5. There is the spatial representativeness mismatch between the machine learning model, which uses a 14-site city average, and the DN-PMF analysis, which uses chemical data from a single site. This difference could introduce a bias, particularly for localized sources like fireworks. The authors could discuss this limitation and its potential impact on their findings. Given the team’s related work (e.g., Journal of Environmental Sciences), a brief comparison of the methodological advantages and efficiency gains relative to prior work would also help position the contribution.
Minor corrections:
- Line 22: twenty-eight -> 28
- Line 119: meterorology -> meteorology
- Line 164: A -> An
- Line 207: in the midnight of the New Year Eve -> at midnight on New Year’s Eve
- Line 244: Please add units for RMSE and MAE.
- Line 260: reliablity -> reliability
- Line 261: techique -> technique
- Line 323: deterioriation -> deterioration
- Table S1: Please use Pa (not pa) for pressure unit.
Citation: https://doi.org/10.5194/egusphere-2025-4562-RC2
Viewed
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 163 | 0 | 2 | 165 | 0 | 0 |
- HTML: 163
- PDF: 0
- XML: 2
- Total: 165
- BibTeX: 0
- EndNote: 0
Viewed (geographical distribution)
Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
This manuscript "Assessing the causal impact of the Chinese Spring Festival on PM2.5 air quality in Beijing-Tianjin-Hebei and surrounding region using a machine learning counterfactual modeling approach" by Yuan Li and team, addresses an important and interesting topic: the influence of the Chinese Spring Festival (CSF) on regional PM2.5 concentrations, particularly the attribution of emissions to fireworks. The use of a machine learning counterfactual model is an interesting approach to isolate the festival's effect. However, the core conclusions regarding the high contribution of fireworks, especially at the regional scale, are based on data and methodological interpretations that lack sufficient resolution and rigor to justify the claim. Specifically, the analysis appears to conflate highly local, transient firework plumes with persistent regional emissions from industrial and urban sources. This weakness must be addressed before the manuscript can be considered for publication.
Major Comments:
Minor/Technical Comments:
Reference:
Tiwari, P., Cohen, J.B., Lu, L. et al. Multi-platform observations and constraints reveal overlooked urban sources of black carbon in Xuzhou and Dhaka. Commun Earth Environ 6, 38 (2025). https://doi.org/10.1038/s43247-025-02012-x
Li, X., Cohen, J.B., Tiwari, P. et al. Space-based inversion reveals underestimated carbon monoxide emissions over Shanxi. Commun Earth Environ 6, 357 (2025). https://doi.org/10.1038/s43247-025-02301-5