Urban-rural patterns and driving factors of particulate matter pollution decrease in eastern china

Song, Zhihao; Chen, Bin

doi:10.5194/egusphere-2025-2194

Preprints

https://doi.org/10.5194/egusphere-2025-2194

Preprints

06 Jun 2025

| 06 Jun 2025

Urban-rural patterns and driving factors of particulate matter pollution decrease in eastern china

Zhihao Song and Bin Chen

Abstract. Urban-rural patterns of particulate matter (PM) pollution reduction in China remain poorly understood. Using an interpretable end-to-end machine learning model framework from original satellite data, we identified changes in urban and rural PM pollution and the underlying drivers. During the period 2015–2023, the average decrease rates of PM10 and PM2.5 in eastern China were -4.1±1.1 μg/m³/month and -2.4±0.8 μg/m³/month, respectively. The rate of decrease in urban areas was higher than that in rural areas, which played a dominant role in PM reduction. Significant reductions in PM concentrations were observed in urban core areas, suburbs, towns and regions with high agricultural pressure. The interpretability analysis showed that temperature and interannual variability were the main drivers of PM pollution reduction. However, only interannual variability showed a significant decreasing trend in its effect on PM pollution, while other driving factors showed periodic variations. Furthermore, there were differences in the drivers of PM reduction between urban and rural areas, particularly with interannual variability in particular contributing to PM pollution reduction in urban areas, but having a lesser impact in most rural areas. This study reveals the urban-rural patterns of PM pollution reduction in eastern China, and highlights the need for differentiated air pollution control strategies in urban and rural areas.

Received: 10 May 2025 – Discussion started: 06 Jun 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 15702 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (15702 KB)

Supplement (5903 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

13 Nov 2025

Urban-rural patterns and driving factors of particulate matter pollution decrease in eastern China

Zhihao Song and Bin Chen

Atmos. Chem. Phys., 25, 15487–15506, https://doi.org/10.5194/acp-25-15487-2025,https://doi.org/10.5194/acp-25-15487-2025, 2025

Short summary

Zhihao Song and Bin Chen

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-2194', Anonymous Referee #1, 15 Jul 2025

This study applies machine learning to estimate hourly PM₂.₅ and PM₁₀ concentrations across eastern China using Himawari-8 satellite data, analyzing trends, influencing factors (2015–2023), and urban–rural disparities. The results are well presented. Below are comments and suggestions for improving the manuscript:
The particulate matter designations “PM2.5” and “PM10” should consistently use subscript formatting (i.e., PM₂.₅ and PM₁₀) throughout the manuscript for scientific precision.
Numerous previous studies have derived hourly surface PM concentrations from Himawari-8 observations in China (doi:10.5194/acp-21-7863-2021). These should be briefly summarized in the Introduction.
Similarly, the Extreme Trees model has been previously applied successfully for satellite-based PM₂.₅ (doi:10.1038/s41467-023-43862-3; doi:10.1016/j.rse.2020.112136) and PM₁₀ (doi:10.1016/j.envint.2020.106290) estimation. A concise summary of these efforts should be added. In addition, a clear justification for selecting this particular model over other machine learning approaches is needed.
Line 91: The acronym “TOAR” appears before it is defined. All acronyms should be spelled out at first mention for clarity (e.g., “Tropospheric Ozone Assessment Report (TOAR)”).
Lines 97–101: The authors should clarify whether only Himawari-8 data were used, or whether Himawari-9 (which became operational in December 2022) was included in the 2022–2023 period. This is important for ensuring temporal consistency.
Lines 119–121: The data sources and preprocessing steps for elevation (HEIGHT), land use and land cover (LUCC), and population density (RK) should be explicitly described.
Equation 1: The model uses only top-of-atmosphere (TOA) reflectance, without accounting for viewing or solar illumination angles, which are known to influence aerosol retrievals. The authors should provide justification for their exclusion.
Figure 1: The methodology used to simultaneously estimate PM₂.₅ and PM₁₀ via a multi-output model is unclear. A brief explanation or schematic would improve reader understanding.
Equation 2: The terms SS_res and SS_tot should be formally defined in the text or figure caption.
Line 163: “SHAP” should be spelled out as “SHapley Additive exPlanations (SHAP)” upon first use.
Line 168: The selection of “20 times” for permutation testing appears arbitrary. A statistical or methodological justification is necessary.
Line 172: The purpose of the provided URL is unclear. The authors should clarify what resource it links to and its relevance.
Lines 212–213: The reported temporal cross-validation R² values (0.41 for PM₁₀, 0.51 for PM₂.₅) seem inconsistent with the claim of “robust stability.” The authors should address this discrepancy or revise the description accordingly.
Figure 2: The placement of accuracy labels is too close to the subplot boundaries, potentially affecting readability. Adjust the positions to improve visual clarity.
Line 243: The manuscript does not evaluate relative reduction trends (i.e., trends normalized by baseline concentrations), which are crucial for comparing changes across regions with differing pollution levels. Consider incorporating this analysis.
Figure 3: Clearly define the boundaries (e.g., interquartile range, whiskers) of the box plots in the caption. Additionally, the color bar ranges in panels C–F are too broad, masking regional differences. Narrowing the ranges would better highlight spatial variability.
Lines 263–269: The number of decimal places reported is inconsistent. Standardize numerical precision across the section, preferably to two decimal places.
Lines 277–278: The inclusion of temporal variables (year and month) as proxies for anthropogenic drivers requires further explanation. Clarify their interpretability in the context of human activity patterns.
Figure 8A: The x-axis range is too narrow, truncating some boxplot distributions. Expanding the axis limits would allow for clearer visualization of data variability.

Citation: https://doi.org/10.5194/egusphere-2025-2194-RC1
- AC2: 'Reply on RC1', B. Chen, 10 Sep 2025
  
  The authors are very grateful to the reviewers for their comments. We thank them for taking the time to review this manuscript and for their valuable suggestions, which have significantly improved the academic quality of this manuscript. The responses to the reviewers' comments are provided in the attachment. The reviewers' comments are highlighted in blue, and our responses are highlighted in red.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2194-AC2
RC2:
'Comment on egusphere-2025-2194', Anonymous Referee #2, 06 Aug 2025

This Manuscript uses an Extreme Trees based machine learning model to identify the drivers in changes of urban and rural PM in China. This is a good effort, and the authors demonstrate reasonable applicability of their approach.
Following are key points that need to be addressed:
1. Since the authors use spatial - temporal datasets, why was LSTM and Convolutional Neural Networks not applied?
2. Figure 7: Explain the physical justification for why SHAP values for temperature are negative in summer and positive in winter? Would these change between urban versus rural areas? For example, biogenic emissions might increase in summer at higher temperatures increasing secondary organic aerosol formation. In winter, reducing temperatures might increase demand for residential heating. Further discussions are needed here.
3. Figure 8 and related discussions: The figure is not clear. Discussions on lines 352-356 suggest different results for how interannual variations change between urban and rural areas using the 2 approaches: Relative contributions versus SHAP. Why are these different? Are SHAP values more reliable? The authors seem to just combine results from relative contribution and SHAP in their Abstract and Discussions. However, physical justification is needed to figure out what causes these differences.
4. What about role of photochemistry? The authors include solar radiation, however, it does not show up as a key variable in SHAP interpretability analyses.
5. Conclusions: Line 402-405: The authors rightfully acknowledge that anthropogenic influences are just represented by a time variable in their analyses. This is clearly insufficient. If possible, the authors should consider emissions, photochemistry (ozone, OH radicals, NOx, VOCs) etc. in their analyses.

Citation: https://doi.org/10.5194/egusphere-2025-2194-RC2
- AC1: 'Reply on RC2', B. Chen, 10 Sep 2025
  
  The authors are very grateful to the reviewers for their comments. We thank them for taking the time to review this manuscript and for their valuable suggestions, which have significantly improved the academic quality of this manuscript. The responses to the reviewers' comments are provided in the attachment. The reviewers' comments are highlighted in blue, and our responses are highlighted in red.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2194-AC1

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2025-2194', Anonymous Referee #1, 15 Jul 2025

This study applies machine learning to estimate hourly PM₂.₅ and PM₁₀ concentrations across eastern China using Himawari-8 satellite data, analyzing trends, influencing factors (2015–2023), and urban–rural disparities. The results are well presented. Below are comments and suggestions for improving the manuscript:
The particulate matter designations “PM2.5” and “PM10” should consistently use subscript formatting (i.e., PM₂.₅ and PM₁₀) throughout the manuscript for scientific precision.
Numerous previous studies have derived hourly surface PM concentrations from Himawari-8 observations in China (doi:10.5194/acp-21-7863-2021). These should be briefly summarized in the Introduction.
Similarly, the Extreme Trees model has been previously applied successfully for satellite-based PM₂.₅ (doi:10.1038/s41467-023-43862-3; doi:10.1016/j.rse.2020.112136) and PM₁₀ (doi:10.1016/j.envint.2020.106290) estimation. A concise summary of these efforts should be added. In addition, a clear justification for selecting this particular model over other machine learning approaches is needed.
Line 91: The acronym “TOAR” appears before it is defined. All acronyms should be spelled out at first mention for clarity (e.g., “Tropospheric Ozone Assessment Report (TOAR)”).
Lines 97–101: The authors should clarify whether only Himawari-8 data were used, or whether Himawari-9 (which became operational in December 2022) was included in the 2022–2023 period. This is important for ensuring temporal consistency.
Lines 119–121: The data sources and preprocessing steps for elevation (HEIGHT), land use and land cover (LUCC), and population density (RK) should be explicitly described.
Equation 1: The model uses only top-of-atmosphere (TOA) reflectance, without accounting for viewing or solar illumination angles, which are known to influence aerosol retrievals. The authors should provide justification for their exclusion.
Figure 1: The methodology used to simultaneously estimate PM₂.₅ and PM₁₀ via a multi-output model is unclear. A brief explanation or schematic would improve reader understanding.
Equation 2: The terms SS_res and SS_tot should be formally defined in the text or figure caption.
Line 163: “SHAP” should be spelled out as “SHapley Additive exPlanations (SHAP)” upon first use.
Line 168: The selection of “20 times” for permutation testing appears arbitrary. A statistical or methodological justification is necessary.
Line 172: The purpose of the provided URL is unclear. The authors should clarify what resource it links to and its relevance.
Lines 212–213: The reported temporal cross-validation R² values (0.41 for PM₁₀, 0.51 for PM₂.₅) seem inconsistent with the claim of “robust stability.” The authors should address this discrepancy or revise the description accordingly.
Figure 2: The placement of accuracy labels is too close to the subplot boundaries, potentially affecting readability. Adjust the positions to improve visual clarity.
Line 243: The manuscript does not evaluate relative reduction trends (i.e., trends normalized by baseline concentrations), which are crucial for comparing changes across regions with differing pollution levels. Consider incorporating this analysis.
Figure 3: Clearly define the boundaries (e.g., interquartile range, whiskers) of the box plots in the caption. Additionally, the color bar ranges in panels C–F are too broad, masking regional differences. Narrowing the ranges would better highlight spatial variability.
Lines 263–269: The number of decimal places reported is inconsistent. Standardize numerical precision across the section, preferably to two decimal places.
Lines 277–278: The inclusion of temporal variables (year and month) as proxies for anthropogenic drivers requires further explanation. Clarify their interpretability in the context of human activity patterns.
Figure 8A: The x-axis range is too narrow, truncating some boxplot distributions. Expanding the axis limits would allow for clearer visualization of data variability.

Citation: https://doi.org/10.5194/egusphere-2025-2194-RC1
- AC2: 'Reply on RC1', B. Chen, 10 Sep 2025
  
  The authors are very grateful to the reviewers for their comments. We thank them for taking the time to review this manuscript and for their valuable suggestions, which have significantly improved the academic quality of this manuscript. The responses to the reviewers' comments are provided in the attachment. The reviewers' comments are highlighted in blue, and our responses are highlighted in red.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2194-AC2
RC2:
'Comment on egusphere-2025-2194', Anonymous Referee #2, 06 Aug 2025

This Manuscript uses an Extreme Trees based machine learning model to identify the drivers in changes of urban and rural PM in China. This is a good effort, and the authors demonstrate reasonable applicability of their approach.
Following are key points that need to be addressed:
1. Since the authors use spatial - temporal datasets, why was LSTM and Convolutional Neural Networks not applied?
2. Figure 7: Explain the physical justification for why SHAP values for temperature are negative in summer and positive in winter? Would these change between urban versus rural areas? For example, biogenic emissions might increase in summer at higher temperatures increasing secondary organic aerosol formation. In winter, reducing temperatures might increase demand for residential heating. Further discussions are needed here.
3. Figure 8 and related discussions: The figure is not clear. Discussions on lines 352-356 suggest different results for how interannual variations change between urban and rural areas using the 2 approaches: Relative contributions versus SHAP. Why are these different? Are SHAP values more reliable? The authors seem to just combine results from relative contribution and SHAP in their Abstract and Discussions. However, physical justification is needed to figure out what causes these differences.
4. What about role of photochemistry? The authors include solar radiation, however, it does not show up as a key variable in SHAP interpretability analyses.
5. Conclusions: Line 402-405: The authors rightfully acknowledge that anthropogenic influences are just represented by a time variable in their analyses. This is clearly insufficient. If possible, the authors should consider emissions, photochemistry (ozone, OH radicals, NOx, VOCs) etc. in their analyses.

Citation: https://doi.org/10.5194/egusphere-2025-2194-RC2
- AC1: 'Reply on RC2', B. Chen, 10 Sep 2025
  
  The authors are very grateful to the reviewers for their comments. We thank them for taking the time to review this manuscript and for their valuable suggestions, which have significantly improved the academic quality of this manuscript. The responses to the reviewers' comments are provided in the attachment. The reviewers' comments are highlighted in blue, and our responses are highlighted in red.
  
  Citation: https://doi.org/10.5194/egusphere-2025-2194-AC1

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by B. Chen on behalf of the Authors (10 Sep 2025) Author's response Author's tracked changes Manuscript

ED: Publish subject to minor revisions (review by editor) (22 Sep 2025) by Qi Chen

AR by B. Chen on behalf of the Authors (23 Sep 2025) Author's response Author's tracked changes Manuscript

ED: Publish as is (24 Sep 2025) by Qi Chen

AR by B. Chen on behalf of the Authors (24 Sep 2025)

Journal article(s) based on this preprint

13 Nov 2025

Urban-rural patterns and driving factors of particulate matter pollution decrease in eastern China

Zhihao Song and Bin Chen

Atmos. Chem. Phys., 25, 15487–15506, https://doi.org/10.5194/acp-25-15487-2025,https://doi.org/10.5194/acp-25-15487-2025, 2025

Short summary

Zhihao Song and Bin Chen

Supplement

https://doi.org/10.5194/egusphere-2025-2194-supplement

Zhihao Song and Bin Chen

Viewed

Total article views: 690 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
598	75	17	690	35	8	13

HTML: 598
PDF: 75
XML: 17
Total: 690
Supplement: 35
BibTeX: 8
EndNote: 13

Views and downloads (calculated since 06 Jun 2025)

Month	HTML	PDF	XML	Total
Jun 2025	78	13	5	96
Jul 2025	32	19	3	54
Aug 2025	102	16	3	121
Sep 2025	356	6	4	366
Oct 2025	24	14	2	40
Nov 2025	6	7	0	13

Cumulative views and downloads (calculated since 06 Jun 2025)

Month	HTML	PDF	XML	Total
Jun 2025	78	13	5	96
Jul 2025	32	19	3	54
Aug 2025	102	16	3	121
Sep 2025	356	6	4	366
Oct 2025	24	14	2	40
Nov 2025	6	7	0	13

Viewed (geographical distribution)

Total article views: 679 (including HTML, PDF, and XML) Thereof 679 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 13 Nov 2025

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (15702 KB)
Metadata XML

Short summary

This study reveals that 2015–2023 PM (particulate matter) pollution in eastern China declined faster in urban than rural areas. Temperature and interannual variability were key drivers, with urban reductions concentrated in urban core zones and suburbs. The effect of interannual variability on PM decreased significantly, while other factors showed periodic fluctuations.


Total:	0
HTML:	0
PDF:	0
XML:	0