High spatial resolution CO<sub>2</sub> measurement using low-cost commercial sensors in Seoul megacity

Park, JaeYoung; Ahn, Jinho; Kim, Jeongeun; Salehnia, Nasrin

doi:10.5194/egusphere-2025-3775

Preprints

https://doi.org/10.5194/egusphere-2025-3775

Preprints

12 Sep 2025

| 12 Sep 2025

High spatial resolution CO₂ measurement using low-cost commercial sensors in Seoul megacity

JaeYoung Park, Jinho Ahn, Jeongeun Kim, and Nasrin Salehnia

Abstract. Carbon dioxide (CO₂) is the most significant anthropogenic greenhouse gas. However, tracking CO₂ levels can be challenging due to the uneven distribution of concentrations and the high cost of sensors. In this study, we explored several correction techniques to enable the large-scale use of affordable CO₂ sensors, thereby enhancing the spatial resolution. We found that the low-cost CO₂ sensor (HT-2000) closely aligned with the trends observed in data from a more accurate sensor (LI-840a). By applying multiple-point linear regression, we reduced the root mean square error (RMSE) to only 1–2 % of the measured value, which is accurate enough for urban monitoring at a local scale. Using a large network of low-cost sensors, we were able to map CO₂concentration in detail, capture fine spatial variations, and gain a clearer understanding of emission patterns at an urban road intersection and within a tunnel.

Received: 12 Aug 2025 – Discussion started: 12 Sep 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 1201 KB)

Supplement (366 KB)

Download & links

JaeYoung Park, Jinho Ahn, Jeongeun Kim, and Nasrin Salehnia

Status: closed

RC1:
'Comment on egusphere-2025-3775', Anonymous Referee #1, 02 Oct 2025

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-3775/egusphere-2025-3775-RC1-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2025-3775-RC1
- AC1: 'Reply on RC1', Jinho Ahn, 08 Dec 2025
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-3775/egusphere-2025-3775-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-3775-AC1
RC2:
'Comment on egusphere-2025-3775', Anonymous Referee #2, 14 Oct 2025
The authors aim to address the spatial heterogeneity and temporal variability of urban CO2 by applying a multiple‑point linear regression to low‑cost sensor measurements. They present two experiments—one at a high‑traffic intersection (Bongcheon) for several months, and another in a tunnel over two days—and claim that high‑resolution CO2 maps can be generated with this approach.

I remain skeptical about the practicality of this linear‑regression‑based interpolation method. A linear model ignores the geometric constraints inherent to urban environments. Moreover, the paper provides insufficient detail on how the regression coefficients are learned, leaving open the possibility of overfitting. Finally, the authors do not discuss the spatial and temporal variability of those coefficients, which is essential for assessing the method’s real‑world applicability.

Please find below more details.

General comments

Linear interpolation lacks physical realism. A purely linear model does not incorporate the geometric constraints that govern CO2 transport in cities. This shortcoming is evident in the presented concentration maps, which show little correlation with major roadways. Also, the manuscript does not explain that each spatial coordinate would require its own set of regression coefficients and time‑alignment parameters. A discussion of the spatio-temporal variability of these coefficients is necessary if the method is to be useful in practice.

Training procedure for the multiple-point linear regression is under‑described. The authors do not specify how the data are split into training and test sets, nor whether any cross‑validation is employed to guard against over‑fitting. Without this information, the reported RMSE metric cannot be trusted.

Calibration, correction, and interpolation are conflated. At times it is unclear whether the authors refer to sensor calibration, correction, or spatial interpolation. These three steps serve distinct purposes and should be clearly separated.

Specific comments

l. 17 – Clarify the local CO2 sources in Seoul (e.g. transportation, heating). When citing the city’s 9 % share of national electricity consumption, indicate whether the associated emissions stem from power plants within Seoul or elsewhere; otherwise the statement lacks context.

l. 21 – Explain why a high‑resolution CO₂ map would aid mitigation strategies. A brief illustrative example would help readers understand the use case.

l. 25 – Define “top‑down” and “bottom‑up” approaches.

l. 30 – How ground‑level CO2 measurements can improve eddy‑covariance flux estimates ?

l. 35 – Both sensors are using NDIR, maybe specify the exact sensor models used.

l. 43 – State the main contribution explicitly: the development of a spatial‑interpolation method for CO2 using low‑cost sensors.

l. 45 – Distinguish clearly among calibration, correction (post‑processing of raw measurements to remove spikes, drift, etc.), and interpolation/estimation (predicting CO2 at unmeasured locations, possibly after time‑lag alignment).

l. 60 – Provide the cost ratio between the two sensor types to understand the trade‑off between price and accuracy.

l. 64 – Consider reorganizing the methods section into three subsections: “Instrument description,” “Site description,” and “Interpolation method.”

l. 80 – Explain the nature of the reported time delays: are they instrument‑specific latencies, or delays relative to the target interpolation location? Discuss how the method would handle interpolation at a location lacking any ground‑truth measurement, and describe the optimization technique used for time‑lag correction.

l. 84–95 – Indicate which portion of the dataset is used to learn the regression coefficients and whether any cross‑validation scheme is applied to prevent over‑fitting.

l. 90 – Justify the inclusion of temperature and humidity as covariates in the interpolation model.

l. 114 – Clarify whether the displayed equation is for to sensor calibration or to the spatial interpolation itself.

l. 127 – The RMSE appears to compare CO2 values at two distinct sites simultaneously. One would expect a comparison between the interpolated value and the ground-truth.

l. 135 – Confirm whether the low‑cost sensor time-series are temporally aligned before regression.

l. 142 – If the interpolation described here is identical to the multiple‑point linear regression introduced at l. 94, state this explicitly and reference the MATLAB implementation in the “Interpolation method” subsection.

l. 143 – Expand the discussion of results: why is linear spatial interpolation justified in a complex urban environment? Address the apparent lack of correlation between major roads and elevated CO2 levels.

l. 153 – Specify whether the tunnel is two-way.

l. 168–176 – Separate statements about calibration from those about data interpolation to avoid confusion.

l. 193–205 – The reported decrease in error from the “time‑lag corrections” seem insignificant compared with the reading error (HT‑2000, LI‑870A). The observed variability (> 10 s) may stem from the optimization algorithm rather than true physical lag.

l. 194 – Describe the algorithmic procedure used to estimate the time lag.

l. 207–211 – As above, the error analysis does not appear meaningful when it is of the order of the sensor’s reading error.

l. 215–225 – I disagree with the notion of a universal set of regression coefficients for all locations. Coefficients inevitably depend on local geography and prevailing atmospheric conditions.

l. 262 – The concluding claim that the approach enables large‑scale, high‑resolution monitoring is premature. The manuscript does not examine the spatial or temporal stability of the regression coefficients, which is essential to validate the method.
Citation: https://doi.org/10.5194/egusphere-2025-3775-RC2
- AC2: 'Reply on RC2', Jinho Ahn, 08 Dec 2025
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-3775/egusphere-2025-3775-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-3775-AC2

Status: closed

RC1:
'Comment on egusphere-2025-3775', Anonymous Referee #1, 02 Oct 2025

The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-3775/egusphere-2025-3775-RC1-supplement.pdf

Citation: https://doi.org/10.5194/egusphere-2025-3775-RC1
- AC1: 'Reply on RC1', Jinho Ahn, 08 Dec 2025
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-3775/egusphere-2025-3775-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-3775-AC1
RC2:
'Comment on egusphere-2025-3775', Anonymous Referee #2, 14 Oct 2025
The authors aim to address the spatial heterogeneity and temporal variability of urban CO2 by applying a multiple‑point linear regression to low‑cost sensor measurements. They present two experiments—one at a high‑traffic intersection (Bongcheon) for several months, and another in a tunnel over two days—and claim that high‑resolution CO2 maps can be generated with this approach.

I remain skeptical about the practicality of this linear‑regression‑based interpolation method. A linear model ignores the geometric constraints inherent to urban environments. Moreover, the paper provides insufficient detail on how the regression coefficients are learned, leaving open the possibility of overfitting. Finally, the authors do not discuss the spatial and temporal variability of those coefficients, which is essential for assessing the method’s real‑world applicability.

Please find below more details.

General comments

Linear interpolation lacks physical realism. A purely linear model does not incorporate the geometric constraints that govern CO2 transport in cities. This shortcoming is evident in the presented concentration maps, which show little correlation with major roadways. Also, the manuscript does not explain that each spatial coordinate would require its own set of regression coefficients and time‑alignment parameters. A discussion of the spatio-temporal variability of these coefficients is necessary if the method is to be useful in practice.

Training procedure for the multiple-point linear regression is under‑described. The authors do not specify how the data are split into training and test sets, nor whether any cross‑validation is employed to guard against over‑fitting. Without this information, the reported RMSE metric cannot be trusted.

Calibration, correction, and interpolation are conflated. At times it is unclear whether the authors refer to sensor calibration, correction, or spatial interpolation. These three steps serve distinct purposes and should be clearly separated.

Specific comments

l. 17 – Clarify the local CO2 sources in Seoul (e.g. transportation, heating). When citing the city’s 9 % share of national electricity consumption, indicate whether the associated emissions stem from power plants within Seoul or elsewhere; otherwise the statement lacks context.

l. 21 – Explain why a high‑resolution CO₂ map would aid mitigation strategies. A brief illustrative example would help readers understand the use case.

l. 25 – Define “top‑down” and “bottom‑up” approaches.

l. 30 – How ground‑level CO2 measurements can improve eddy‑covariance flux estimates ?

l. 35 – Both sensors are using NDIR, maybe specify the exact sensor models used.

l. 43 – State the main contribution explicitly: the development of a spatial‑interpolation method for CO2 using low‑cost sensors.

l. 45 – Distinguish clearly among calibration, correction (post‑processing of raw measurements to remove spikes, drift, etc.), and interpolation/estimation (predicting CO2 at unmeasured locations, possibly after time‑lag alignment).

l. 60 – Provide the cost ratio between the two sensor types to understand the trade‑off between price and accuracy.

l. 64 – Consider reorganizing the methods section into three subsections: “Instrument description,” “Site description,” and “Interpolation method.”

l. 80 – Explain the nature of the reported time delays: are they instrument‑specific latencies, or delays relative to the target interpolation location? Discuss how the method would handle interpolation at a location lacking any ground‑truth measurement, and describe the optimization technique used for time‑lag correction.

l. 84–95 – Indicate which portion of the dataset is used to learn the regression coefficients and whether any cross‑validation scheme is applied to prevent over‑fitting.

l. 90 – Justify the inclusion of temperature and humidity as covariates in the interpolation model.

l. 114 – Clarify whether the displayed equation is for to sensor calibration or to the spatial interpolation itself.

l. 127 – The RMSE appears to compare CO2 values at two distinct sites simultaneously. One would expect a comparison between the interpolated value and the ground-truth.

l. 135 – Confirm whether the low‑cost sensor time-series are temporally aligned before regression.

l. 142 – If the interpolation described here is identical to the multiple‑point linear regression introduced at l. 94, state this explicitly and reference the MATLAB implementation in the “Interpolation method” subsection.

l. 143 – Expand the discussion of results: why is linear spatial interpolation justified in a complex urban environment? Address the apparent lack of correlation between major roads and elevated CO2 levels.

l. 153 – Specify whether the tunnel is two-way.

l. 168–176 – Separate statements about calibration from those about data interpolation to avoid confusion.

l. 193–205 – The reported decrease in error from the “time‑lag corrections” seem insignificant compared with the reading error (HT‑2000, LI‑870A). The observed variability (> 10 s) may stem from the optimization algorithm rather than true physical lag.

l. 194 – Describe the algorithmic procedure used to estimate the time lag.

l. 207–211 – As above, the error analysis does not appear meaningful when it is of the order of the sensor’s reading error.

l. 215–225 – I disagree with the notion of a universal set of regression coefficients for all locations. Coefficients inevitably depend on local geography and prevailing atmospheric conditions.

l. 262 – The concluding claim that the approach enables large‑scale, high‑resolution monitoring is premature. The manuscript does not examine the spatial or temporal stability of the regression coefficients, which is essential to validate the method.
Citation: https://doi.org/10.5194/egusphere-2025-3775-RC2
- AC2: 'Reply on RC2', Jinho Ahn, 08 Dec 2025
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2025/egusphere-2025-3775/egusphere-2025-3775-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2025-3775-AC2

JaeYoung Park, Jinho Ahn, Jeongeun Kim, and Nasrin Salehnia

Supplement

https://doi.org/10.5194/egusphere-2025-3775-supplement

JaeYoung Park, Jinho Ahn, Jeongeun Kim, and Nasrin Salehnia

Viewed

Total article views: 1,576 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
1,456	89	31	1,576	48	31	28

HTML: 1,456
PDF: 89
XML: 31
Total: 1,576
Supplement: 48
BibTeX: 31
EndNote: 28

Views and downloads (calculated since 12 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	1,256	4	3	1,263
Oct 2025	108	30	10	148
Nov 2025	15	11	10	36
Dec 2025	31	34	8	73
Jan 2026	46	10	0	56

Cumulative views and downloads (calculated since 12 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	1,256	4	3	1,263
Oct 2025	108	30	10	148
Nov 2025	15	11	10	36
Dec 2025	31	34	8	73
Jan 2026	46	10	0	56

Viewed (geographical distribution)

Total article views: 1,571 (including HTML, PDF, and XML) Thereof 1,571 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 17 Jan 2026

Short summary

This study shows that low-cost CO₂ sensors, when individually calibrated using multi-point linear regression, can achieve 1–2 % accuracy. Deployed in Seoul, they revealed local pollution patterns like idling emissions at intersections and the "piston effect" in tunnels. With proper correction, these sensors enable affordable, detailed urban CO₂ monitoring.


Total:	0
HTML:	0
PDF:	0
XML:	0

High spatial resolution CO2 measurement using low-cost commercial sensors in Seoul megacity

Supplement

Viewed

Viewed (geographical distribution)

High spatial resolution CO₂ measurement using low-cost commercial sensors in Seoul megacity