the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Inferring European Fossil Fuel CO2 Emissions using TROPOMI NO2 Data and Sector-Based NOx:CO2 Emission Ratios
Abstract. Accurate monitoring of fossil fuel CO2 (ffCO2) emissions is essential for tracking climate mitigation, yet natural carbon-cycle fluxes often obscure human-induced signals in atmospheric observations. This study presents a satellite-driven data assimilation framework that uses nitrogen oxides (NOx = NO + NO2) − short-lived trace gases co-emitted with CO2 − to estimate ffCO2 emissions. We estimate European NOx emissions for 2021 by assimilating TROPOMI NO2 observations into an Ensemble Kalman Filter (EnKF) framework, optimised within the GEOS-Chem atmospheric transport model. We use a computationally efficient offline treatment of NOx chemistry, enabling large-ensemble inversions while retaining sensitivity to changes in photochemistry. Assimilating these data leads to a systematic reduction in the state vector uncertainty, with a mean ensemble-based error reduction of 4.2–5.7 %, and an overall improvement in model agreement with observations that corresponds to an annual correlation increase, ∆r =0.12. By leveraging sector-specific NOx:CO2 emission ratios, we translate our posterior NOx flux estimates into improved ffCO2 estimates that capture enhanced seasonal variability. Our inferred ffCO2 emissions exhibit elevated values in autumn and winter, spatially concentrated over major source regions and consistent with surface temperature variability. Independent evaluation against in situ measurements confirms significant improvements in mean error statistics. While OCO-2 CO2 column data remain dominated by biogenic signals, our NO2-driven approach successfully isolates the fossil fuel component. This study demonstrates the potential of ensemble data assimilation and reduced-complexity chemistry to provide physically consistent constraints on European ffCO2 estimates, establishing a vital foundation for future joint NO2–CO2 inversion systems.
- Preprint
(25336 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
- RC1: 'Comment on egusphere-2026-1315', Anonymous Referee #1, 29 Apr 2026
-
RC2: 'Comment on egusphere-2026-1315', Anonymous Referee #2, 23 May 2026
The manuscript "Inferring European Fossil Fuel CO2 Emissions using TROPOMI NO2 Data and Sector-Based NOx:CO2 Emission Ratios" by Chlöe N. Schooling et al. documents European scale atmospheric inversions assimilating TROPOMI NO2 observations to derive CO2 fossil fuel emissions maps and national budgets for the year 2021. In the Bayesian framework of the inversion, the assimilation of the NO2 data to infer NOx emissions relies on a simplification of the chemistry modeling in the GEOS-Chem model, and the transfer of information from the TROPOMI NO2 observations to the CO2 fossil fuel emissions is based on a joint correction of the prior NOx and CO2 anthropogenic emission estimates from the CAMS-REG inventory with cross species correlations in the covariance of the uncertainties in these prior estimates.
The topic is important for the atmospheric inversion community. Overall, the inversion and experimental approaches are relevant and interesting, and the text is often clean and concise.
However, the text can also go too fast and it can easily be confusing:
- the way the information is transferred from NOx to CO2 emissions should be better explained with a better presentation of its principle in the introduction and a clearer section 2.3. The title, the abstract (line 11), the introduction (l33 and l54-55), section 2.3 (up to line 137) and the conclusion (l279-280) implicitly let the reader think that it is based on fixed sectoral NOx to FFCO2 emission ratios. The details of section 2.3 from line 152 show that it actually relies on the correlations of the uncertainties in the prior emission estimates between the two species, which is different. This topic deserves a better presentation of its principle in the introduction. More generally, the principles of the "coupled" inversion framework should be better introduced in the introduction or early in section 2 before entering into the specific parameters of the GEOS-Chem and EnKF configurations.
- the presentation of the diagnostics and figures should be more rigorous: e.g. the statistics at lines 9-11 do not correspond to a well defined set of variables (to well defined spatial and temporal scales for the values and averaging), and there are residual questions regarding this when reaching line 203 (do we have error reduction for emissions averaged over Europe -"Europe wide"- and seasons, or the average over seasons and Europe -as implied by Figure 4a- of the error reduction at the model spatial resolution and daily scale ?). The same type of comment apply to the correlation increase at line 11. The legend of Figure 2 does not specify the spatial resolution of the averaged maps. The legend of figure 5a is "full year total European fluxes" but it shows daily emissions. The legend of figure 6 is incomplete… This is just a small sample of examples illustrating a more general problem: there is a need to carefully check all the legends of figures and all the descriptions of statistics in the text.
Regarding the results, analysis and discussions, my main concerns relate to the fit to the TROPOMI NO2 observations and to the evaluation of the FFCO2 emissions resulting from the inversions:
- Line 275 acknowledges that the "posterior flux adjustments imply substantial increases in national ffCO2 emissions relative to the prior inventory" but section 3 does not highlight it enough. Emissions of the most emitting countries in continental Europe, i.e. Germany, Italy and France, are increased by 36 to 48% at annual scale, and the emissions of the other countries within the largest regional emission cluster (underlying the highest signal in the TROPOMI NO2 images), i.e. Netherlands and Belgium, are increased by ~30%. From that point of view, it is difficult to follow the general statement of "improved ffCO2 estimates" (line 12), lines 15-17 ("successfully isolates the fossil fuel component") and more generally the abstract, which ignores these very large increases in national budgets, and the conclusion at lines 273-274.
- Surprisingly, these large increases correspond to small decreases of the statistical uncertainties in the NOx emission estimates (figure 4a) and to marginal improvements of the fit to the TROPOMI NO2 data (figure 4b; lines 206-213 should better acknowledge the weakness of these improvements).
This raises major questions, and this probably requires more extensive discussions and new diagnostics:
- How can we explain that the improvement of the fit to TROPOMI NO2 observations is very limited despite very large corrections to the prior estimate of the emissions (at least of the FFCO2 emissions) far beyond their prior uncertainties at daily scale ? Does the EnKF framework allow for comparing the projection of prior and posterior uncertainties in the observation field to the observation errors and could it help answer this question ?
- How robust the comparison to the in situ CO2 data can be for the evaluation of the FFCO2 emission estimates if it supports such an increase in national budgets ? Isn’t the situation for these stations identical to that for OCO-2, i.e. they mainly catch signal from ecosystems, so that the changes due to corrections to FFCO2 emissions may just slightly compensate for the much larger uncertainties from these ecosystems (see lines 4 and 23-24) ?
The authors should document the NOx emission estimates from the inversion and the corresponding increments to the CAMS-REG inventory. This should be one of the main results from the study and this would probably help answer some of the questions raised above.
Minor comments:
- line 11: the name of the inventory behind the emission ratios deserves to be given here since the quality of their estimate is critical.
- line 21-24: clarify that these statements are about inversions assimilating CO2 data from current observation networks. "yet it is complicated …" does not correspond to bottom-up emission monitoring.
- line 28: the biogenic and agricultural emissions are probably too high to support the statement "they originate almost exclusively from combustion"
- line 60: NOx and CO2 emissions
- lines 64-65: I do not understand the meaning of "with a reported single measurement uncertainty of 0.7x10^15 molec/cm2"
- line 74-75: why not focusing the evaluation of the inversions against OCO-2 data over these major industrial regions and urban centres ?
- lines 78-79: why do you need and can you really get a consistency between the qa values used to select the TROPOMI and OCO-2 observations ?
- section 2.1: do the author use TROPOMI and OCO-2 observations over land only ? Figure 1: do the numbers correspond to GEOS-Chem land grid cells only ? (OCO-2 is missing in "one clear-sky TROPOMI observation…") ?
- lines 96-97: "Our simulation…" is unclear
- lines 134 and 137: NOx and ffCO2 emissions
- line 150: covariance -> variance
- Section 2.3: the 1-day length of the assimilation window is a prerequisite to understand that there is no indication about the temporal correlations in Pf at lines 152-167.
- Equation 4: the notation "r" is not very suitable here because it is used for all types of correlations in this manuscript
- Figure 3: what are the « prior ensemble errors »: the variance of these errors for the NOx emissions ? to which extent are they consistent with the theoretical Pf ?
- l177: the use of 1-day assimilation windows implies that the potential temporal correlations of the uncertainties in the prior estimate of the emissions are ignored. It deserves a comment.
- l202: the word « Europe » is unclear here; it seems that it includes North Africa (l207).
- Figure 6: why plotting the emissions with a log scale ?
- Figure 6 and line 230: I do not understand how a 30% change of annual emissions in the Netherlands can correspond to 80% of the prior uncertainty in these annual emissions. The relative prior uncertainties for the daily emissions at the model resolution hardly reach 60% (figure 3) and there is no spatial or temporal correlations in these uncertainties, so they should strongly decrease when aggregated at annual and national scales. Are there missing information which could explain how to bridge this gap ? A bar plot with prior and posterior national and annual budgets with the corresponding uncertainties would be useful.
- line231: not really "the largest relative changes", but rather the largest ratios between the relative changes and the prior uncertainties ?
- l239-240: unclear; these lines seem to claim that the traditional week days to week end cycle of the emissions would be an artefact in the temporal profiles of the inventories, while the loss of this cycle by the inversion is actually a potential matter of concern.
- Figure 7: considering the level of correction for the FFCO2 emissions and thus, presumably, for the NOx emissions, the impact of the inversion on the fit to the EEA in situ NO2 observations appears to be weak. However, this is consistent with the weakness of the improvement of the fit to the TROPOMI data. Same comment on Figure 8 (regarding Figure 9: the text already states that the impact of the inversion on the fit to the OCO-2 data is weak).
- l249: the improvement of the fit to the in situ NO2 data is larger in summer, however the smallest corrections to the FFCO2 emissions (and thus, presumably, to the NOx emissions) occur during this season. How to explain this ?
- l251-256: such a discussion questions the use of these in situ NO2 sites to evaluate the inversion results. Should the authors select rural sites only for the evaluation of the inversion ?
- Figures 8 and 9: the positive biases in the comparisons to OCO-2 vs. the negative biases in the comparisons to the in situ CO2 data raise concerns. How to explain this ?
- The maps are generally too small and difficult to read.
Citation: https://doi.org/10.5194/egusphere-2026-1315-RC2
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 241 | 125 | 31 | 397 | 20 | 20 |
- HTML: 241
- PDF: 125
- XML: 31
- Total: 397
- BibTeX: 20
- EndNote: 20
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
Inferring European Fossil Fuel CO2 Emissions using TROPOMI NO2 Data and Sector-Based NOx:CO2 Emission Ratios
Overall Comments
This manuscript addresses a topic that is both timely and scientifically important. The use of TROPOMI NO2 observations within an ensemble Kalman filter framework to infer European fossil fuel CO2 emissions is a promising direction, and the reduced-complexity NOx chemistry module is a useful technical contribution. I appreciate the authors' transparency in acknowledging several key limitations.
That said, I have some concerns about the framing and interpretation of the results. Some parts of the manuscript seem to overstate the capability of the proposed technique, while the methodology as presented seems to be at the proof-of-concept demonstration stage. This is not a criticism of the science itself, and I think the manuscript would benefit from more clearly reflecting the current limitations and the gap between what is demonstrated here and what would be needed for practical operational application. In particular, the assumed temporally constant NOx:CO2 emission ratios across sectors, seasons, and countries is a significant simplification whose implications for the posterior CO2 estimates are not fully explored. The authors do acknowledge this limitation toward the end of the paper, but given how central it is to the reliability of the results, it deserves more discussion and supporting analyses.
I am also concerned about the large posterior adjustments in national ffCO2 emissions, which range from 20% to 91% relative to the prior. While I recognize that bottom-up inventories can carry significant uncertainties, adjustments of this magnitude at national scales are surprising for a state-of-the-art inventory used in this study, and the manuscript currently provides no way to assess whether these reflect genuine inventory biases or artifacts of the inversion framework itself. I think this finding deserves more careful and critical discussion, including comparison against independent bottom-up emission estimates or other top-down inversion studies over Europe. Without such benchmarking, it is difficult to evaluate the reliability of the main quantitative results of the paper.
I recommend major revision before this manuscript is suitable for publication.
Specific Comments
Lines 6-7: “We estimate European NOx emissions for 2021 by assimilating TROPOMI NO2 observations into an Ensemble Kalman Filter (EnKF) framework, optimised within the GEOS-Chem atmospheric transport model.”
Although this statement is true, the manuscript never presented posterior NOx emissions. Consider adding a new figure dedicated for NOx emissions or add NOx layer onto an existing figure (i.e., Figure 6).
Lines 20–22: "Accurately monitoring these emissions is essential for tracking progress toward national mitigation goals, yet it is complicated by the difficulty of isolating fossil fuel signals from large, seasonally varying biogenic fluxes."
As written, this sentence implies that atmospheric observation-based monitoring is the primary or only method for tracking mitigation progress. In practice, self-reported national inventories, global bottom-up emission datasets, and other approaches all contribute to this goal. I suggest revising to acknowledge the broader monitoring landscape and to clarify that the atmospheric top-down approach complements rather than replaces these efforts.
Figure 1b: The blue (ICOS) and green (DECC) triangles are difficult to distinguish visually. Please consider changing the color or marker style of one of the two site types.
Figure 1b (continued): One of the NOAA in situ CO2 measurement sites appear to be located over a marine area. Could the authors confirm whether this is the Ocean Station Norway (STM) site? If so, my understanding is that observations at this site were terminated in 2009, and it would be important to clarify which dataset is actually being used here.
Lines 84–91: The authors describe the EEA NO2 network and the ICOS/DECC/NOAA CO2 sites used for evaluation, but there is no discussion of site representativeness. Many ICOS and NOAA tall tower sites are specifically designed to sample regional background air, which may make them poorly suited for detecting near-source ffCO2 emission changes. Similarly, some EEA NO2 sites may be located far from major combustion sources. Please add a brief characterization of the site types used here — urban, suburban, rural background — and discuss how their representativeness affects the interpretation of the model evaluation results.
Lines 101–103: "Prior combustion emissions of NOx and CO2 are taken from the CAMS-REG v8.1 emissions inventory at 0.05°×0.1° resolution."
Given how central this dataset is to the entire study, a brief description of how these emissions are estimated would be helpful — for example, whether they are based on national inventory reports, fuel consumption statistics, or activity data. Also, the CO2 and NOx maps in Figure 2 appear to show patterns that align with national boundaries. Is this a result of aggregating independent national inventories? If so, what are the implications for the inversion results? (i.e., do uncertainties vary by country?)
Lines 105–106: "Emissions from non-combustion sources in the CAMS-REG v8.1 inventory — which include fugitives, waste, solvents, and agriculture — are kept fixed at their prior values."
What is the justification for fixing non-combustion NOx emission sources while only adjusting combustion sources? This decision deserves more discussion. In particular, biogenic soil NOx emissions can represent a significant fraction of total NOx over agricultural area during summer, and fixing these at potentially biased values could systematically drive spurious adjustments in the posterior combustion NOx estimates. Please justify this choice more explicitly or acknowledge it as a limitation with a discussion of its potential seasonal implications.
Lines 110–114: The offline NOx chemistry parameterization is described and validated in Schooling et al. (2025) for emission perturbations on the order of +-20%, If I understand correctly. However, the posterior results reported in Section 3.1 show adjustments of up to +91% at the national scale and widespread winter increases of 10–177% at the grid level. It is unclear whether perturbations of this magnitude fall within the validated range of the linear scaling approximation. Please add a discussion of the expected error in the offline chemistry scheme under these larger perturbations, and assess whether this could introduce systematic biases in the posterior results.
Lines 123–125: The stability of the NO2:NOx partitioning ratio under emission perturbations is cited as a key assumption enabling the conversion of modeled NOx to NO2 columns for comparison with TROPOMI. However, similar to the point above, the range of perturbation magnitudes over which this stability holds is not discussed. What is the uncertainty introduced by this assumption, and how is it accounted for in the inversion framework? Given the large posterior adjustments found in this study, this assumption warrants more explicit validation or at minimum a sensitivity discussion.
Line 165: "the values of σn, σc and r are prescribed from the CAMS-REG v8.1 uncertainty estimates and the CORSO cross-species correlation product, and are assumed to be constant in time."
A few questions on this. First, which emission sector carries the largest uncertainty for NOx and CO2 in CAMS-REG v8.1? Based on Figure 3, northern Africa appears to have substantially higher uncertainty for both species, which seems to drive larger error reductions (Figure 4a) and larger CO2 flux increments (Figure 5b) in that region — yet no in situ evaluation is available there. Please discuss the implications of this large uncertainty for the NOx inversion and the derived CO2 estimates over both North Africa and the broader European domain. Second, do NOx and CO2 prior flux uncertainties have seasonality? If so, how would they affect the inversion result? What are the implications for interpreting the seasonality in monthly posterior increment (Figure 5)? Please address this.
Figure 3: Please clarify the units of the color bar. Do all four panels share the same units?
Lines 211–212: The authors note that MAE and bias improvements are largest in winter and attribute this to "challenges in bottom-up inventories for domestic heating." While plausible, this explanation is presented as a general conclusion without supporting evidence or citation. Does the prior error covariance have a seasonal component that could contribute to this pattern? Also, the seasonal pattern in prior model vs. TROPOMI agreement (higher errors in winter, lower in summer) is interesting, but is not explained. Please expand this discussion and provide supporting references for the domestic heating hypothesis, if available.
Lines 214–215: It is unclear how the posterior ffCO2 fluxes are actually computed from the posterior NOx scaling factors. The current methods section does not include an explicit description of this step. I strongly recommend adding a dedicated subsection describing the NOx-to-CO2 conversion. Specifically: is the same scalar scaling factor applied uniformly to all sectors at each grid cell? What are the implications of this approach when different sectors co-locate on the same grid cell with very different NOx:CO2 ratios? This NOx-to-CO2 conversion works when the inventory error originates from activity data but breaks down when the error comes from emission factors. This distinction is important for interpreting the posterior CO2 results and should be made explicit in the paper.
Lines 223–226: Please clarify whether "emissions" here refers specifically to ffCO2 emissions or NOx emissions, as the text could be read either way.
Figure 6: A few comments. First, the log-scale y-axis for CO2 flux makes it difficult to visually assess the magnitude of prior-to-posterior adjustments across countries. For example, the corrections for Turkey and Sweden appear visually similar to those for other countries despite being much larger in relative terms. Unless the log-scale axis is required for some reason, consider using either a linear scale or showing the relative adjustment explicitly. Second, for readers unfamiliar with European geography, adding country labels to at least one of the map figures (Figures 4 and 5) would make it much easier to connect the spatial patterns in those maps to the country-level results in Figure 6.
On the large posterior adjustments (Lines 229–231): The reported annual increases of 20–91% across all analyzed countries are surprising and require deeper discussion. CAMS-REG v8.1 is a state-of-the-art inventory, and systematic errors of this magnitude would be unusual and would likely have been identified in prior literature. Changes exceeding 6σ of prior uncertainty for Spain and Turkey should prompt careful examination of whether the inversion framework is functioning as intended — including possible issues with boundary condition, NOx chemistry, NOx:CO2 ratio assumptions under large perturbations. Please provide any independent evidence or plausible physical hypothesis that could support adjustments of this magnitude, and discuss alternative explanations more explicitly. A comparison against other bottom-up inventories (i.e., EDGAR) would provide additional context to this result.
On the large posterior adjustments and surface bias (Figure 7): Relatedly, the posterior barely improves the surface NO2 bias (-18.8 to -16.4 ug/m³) despite national emission increases of 20–91%. This is quite surprising and is not directly addressed. How would the evaluation look if restricted to in situ sites located in or near the countries with the largest adjustments (i.e., near cities or combustion sources)? This analysis would help assess how the posterior emission increases are reflected in near-surface concentrations.
On the temperature–ffCO2 relationship: The negative correlation between ffCO2 flux and surface temperature is a physically intuitive and visually compelling result. This relationship is one of the key scientific findings of the paper and would benefit from additional support, such as from bottom-up inventory analyses. A potential point of discussion: the temporal emission profiles in the prior (Figure 2, bottom row) already encode higher winter emissions for heating sectors. If the inversion is primarily amplifying a seasonal signal that was already present in the prior, the strengthened temperature-flux relationship in the posterior may partly reflect prior structure being scaled up rather than genuinely new information from TROPOMI? Also, how should energy demand for summer cooling (air conditioning) affect the ffCO2–temperature relationship?
On Figure A2 and the weekday-weekend signal: The appendix figure showing that the prior has two distinct branches in the temperature-flux scatter — corresponding to weekday and weekend emission regimes — that merge into a single coherent relationship in the posterior is an interesting finding that deserves more prominence and discussion in the main text. Is this merging physically meaningful, suggesting the inversion is correctly capturing a smoothed emission signal? Or could it reflect the Savitzky-Golay smoothing washing out the existing weekly cycle? It would also be informative to show the equivalent NOx flux vs. temperature scatter plot (a NOx version of Figure 6) to assess whether the same pattern holds for the directly constrained species.
On the OCO-2 evaluation (Figure 9): The near-absence of improvement in the OCO-2 comparison is acknowledged and expected given the dominance of biogenic signals in XCO2. However, the positive OCO-2 model bias that worsens in autumn and winter in the posterior is worth examining more carefully — what does this imply about the biospheric fluxes used in the inversion? More broadly, including OCO-2 as an evaluation dataset when it demonstrably cannot detect the signal being estimated risks creating a misleading impression of multi-dataset validation. Please either clarify the specific role and limitations of the OCO-2 evaluation more explicitly, or reconsider whether it adds sufficient scientific value to warrant inclusion.
On spatial evaluation: The model evaluation is conducted as temporally aggregated statistics across all available sites. Given that the inversion produces different results across countries — large increases in Turkey and Spain, modest changes in Netherlands and Belgium — a spatially resolved evaluation would provide much more diagnostic insight than domain-wide statistics alone. Or even a simple map of site-level prior vs. posterior bias would help assess whether the regional patterns in the posterior are supported by independent observations.
On the TROPOMI overpass time: TROPOMI observes at approximately 13:30 local time, meaning the inversion effectively constrains a single daily snapshot of emissions. The extrapolation to full daily totals relies on the sector-based diurnal scaling factors, and any errors in the assumed diurnal profile shape propagate directly into the posterior flux estimates. Please add a discussion of this limitation and its implications, and consider how complementary approaches such as geostationary satellite observations (e.g., TEMPO, which is directly relevant here), ground-based monitoring networks, or bottom-up activity data could help constrain the diurnal cycle more robustly.
On the broader methodological context: Several complementary approaches have emerged in recent years for inferring ffCO2 emissions from satellite observations, including direct plume detection methods, machine learning-based approaches, and observed cross-tracer ratio techniques. A brief discussion situating the present approach within this landscape — including its relative strengths and limitations compared to these alternatives — would help readers better understand the contribution and applicability of the method.