the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
GENOA v3: A flexible framework for reduction and exploration of highly detailed chemical mechanisms
Abstract. Comprehensive atmospheric chemical mechanisms for volatile organic compound (VOC) oxidation contain thousands to millions of reactions and species, presenting major computational challenges for large-scale or long-term simulations. As mechanism complexity continues to increase, reduction strategies are required to enable their use in atmospheric modeling while preserving accuracy.
This paper presents the GENerator of Optimized Atmospheric chemical mechanisms (GENOA v3), a major advancement over earlier GENOA versions that enables scalable reduction of highly detailed mechanisms containing up to millions of reactions and species. GENOA v3 combines fast, strategy-driven threshold-based reduction (TBR) with simulation-based reduction (SBR) that explicitly controls accuracy. The framework is modular, graph-aware, and user-configurable, resulting in compact and chemically interpretable reduced mechanisms.
Applications to GECKO-A mechanisms for diverse VOC precursors across a range of scenarios show that TBR achieves mechanism size reductions of 20–90 % while preserving reasonable accuracy for metrics related to secondary organic aerosol (SOA) formation and gas-phase chemistry, with performance systematically dependent on precursor structure and chemical complexity across mechanisms. SBR achieves further reductions in mechanism size by several orders of magnitude; when trained with 15 % mean error constraints, SBR produces schemes within 0.02 % of the original size for preservation of SOA mass and 0.05 % when also considering gas-phase reactivity (e.g., OH, O3, and NO3).
These results demonstrate for the first time that GENOA v3 can reduce highly detailed chemical mechanisms while jointly preserving SOA mass and gas-phase reactivity, achieving substantial size reductions with reasonable accuracy across a wide range of scenarios. Continued application of GENOA v3 and growth of a user community will potentially support the development of libraries of reduced mechanisms and optimized reduction strategy sets tailored to specific modeling applications.
- Preprint
(6416 KB) - Metadata XML
-
Supplement
(16367 KB) - BibTeX
- EndNote
Status: open (until 17 Jul 2026)
- RC1: 'Comment on egusphere-2026-1921', Anonymous Referee #1, 10 Jun 2026 reply
-
RC2: 'Comment on egusphere-2026-1921', Anonymous Referee #2, 15 Jun 2026
reply
Wang et al. present GENOA v3, a chemical mechanism reduction framework designed for the scalable reduction of highly detailed mechanisms containing millions of reactions and species. GENOA v3 combines two reduction approaches, beginning with a Threshold-Based Reduction (TBR) approach that uses predefined criteria to remove chemically less important reactions and species, followed by a Simulation-Based Reduction (SBR) approach that iteratively evaluates the impact of mechanism reduction on model performance while effectively preserving chemical accuracy. A graph-based representation of chemical mechanisms is employed, making the framework applicable to very large reaction networks. A wide range of scenarios has been used to evaluate the applicability of the GENOA v3 framework, demonstrating that it can successfully reduce highly detailed mechanisms while preserving SOA mass and gas-phase reactivity. Overall, the manuscript makes an important contribution to atmospheric chemistry modeling by providing a scalable strategy for translating highly detailed mechanisms into forms that can be used in large-scale regional and global simulations. The manuscript is well written, falls well within the scope of GMD, and could be published after addressing the following suggestions:
- Section 3: It is scientifically sound, but it is overly technical and detail-oriented. The explanations would benefit by including practical examples. For example, the implementation of the SBR approach discussed in Appendix B5, together with the example provided there, is much more readable and easier to understand than the theoretical explanation presented in Section 3.3 and its subsections of the main manuscript. To avoid repetition and confusion, I suggest moving and integrating this content into the main manuscript. This would reduce redundancy, improve readability, and better highlight the importance of the SBR approach. It is also encouraged the authors to explain other parts of the manuscript in a similar manner.
- Section 3: Please mention a quantitative example showing how many reactions and species can be removed during the TBR and SBR stages. This would help readers understand the practical significance of both reduction steps at this stage of manuscript.
- Section 3.2: What is the importance of the TBR approach? Since it depends on user-defined protocols and predefined thresholds, could it potentially be a source of error? It would be helpful if the TBR strategy were also explained using a mechanism example, similar to the SBR approach presented in Appendix B5. Ideally, the authors could begin with a small example mechanism and then demonstrate both the TBR and SBR reduction processes using that mechanism. The technical details could then be moved to the appendix.
- Section 4.1: mention the "MTs" in line 376 where it first appeared.
- Section 4.3: Since many similar thresholds have already been applied during GECKO-A mechanism generation, why is an additional TBR step necessary? Does this not make the mechanism reduction process more complex? Please clarify whether this step is essential and justify its inclusion.
- In Figures 5–10 and the corresponding discussion sections, the trends are reported, but the underlying mechanisms are not discussed. For example, the authors mention that reduction performance improves with increasing carbon number because larger VOCs produce lower-volatility products that dominate SOA formation; however, this is not qualitatively discussed. The authors may strengthen this argument by presenting cumulative SOA contributions by generation, volatility bin, and dominant species for C7, C12, C16, U8, U16, and LIM systems. Such an analysis would help highlighting that a limited number of dominant species control SOA formation in larger systems.
- Sections 5.2.1 and 5.3.1: It is interesting that small-VOC systems (C5–C7 and U5–U6) behave differently from larger systems and appear to be more prone to over-reduction. The authors suggest that this is due to the exclusion of short-chain oxidation pathways, including C2–C3 chemistry. It would be valuable to perform a pathway attribution analysis to identify which removed pathways (e.g., HOx recycling, PAN formation, formaldehyde/acetaldehyde chemistry, etc.) are primarily responsible for the observed errors.
- Section 4.4.1: The authors note that tracer-species correlations vary across species and VOC classes, and Figure S16 shows that NOx, NO3, and O3 tracers exhibit weaker correlations than OH and HO2. However, these tracers are still included as optimization targets within the SBR framework. This raises the question of whether preserving tracer concentrations necessarily implies preservation of the underlying chemistry. It would be useful to quantitatively relate tracer errors to actual species or pathway errors during the reduction process, particularly for NOx-, NO3-, and O3-related chemistry.
- Sections 5 and 6: Monoterpene (MT) chemistry is known to be highly complex, and this manuscript consistently shows that MT systems exhibit greater structural complexity (Figure 5) and larger reduction errors (Figure 6 and others). It would be beneficial to provide further discussion of the reasons behind this behavior. In particular, identifying which pathways are removed by GENOA during the TBR and SBR stages, and which of these pathways may be important for monoterpene SOA formation, would significantly strengthen the analysis.
- Section 5.2.1: The authors report that the mean tSOA reduction error exceeds the median for T10 reductions because a small number of mechanisms exhibit very large errors, suggesting a highly skewed error distribution and strong outlier influence. It would be useful to include additional statistical characterization to determine whether the mean errors are representative of typical mechanism behavior or are dominated by a small subset of reductions.
- Overall, the manuscript provides a thorough evaluation of reduction performance in terms of mechanism size reduction and preservation of SOA and gas-phase metrics. However, a key question remains unanswered: what chemistry is actually removed during the reduction process? An analysis of representative systems (e.g., U8, C12, and LIM) showing the fractions of removed pathways associated with OH, O3, NOx, RO2, and related chemistry would substantially strengthen the manuscript.
Citation: https://doi.org/10.5194/egusphere-2026-1921-RC2
Model code and software
GENOA v3.0.0: GENerator of Optimized Atmospheric chemical mechanisms Zhizhao Wang https://doi.org/10.5281/zenodo.18807902
Viewed
| HTML | XML | Total | Supplement | BibTeX | EndNote | |
|---|---|---|---|---|---|---|
| 272 | 91 | 24 | 387 | 69 | 22 | 18 |
- HTML: 272
- PDF: 91
- XML: 24
- Total: 387
- Supplement: 69
- BibTeX: 22
- EndNote: 18
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
The paper presents the GENOA v3 framework for reduction of atmospheric chemical mechanisms with huge number of species and reactions such as the highly detailed VOC reaction mechanisms generated by GECKO-A. The work demonstrates that GENOA v3 can effectively reduce highly detailed VOC mechanisms while preserving SOA mass production and gas-phase reactivity with reasonable accuracy across a wide range of atmospherically relevant scenarios.
The reduction follows a rule-based strategy and is performed in two subsequent approaches, first through a threshold-based reduction (TBR) and second through a simulation-based reduction (SBR) that uses the TBR-reduced mechanism as a starting point. Whereas TBR provides a rapid initial pruning of large mechanisms at low computational demand, only the subsequent SBR provides the aggressive reduction needed for application in a large-scale CTM. It is not apparent whether the TBR part can be skipped in principle or whether it is required for reordering or attenuating the starting-point mechanism before it can be used in SBR. The authors currently imply that the starting mechanism needs some refinement before it is possible to run it efficiently in SBR. However, GECKO-A should already generate a mechanism that is not too stiff for theoretically applying it in a 0-D box model.
SBR is not a black box but the reduction comes with the risk that the reduced mechanism does not remain accurate under all given or even not the most critical atmospheric conditions. The error control during mechanism reduction is somewhat vague. Are there any checks and controls happening during the reduction process chain such as atom balance (loss of C or N or O)? The user-defined error tolerances for the reduction errors in SBR are also part of this concern. Not only does the choice of the error tolerance (ε) increase the degrees of freedom of a reduced mechanism created from the same starting mechanism by different users, it is also not demonstrated in this work whether the error of the reduced mechanism is related to the value of ε. Therefore, it is not guaranteed that decreasing ε results in a monotonically decreasing error of simulated concentrations compared to the reference, and may not always provide the smallest reduced mechanism at a given simulation error. Probably different epsilon values should be tested by the user and could result in a reduced mechanism with less species but only slightly different simulation error in comparison to the reference mechanism. My suggestion is to study how number of species and simulation error (NMB) change as function of epsilon for a subset of the mechanism, for example U8.
The more compact chemical mechanism resulting from GENOA v3 may be viable for large-scale atmospheric simulations, although this is not demonstrated in the paper. Since each user-defined set of configurations of targets and error tolerances results in a differently reduced mechanism, it will be difficult for those users who apply the reduced mechanism in a CTM simulation to define the mechanism. The use of the GENOA-reduced mechanism in CTM simulations would require the publication of the full reduced mechanism including the user-specific reduction protocol either before or along with the CTM simulation when publishing model results. A nomenclature for the obtained reduced chemical mechanism should be suggested in this work.
The evaluation of the SBR reduction performance (section 6) is well organized and presented. The reaction pathway visualization is excellent and the example of reaction pathways for U8 with SOA-focused reductions at high and medium error tolerances versus multi-target reduction is well chosen. SBR is the main development of this work, and this part should be more in focus of the paper. Given the minor role of TBR in reducing the original mechanism, it would be sufficient to describe TBR in form of a technical manual in the Supplement or at least to shorten the TBR evaluation part in the manuscript.
The development of GENOA v3 is a very useful instrument for atmospheric chemistry modelling. The evaluation of reduction candidates in terms of SOA and multiple gas-phase reactivity metrices and the capacity to reduce starting mechanisms containing hundred thousands of reactions into compact mechanisms makes this work a valuable contribution.
The revision of the manuscript should include a comparison of the GENOA-reduced mechanism against other condensed chemistry mechanisms such as the Common Representative Intermediate (CRI) scheme from reducing MCM. In addition, my concerns listed below should be addressed.
Specific Comments:
1.) Introduction: two categories of systematic mechanism reduction are put forth in the introduction: chemistry informed rule-based reduction versus machine learning/data driven reduction. The agglomeration of all non-ML techniques into one category is a simplification that ignores other possibilities for mathematical chemistry-based reductions. In the field of combustion flame kinetics, rigorous mechanism reduction has been developed much earlier than in the field of atmospheric chemistry. Nagy and Turányi (2009) developed the simulation error minimization connectivity method which generates thousands of candidate reduced mechanisms on the basis of the inspection of the normalized Jacobian and produces a database, in which the errors of simulations using the reduced mechanisms are recorded. Based on this database, an optimal reduced mechanism is selected. The directed relation graph (DRG) method uses a directed graph to map the coupling of specues and consequently find unimportant species for removal based on an acceptable threshold. Sander et al. (2019) have used the graph-oriented reduction method with error propagation by Niemeyer et al. (2010) to reduce the Mainz Organic Mechanism (MOM) and compared the reduced and the full mechanism in a global simulation. For completeness, MOM that is included in the CAABA/ MECCA box model (Sander etal., 2019) should be mentioned as another representative example of detailed VOC mechanisms.
2.) Section 2.1: before introducing the highlights of GENOA v3, authors should put forth what the criteria are for measuring the performance of GENOA in this study, in terms of scalability, simulation errors in tSOA, simulation errors in O3 production, mass balance, atom balance of C, O and N, etc.
3.) Figure 2: Does the gray downwards arrow mean that output of GENOA v1 and v2 can be used as input for GENOA v3?
4.) Section 3.2.2, branch concentration removal: While it makes sense to remove minor branches, a description is missing how the remaining relative branching ratios are corrected after removing the unimportant branch.
5.) Section 3.3, introduction of SBR: It is stated that the candidate (for the same portion of the mechanism) is selected that achieves the best balance between reduction efficiency and accuracy. How is it ensured that the selected candidate is the best candidate for reduction overall? In other words, how can we be sure that the selected candidate and its subsequent further reduction preclude that a less favorable candidate turn out as better reduction option a few stages downstream? What are “user-defined stopping conditions” and where are they specified?
6.) Section 3.3.3, candidate evaluation criteria: elaborate more on the error tolerances. It seems odd to let the user define the error tolerances. Assuming that the user-defined error tolerances are carefully documented in each reduction step, this nevertheless imposes knowledge about prescription of error tolerances on the users. Please describe the objective reasoning for the specific error tolerances (also see my suggestion in the general comment).
7.) Section 4.1, test mechanism selection: Include a reference to Table 6 already here. Consider including the total number of RO2 species in Table 6.
8.) P16, L 410: Introduce briefly which SOA formation algorithm is applied in the box model simulation. Is the initial SOA seed mass non-volatile? Is the partitioning reversible?
9.) Figure 3: why is there no 10% limit shown in plot (b)?
10.) Table 5: Explain in the text, why the error tolerances increase within one stage and throughout the stages. P22, L 501: It is mentioned here that error tolerances are relaxed after stage A2 without giving the rationale behind this. P22, L 511: Did you test how sensitive is the reduced mechanism to mean reduction errors larger than 10%, for example in the prediction of tSOA, and how does the choice of larger reduction errors affect the size reduction?
11.) P24, L 550: What are the criteria to consider a tSOA error in TBR as acceptable? That should be defined from the outset and may also be different for different PVOC. Related to this (P26, L 583), it is shown that tSOA in T10 is systematically giving 10-20% lower SOA mass than the Reference. This potentially leads to a systematic underestimation of observed SOA in simulation with a box model using this reaction mechanism. Please comment on that issue and how it can be circumvented.
12.) P31, L679-686: A possible conclusion at the end of the TBR performance evaluation could be that T6 can be used for multitarget reduction (gas-phase and SOA related) and T10 only for SOA-related reductions. This also could lead to the conclusion that it is better to streamline reductions with TBR by just allowing two different options, T6 or T10, with a given set of pre-defined reduction parameters for each.
13.) Section 7.2.1, scenario dependence: Not spanning scenarios that occur under certain atmospheric conditions put the mechanism reduction at risk, although the selected 11 scenarios cover most typical environments in the lower atmosphere. An additional scenario should include light-dependent emissions of isoprene, as mixing with isoprene may potentially suppress SOA and this interference is important everywhere in the continental atmosphere. Moreover, a scenario representing conditions of the upper troposphere in the tropics would assist in the reduction for use in global chemistry-transport models.
14.) Section 7.4, integration with machine learning: Although GENOA retains mechanistic interpretability, neither ML-based approaches nor GENOA allow tracking of reduction decisions to find out why a certain reduction candidate was preferred over others. With GENOA there is at least theoretically a chance to trace a reduction decision, but I guess it is terribly complicated to trace it back after several reduction stages. Please comment.
References:
Nagy, T. and Turanyi, T.: Reduction of very large reaction mechanisms using methods based on simulation error minimization, Combustion and Flame, 156, 417–428, https://doi.org/10.1016/j.combustflame.2008.11.001, 2009.
Niemeyer, K. E., Sung, C.-J., and Raju, M. P.: Skeletal mechanism generation for surrogate fuels using directed relation graph with error propagation and sensitivity analysis, Combustion and Flame, 157, 1760–1770, https://doi.org/10.1016/j.combustflame.2009.12.022, 2010.
Sander, R., Baumgaertner, A., Cabrera-Perez, D., Frank, F., Gromov, S., Grooß, J.-U., Harder, H., Huijnen, V., Jöckel, P., Karydis, V. A., Niemeyer, K. E., Pozzer, A., Riede, H., Schultz, M. G., Taraborrelli, D., and Tauer, S.: The community atmospheric chemistry box model CAABA/MECCA-4.0, Geosci. Model Dev., 12, 1365–1385, https://doi.org/10.5194/gmd-12-1365-2019, 2019.