Preprints
https://doi.org/10.5194/egusphere-2026-1817
https://doi.org/10.5194/egusphere-2026-1817
11 May 2026
 | 11 May 2026
Status: this preprint is open for discussion and under review for Geoscientific Model Development (GMD).

A robustness diagnostic framework for NMIP ensembles: Application to NMIP2 soil N2O emission estimates

Motoko Inatomi

Abstract. Multi-model ensembles, such as the global N2O Model Intercomparison Project (NMIP), are essential for quantifying terrestrial nitrous oxide (N2O) fluxes; however, interpreting where ensemble results are reliable and where they are not remains challenging. Here, we propose a robustness diagnostic framework (RDF) that classifies each grid cell into four categories: Robust-increase, Robust-decrease, Divergent and Uncertain, based on the three metrics of ensemble-mean change, model agreement on the sign of change and inter-model standard deviation. By explicitly separating directional disagreement (Divergent) from quantitative disagreement (Uncertain), the RDF reveals the nature of model uncertainty rather than its magnitude alone. We applied the RDF to soil N2O emission estimates from eight terrestrial biosphere models participating in NMIP phase 2 (NMIP2), comparing pre-industrial (1850s) and contemporary (2010s) periods. Of 56,852 valid grid cells, 40.8 % were classified as Robust-increase, 2.6 % as Robust-decrease, 36.1 % as Divergent and 20.5 % as Uncertain. Stratification by land-use type revealed that the nature of uncertainty differed fundamentally: cropland-dominated regions were dominated by Uncertain (72.6 %), indicating an agreement on the direction of N2O increase but a large quantitative spread, whereas forest-dominated regions were dominated by Divergent (50.0 %), indicating disagreement on the direction of change itself. Pasture-dominated regions exhibited the highest robustness (59.1 % Robust-increase). The inter-model spread correlated strongly with nitrogen input intensity (Spearman ρ = 0.75), and the Divergent to Uncertain transition followed a gradient of cropland fraction. These contrasting patterns implied that different model improvement strategies were needed: observational benchmarking of emission magnitudes for croplands, and improved process understanding of the competition between carbon dioxide (CO2) fertilization and warming effects in forests. The proposed framework is general and applicable to any multi-model ensemble of biogeochemical change.

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.
Share
Motoko Inatomi

Status: open (until 06 Jul 2026)

Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor | : Report abuse
Motoko Inatomi

Model code and software

Analysis code for "A robustness diagnostic framework for NMIP ensembles: Application to NMIP2 soil N₂O emission estimates" (v1.0.0) Motoko Inatomi https://zenodo.org/records/19083012

Motoko Inatomi

Viewed

Total article views: 33 (including HTML, PDF, and XML)
HTML PDF XML Total Supplement BibTeX EndNote
26 5 2 33 3 1 1
  • HTML: 26
  • PDF: 5
  • XML: 2
  • Total: 33
  • Supplement: 3
  • BibTeX: 1
  • EndNote: 1
Views and downloads (calculated since 11 May 2026)
Cumulative views and downloads (calculated since 11 May 2026)

Viewed (geographical distribution)

Total article views: 33 (including HTML, PDF, and XML) Thereof 33 with geography defined and 0 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 
Latest update: 13 May 2026
Download
Short summary
We propose a framework to assess where multi-model estimates of soil nitrous oxide emissions are reliable. Each location is classified by direction of change, model agreement, and spread among models. Applying this to eight models from the N2O Model Intercomparison Project, we find that disagreement depends on land use: in croplands, models agree on increasing emissions but differ on magnitude; in forests, models disagree on direction. This implies different model improvement priorities.
Share