Inverse modelling for surface methane flux estimation with 4DVar: impact of a computationally efficient representation of a non-diagonal B-matrix in INVICAT v4

Bannister, Ross Noel; Wilson, Chris

doi:10.5194/egusphere-2024-655

Preprints

07 Mar 2024

Abstract. Prior information is essential to most inverse problems, and the surface flux estimation problem is no exception. The uncertainties of the prior fields, and their inter-correlations, should ideally be reflected in the a-priori error covariance matrix, often called B. The B-matrix is, however, difficult to quantify, partly because it is typically a large matrix and partly because its numerical values are unknown.

We present a highly efficient method of representing the B-matrix, which describes prior errors in the initial concentration and in the time sequence of surface fluxes, for the 4DVar-based inverse modelling system (INVICAT) used to estimate the surface fluxes of methane. Our formulation is based on a spectral formulation of the square-root of B, which we believe has not been used in any such inverse modelling system before. It allows horizontal and vertical error correlations of the initial concentration, and horizontal and temporal error correlations of the flux, to be represented. We provide full mathematical details. Our scheme allows the various correlation components to be switched on/off and the respective length-scales and timescales to be set, in a way that is much more computationally efficient than representing such a B-matrix explicitly.
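
As a rough illustration of why a square-root formulation avoids storing B, the sketch below applies U (with B = U U^T) to a vector through a transform and checks the result against an explicitly built matrix. It is a minimal sketch under stated assumptions: a 1-D periodic grid with an FFT stands in for the sphere and its spectral transform, and the names `soar` and `apply_sqrt_b`, together with the grid size, length-scale and standard deviations, are hypothetical and not taken from INVICAT.

```python
import numpy as np

def soar(r, length):
    """Second-order auto-regressive correlation, (1 + r/L) exp(-r/L)."""
    return (1.0 + r / length) * np.exp(-r / length)

n = 128                                   # points on a periodic 1-D grid (assumed)
length = 3.0                              # correlation length-scale in grid units (assumed)
sigma = np.linspace(0.5, 1.5, n)          # prior standard deviations (assumed)

# Correlation as a function of circular separation: the first row of a
# circulant correlation matrix C.
sep = np.minimum(np.arange(n), n - np.arange(n))
first_row = soar(sep, length)

# A circulant matrix is diagonalised by the FFT, so its eigenvalues are the
# FFT of its first row. Clip any tiny negative values caused by round-off.
eigvals = np.maximum(np.fft.fft(first_row).real, 0.0)
sqrt_eigvals = np.sqrt(eigvals)

def apply_sqrt_b(v):
    """Return U v with U = Sigma C^(1/2); B itself is never formed."""
    return sigma * np.fft.ifft(sqrt_eigvals * np.fft.fft(v)).real

# Explicit B, built only to verify the implicit operator on this toy problem.
idx = np.arange(n)
corr = first_row[(idx[None, :] - idx[:, None]) % n]
b_explicit = sigma[:, None] * corr * sigma[None, :]

# Assemble U column by column and compare U U^T with the explicit matrix.
u = np.column_stack([apply_sqrt_b(e) for e in np.eye(n)])
b_implicit = u @ u.T
print(np.allclose(b_implicit, b_explicit))
```

In this toy setting the implicit operator needs only the n spectral weights and n standard deviations, whereas the explicit matrix needs n² entries. Because B is built as U U^T, it is symmetric and positive semi-definite by construction.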

We test 14 configurations of the B-matrix (including the diagonal configuration) in a 100-day test assimilation of surface flask measurements of methane. We measure the performance of each by comparing the analysis to unassimilated observations held back for evaluation purposes. We find that the diagonal configuration is amongst the poorest performing choices of B. The best performing choice uses the spectral method. It does not include correlations for the initial concentration field, but does account for spatio-temporal correlations for the fluxes. These have the form of a SOAR (second-order auto-regressive) function with a correlation length-scale of 600 km and a timescale of 3 months. Our results demonstrate the effectiveness of our method, which is applicable to very high resolution inverse modelling systems. We propose that potential biases in the prior initial condition field may be the reason for the poor performance when correlations in the prior initial concentration field are used.
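
To give a feel for the correlation shape quoted above, the following sketch evaluates a SOAR function at a few separations using the 600 km length-scale and 3-month timescale from the abstract. It assumes the common form c(r) = (1 + r/L) exp(-r/L) and, purely for illustration, that the spatial and temporal factors multiply. Neither assumption is confirmed by the text here, and the function names are hypothetical.

```python
import math

def soar(separation, scale):
    """SOAR correlation, assumed form (1 + r/L) * exp(-r/L)."""
    ratio = abs(separation) / scale
    return (1.0 + ratio) * math.exp(-ratio)

def flux_correlation(dist_km, lag_months, length_km=600.0, timescale_months=3.0):
    """Space-time correlation as a product of two SOAR factors.

    Separability of space and time is an assumption made for this sketch.
    """
    return soar(dist_km, length_km) * soar(lag_months, timescale_months)

# Correlation at zero time lag for three horizontal separations.
for dist_km in (0.0, 600.0, 1200.0):
    print(dist_km, round(flux_correlation(dist_km, 0.0), 3))
```

Under this assumed form, the correlation at a separation of one length-scale is 2/e, roughly 0.74, so the function decays more slowly near the origin than a simple exponential with the same scale.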

Received: 04 Mar 2024 – Discussion started: 07 Mar 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-655', Anonymous Referee #1, 21 Mar 2024
General comments
The authors discuss the calculations associated with the a priori error covariance matrix in a variational inversion system. They propose a spectral formulation of the matrix in order to optimize them. Without being new, the article is interesting, stimulating, educational and generally well written. That being said, the presentation is rather paradoxical in several respects and could be misleading. As it stands, some students might invest in the spectral formulation in vain and, therefore, I cannot support the publication of the article yet.
The first paradox of the presentation is the motivation: “The spectral method is very efficient. It is applicable to systems with very high resolutions, where existing methods that explicitly represent the B-matrix would not be feasible (Appendix D)” (l. 377-379 and similar sentences in the rest of the text). Actually, looking at Figure D1 about the cost of the two approaches, one can see that the authors’ illustration is on the lower end (L = 32, small resolution), while current results with an explicit representation reach the high end (L seems to be about 400 in doi:10.22541/au.171052488.85903583/v1). Where is the better efficiency argued in l. 66? The authors seem to ignore that the explicit representation is simplified by numerous zero correlations in the 2D flux errors:
no space-time correlations (l. 160), allowing the use of a simple Kronecker product between time and space correlation matrices, see, e.g., https://doi.org/10.5194/acp-8-6341-2008, https://doi.org/10.5194/gmd-6-583-2013, or https://doi.org/10.1016/B978-0-12-814952-2.00008-3 – I am accumulating the citations here to show that this is a common method rather than a professional secret

no correlations between land and ocean for surface flux errors, and, if needed, between continents or between certain ocean basins. Therefore, the effective dimension to manage can be much smaller than for the spectral method which requires the entire globe (hidden cost of the remark made in l. 407-408 that suggests duplicating the control vector when using the spectral method)
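
The Kronecker-product shortcut mentioned in the first item can be sketched as follows. This is a minimal illustration, not code from any of the cited systems: the matrix sizes, the exponential correlation shapes and the names `exp_corr` and `kron_matvec` are assumptions. The point is only that the product of a time-by-space Kronecker matrix with a vector can be evaluated from the two small factors without assembling the full matrix.

```python
import numpy as np

def exp_corr(n, scale):
    """Toy correlation matrix exp(-|i - j| / scale) on n equally spaced points."""
    idx = np.arange(n)
    return np.exp(-np.abs(idx[:, None] - idx[None, :]) / scale)

def kron_matvec(c_time, c_space, v):
    """Compute (c_time kron c_space) @ v using only the small factors.

    v is ordered time-major: all space points for step 0, then step 1, ...
    """
    nt, ns = c_time.shape[0], c_space.shape[0]
    field = v.reshape(nt, ns)
    return (c_time @ field @ c_space.T).ravel()

nt, ns = 6, 40                        # toy numbers of time steps and grid cells
c_time = exp_corr(nt, 2.0)
c_space = exp_corr(ns, 5.0)
rng = np.random.default_rng(0)
v = rng.standard_normal(nt * ns)

fast = kron_matvec(c_time, c_space, v)
slow = np.kron(c_time, c_space) @ v   # forms the full (nt*ns) x (nt*ns) matrix
print(np.allclose(fast, slow))
```

The factored form stores nt² + ns² numbers rather than (nt·ns)², which is the saving the referee alludes to when space-time cross-correlations are assumed to be separable.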

One could argue that these two items do not apply for the 3D initial state, which is much more challenging. Actually, the authors find that assigning spatial prior error correlations for the initial state degrades the inversion (see their embarrassed explanation in l. 417-420). Leaving aside this surprising result (to say the least), we see that this part of the control vector is similar to the numerical weather prediction (NWP) context where spectral methods emerged, but is this really the goal of atmospheric inversion? Atmospheric inversions suffer from edge effects and it is usual to cut off both ends: in contrast to NWP, obtaining the optimal initial state is not strategic and therefore the representation of its prior uncertainty can be simplified.
The second paradox of the paper is related. The objective of the method is to facilitate the resolution increase, but the detail of the increments is blurred by the horizontal reconfiguration operator Rh. Errera and Ménard (2012, cited twice in the paper) clearly demonstrated that “The practice of producing analysis increments on a horizontal Gaussian grid and then interpolating to an equally spaced grid is also shown to produce a degradation of the analysis.” In that case, who wants to use it?
Minor comments
l. 52-53: this is not specific to 4D-Var but concerns all ill-posed inverse problems, explicitly or not

l. 176: the horizontal reconfiguration operator Rh should be detailed (in particular, the need of local mass conservation and the way to handle mixed land-ocean pixels on one of the grids)

l. 219: that way of doing renders the test useless. In the diagonal case, variances should be inflated to compensate for missing correlations and conserve a realistic total error budget

l. 467: the word “unfortunate” is too subjective

Section A3: the result of a classical adjoint test at the machine epsilon should be given to support the implementation
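
For readers unfamiliar with the adjoint test requested in the last item, a minimal sketch follows. It checks that the inner products <U v, w> and <v, U^T w> agree to round-off for random vectors. The operator here (a running sum followed by a pointwise scaling) is a toy stand-in chosen for brevity, not the operator of Section A3, and all names are hypothetical.

```python
import numpy as np

n = 200
rng = np.random.default_rng(1)
sigma = rng.uniform(0.5, 2.0, n)      # toy scaling factors (assumed)

def forward(v):
    """Toy linear operator U: running sum, then pointwise scaling."""
    return sigma * np.cumsum(v)

def adjoint(w):
    """Hand-coded adjoint U^T: scaling, then running sum taken from the end."""
    return np.cumsum((sigma * w)[::-1])[::-1]

v = rng.standard_normal(n)
w = rng.standard_normal(n)

lhs = np.dot(forward(v), w)           # <U v, w>
rhs = np.dot(v, adjoint(w))           # <v, U^T w>

# Normalise by the vector norms so the measure stays meaningful even if the
# inner product itself happens to be small.
rel_diff = abs(lhs - rhs) / (np.linalg.norm(forward(v)) * np.linalg.norm(w))
print(rel_diff)
```

A correctly coded adjoint should give a relative difference within a few multiples of machine epsilon for double precision. A value many orders of magnitude larger usually points to a coding error in the adjoint.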
Citation: https://doi.org/10.5194/egusphere-2024-655-RC1
- AC1: 'Reply on RC1', Ross Bannister, 01 Apr 2024
  
  We would like to thank Referee #1 for his/her comments on our work. Referee #1 has raised some interesting points, which we would like to resolve. Please see the attached file.
  
  Citation: https://doi.org/10.5194/egusphere-2024-655-AC1
RC2:
'Comment on egusphere-2024-655', Anonymous Referee #2, 28 Apr 2024
This paper presents a spectral method to efficiently account for prior error correlations in 4DVAR inversions with very large state vectors.
The paper poses an important problem regarding the specification of prior error covariance matrices for inversions with large state vectors, and I found the reduced spectral representation to be interesting. Whether it’s actually practical is not clear, and the demo seems like an odd choice since it is so poorly observationally constrained. The work seems original for atmospheric chemistry applications though, and the method appears to be sound in principle though I could not follow all the math. I am supportive of publication because it could be useful to some inversion practitioners.
Specific comments:
Lines 38-40: the ‘direct inversion’ is usually called analytical inversion in the atmospheric chemistry literature and has been used extensively for methane by the Harvard group. It is not really fair to say that it is limited ‘to a relatively small number of large-area surface regions’ because the Jacobian matrix can be computed as an embarrassingly parallel problem. Maasakkers 2021 cited in the paper used the native 50-km resolution of the transport model, and a more recent application by Nesser et al. uses native 25-km resolution (https://doi.org/10.5194/egusphere-2023-946). A big advantage of the analytical inversion is that it provides closed-form posterior error covariance matrices enabling characterization of information content, and it allows immediate generation of inversion ensembles, cf. Jacob et al. https://doi.org/10.5194/acp-22-9617-2022

Line 42: should also mention the LETKF approach used by Miyazaki at JPL, for example https://doi.org/10.5194/acp-17-807-2017, which has similarity to the transform done here.

Lines 43-46: There should be some mention that the size of the state vector is limited not only by computational resources but also by information content. A problem with 4DVAR and EnKF methods is that information content is not directly characterized.

Line 60: another problem in empirical construction of B is ensuring that it is positive definite – this typically requires massaging the matrix after empirical construction of the off-diagonals.

Line 131: why does the change in gridbox size with latitude matter?

Line 152: will Bsp always be positive definite?

Lines 169-172: Spectral methods and spherical harmonics don’t work great for atmospheric chemistry problems because the variability of chemical species does not follow wave structures. I’m not sure if this is relevant here, but the spectral approach is most useful when there is global structure in the error to be resolved and that is generally not the case for atmospheric chemistry problems.

Line 218: ‘weak sink over Antarctica’. Weird. Where does this sink come from? Admittedly it doesn’t matter for your demo.

Line 226: errors in atmospheric chemistry problems generally do not show ‘homogeneous and isotropic correlations’.

Figure 2: The correlation length scale looks more like ten degrees, so ~1000 km. Seems long, particularly for methane fluxes which come from a diversity of uncorrelated source sectors. What is the rationale for a spatial error correlation in methane fluxes?

Figure 3: weird to have the vertical error correlation plot emphasize the stratosphere in a demo for methane fluxes.

Line 284: The small information content to be obtained from the 60 NOAA stations is incommensurate to the size of the state vector. That would explain why the results are disappointing.
Citation: https://doi.org/10.5194/egusphere-2024-655-RC2
- AC2: 'Reply on RC2', Ross Bannister, 12 Jul 2024
  
  Many thanks to reviewer 2 for their careful review of our manuscript.
  
  Our responses can be found in the attached file.
  
  Citation: https://doi.org/10.5194/egusphere-2024-655-AC2

Model code and software

Inverse modelling for surface methane flux estimation with 4DVar: impact of a computationally efficient representation of a non-diagonal B-matrix. Ross Bannister and Chris Wilson. https://doi.org/10.5281/zenodo.10777737

Short summary

Prior information is essential for the top-down estimation of CH₄ surface fluxes. Errors in the prior are correlated in time and space, but accounting for these correlations can be costly. We report on an efficient scheme to represent correlations in the inverse modelling system INVICAT. The scheme is tested by assimilating CH₄ observations. Our findings show that accounting for spatio-temporal correlations improves CH₄ flux estimates, which supports further use of the method.
