Consistent Point Data Assimilation in Firedrake and Icepack

Nixon-Hill, Reuben W.; Shapero, Daniel; Cotter, Colin J.; Ham, David A.

doi:https://doi.org/10.48550/arXiv.2304.06058

Preprints

https://doi.org/10.48550/arXiv.2304.06058

Preprints

27 Jun 2023

| 27 Jun 2023

Consistent Point Data Assimilation in Firedrake and Icepack

Reuben W. Nixon-Hill, Daniel Shapero, Colin J. Cotter, and David A. Ham

Abstract. When estimating quantities and fields that are difficult to measure directly, such as the fluidity of ice, from point data sources, such as satellite altimetry, it is important to solve a numerical inverse problem that is formulated with Bayesian consistency. Otherwise, the resultant probability density function for the difficult to measure quantity or field will not be appropriately clustered around the truth. In particular, the inverse problem should be formulated by evaluating the numerical solution at the true point locations for direct comparison with the point data source. If the data are first fitted to a gridded or meshed field on the computational grid or mesh, and the inverse problem formulated by comparing the numerical solution to the fitted field, the benefits of additional point data values below the grid density will be lost. We demonstrate, with examples in the fields of groundwater hydrology and glaciology, that a consistent formulation can increase the accuracy of results and aid discourse between modellers and observationalists.

To do this, we bring point data into the finite element method ecosystem as discontinuous fields on meshes of disconnected vertices. Point evaluation can then be formulated as a finite element interpolation operation (dual-evaluation). This new abstraction is well-suited to automation, including automatic differentiation. We demonstrate this through implementation in Firedrake, which generates highly optimised code for solving Partial Differential Equations (PDEs) with the finite element method. Our solution integrates with dolfin-adjoint/pyadjoint, allowing PDE-constrained optimisation problems, such as data assimilation, to be solved through forward and adjoint mode automatic differentiation.

Received: 15 May 2023 – Discussion started: 27 Jun 2023

Download & links

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (0 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

12 Jul 2024

Consistent point data assimilation in Firedrake and Icepack

Reuben W. Nixon-Hill, Daniel Shapero, Colin J. Cotter, and David A. Ham

Geosci. Model Dev., 17, 5369–5386, https://doi.org/10.5194/gmd-17-5369-2024,https://doi.org/10.5194/gmd-17-5369-2024, 2024

Short summary

Reuben W. Nixon-Hill, Daniel Shapero, Colin J. Cotter, and David A. Ham

Interactive discussion

Status: closed

RC1: 'Comment on egusphere-2023-984', Douglas Brinkerhoff, 28 Jul 2023

Please see attached supplement

Citation: https://doi.org/10.5194/egusphere-2023-984-RC1
RC2:
'Comment on egusphere-2023-984', Umberto Villa, 19 Dec 2023
The manuscript “Consistent Point Data Assimilation in Firedrake and Icepack” documents the implementation in Firedrake of a UFL-compatible point-interpolation operator of a finite element function. The key benefit of the proposed implementation is that it enables the computation of gradients and second-order derivatives using the hybrid automatic differentiation capabilities of dolfin-adjoint. The parallel implementation is achieved by use of PETSc data structures for easily computing collisions between points and elements of a finite element mesh. Although from the software point of view the contribution is remarkable, the overall presentation of the manuscript could be improved.
Major issues:
The use of unstructured point data in inverse problems and data assimilation is well established. For example, the hIPPYlib software [1,2,3], an inverse problem Python library based on FEniCS (of which I am a developer), already implements point-wise observations while automating the computations of first- and second-order derivatives by use of FEniCS symbolic differentiation of weak forms. Furthermore, the three examples presented in the paper are related to some applications of hIPPYlib, including the inversion for a diffusion coefficient in the Poisson equation [1], poroelasticity [4], and ice sheet flow [5].

In the numerical results, the choice of the interpolation methods to map the discrete observation to the grid is naïve. A better approach would have been to compare with Gaussian process interpolation (and its 2D/3D counterparts). Gaussian processes do account for uncertainty in the measurement and provide a spatially varying estimate of the variance of the interpolator, thus allowing for more consistent handling of uncertainty measurement and of the fact that the interpolation is less reliable far away from observation points.

The authors claim that the numerical results demonstrate Bayesian consistency of the proposed interpolation method. Why this claim is true, the numerical results do not support such claim as the authors do not solve a Bayesian inverse problem. In particular,
The authors only compute the MAP point, while the solution of the Bayesian inverse problem requires the ability to characterize (either by sampling or using variational inference the posterior distribution.

The authors use Tikhonov regularization on the spatial gradient of the unknown parameter as the logarithmic of the prior distribution. However, since the inverse of the Laplace operator in two or three spatial dimensions is not a trace-class operator, the Bayesian inverse problem is not well-posed with that choice of prior [6].

The L-curve, Morozov, or cross-validation criterion are commonly used in the deterministic formulation of inverse problems. In the Bayesian formulation, hyper-priors are often used to tune the variance and correlation length of the prior distribution [7].

The definition of Bayesian consistency is not interpreted correctly. Below formula (31) the authors claim that as the number of observation points goes to infinity then q_est should converge toward q_true. This is not what Bayesian consistency is. In fact, there are several cases in which it is impossible to recover the true parameter exactly no matter how many observation points are used, simply because there is an intrinsic loss of information in the continuous map from the parameter to the state variable.

It is not clear what is the solution approach in the second example. Are the authors employing a Kalman filter or are they solving for a MAP point and then invoking the Laplace approximation to the posterior to estimate the posterior PDF?

Given that the main contribution of the paper is the implementation in UFL/Firedrake of a point-wise observation operator, it would be good to numerically demonstrate the scalability of the implementation with respect to the number of observation points and the number of MPI processes for both 2D and 3D geometries.

The overall quality of the exposition is not suitable for publication.
There are several inconsistencies in the mathematical notation that make the manuscript difficult to follow. For example, sometimes the authors use J^point and sometimes simply J to denote the point-wise misfit, and similarly sometimes use J^field and sometimes J’ to denote the interpolation-based data fidelity.

Section 2 is a mix between the very basic theory of finite elements and more advanced concepts like dual basis. However, the section lacks of mathematical rigor in the notation and there is an imbalance of content. In particular, the description of the foundation of finite elements is too long and well-known (e.g. Fig 1 could be removed), while the most advanced most relevevant to this work (the dual basis and the Charlet triplet) lack of details.

The presentation of the paper is very informal (e.g. the use of “our”, to refer to “our” finite element space, “our” finite element function, “our” inverse problem) and there are several typos in the mathematical formulas (e.g. there appear to be an extra semicolon in Eq (11) and (12); or inconsistency between \tilde{x} in Eq (13) and the text below where \hat{x} is used.

In consideration of the major issues noted above I do not believe that the manuscript is suitable for publication in the present form.
Also, given the fact that in general the use of point-wise observations is well understood by the inverse problem and data assimilation communities, I am wondering if the manuscript should be more focused on the true (and remarkable) contribution that is the implementation of point-wise observation within UFL/Firedrake. As such, a manuscript focused on the implementation details and algorithms as well as the assessment of its numerical performances may be better suited to a software journal such as SoftwareX or TOMs.

References:

[1] Umberto Villa, Noemi Petra, and Omar Ghattas. "HIPPYlib: an extensible software framework for large-scale inverse problems governed by PDEs: part I: deterministic inversion and linearized Bayesian inference." ACM Transactions on Mathematical Software (TOMS) 47, no. 2 (2021): 1-34.
[2] Kim, Ki-Tae, Umberto Villa, Matthew Parno, Youssef Marzouk, Omar Ghattas, and Noemi Petra. "hIPPYlib-MUQ: A Bayesian inference software framework for integration of data with complex predictive models under uncertainty." ACM Transactions on Mathematical Software (2023).
[3] Villa, Umberto, Noemi Petra, and Omar Ghattas. "hIPPYlib: An extensible software framework for large-scale inverse problems." Journal of Open Source Software 3, no. 30 (2018).
[4] Alghamdi, Amal, Marc A. Hesse, Jingyi Chen, and Omar Ghattas. "Bayesian poroelastic aquifer characterization from InSAR surface deformation data. Part I: Maximum a posteriori estimate." Water Resources Research 56, no. 10 (2020): e2020WR027391.
[5] Olalekan Babaniyi, Ruanui Nicholson, Umberto Villa, and Noémi Petra. "Inferring the basal sliding coefficient field for the Stokes ice sheet model under rheological uncertainty." The Cryosphere 15, no. 4 (2021): 1731-1750.
[6] Andrew M Stuart "Inverse problems: a Bayesian perspective." Acta numerica 19 (2010): 451-559.
[7] Daniela Calvetti and Erkki Somersalo. "Hypermodels in the Bayesian imaging framework." Inverse Problems 24, no. 3 (2008): 034013.
Citation: https://doi.org/10.5194/egusphere-2023-984-RC2
AC1: 'Comment on egusphere-2023-984', Reuben Nixon-Hill, 16 Feb 2024

Response to reviewer 1: please see attached supplement.

Citation: https://doi.org/10.5194/egusphere-2023-984-AC1
AC2: 'Comment on egusphere-2023-984', Reuben Nixon-Hill, 16 Feb 2024

Response to reviewer 2: please see attached supplement.

Citation: https://doi.org/10.5194/egusphere-2023-984-AC2

Interactive discussion

Status: closed

RC1: 'Comment on egusphere-2023-984', Douglas Brinkerhoff, 28 Jul 2023

Please see attached supplement

Citation: https://doi.org/10.5194/egusphere-2023-984-RC1
RC2:
'Comment on egusphere-2023-984', Umberto Villa, 19 Dec 2023
The manuscript “Consistent Point Data Assimilation in Firedrake and Icepack” documents the implementation in Firedrake of a UFL-compatible point-interpolation operator of a finite element function. The key benefit of the proposed implementation is that it enables the computation of gradients and second-order derivatives using the hybrid automatic differentiation capabilities of dolfin-adjoint. The parallel implementation is achieved by use of PETSc data structures for easily computing collisions between points and elements of a finite element mesh. Although from the software point of view the contribution is remarkable, the overall presentation of the manuscript could be improved.
Major issues:
The use of unstructured point data in inverse problems and data assimilation is well established. For example, the hIPPYlib software [1,2,3], an inverse problem Python library based on FEniCS (of which I am a developer), already implements point-wise observations while automating the computations of first- and second-order derivatives by use of FEniCS symbolic differentiation of weak forms. Furthermore, the three examples presented in the paper are related to some applications of hIPPYlib, including the inversion for a diffusion coefficient in the Poisson equation [1], poroelasticity [4], and ice sheet flow [5].

In the numerical results, the choice of the interpolation methods to map the discrete observation to the grid is naïve. A better approach would have been to compare with Gaussian process interpolation (and its 2D/3D counterparts). Gaussian processes do account for uncertainty in the measurement and provide a spatially varying estimate of the variance of the interpolator, thus allowing for more consistent handling of uncertainty measurement and of the fact that the interpolation is less reliable far away from observation points.

The authors claim that the numerical results demonstrate Bayesian consistency of the proposed interpolation method. Why this claim is true, the numerical results do not support such claim as the authors do not solve a Bayesian inverse problem. In particular,
The authors only compute the MAP point, while the solution of the Bayesian inverse problem requires the ability to characterize (either by sampling or using variational inference the posterior distribution.

The authors use Tikhonov regularization on the spatial gradient of the unknown parameter as the logarithmic of the prior distribution. However, since the inverse of the Laplace operator in two or three spatial dimensions is not a trace-class operator, the Bayesian inverse problem is not well-posed with that choice of prior [6].

The L-curve, Morozov, or cross-validation criterion are commonly used in the deterministic formulation of inverse problems. In the Bayesian formulation, hyper-priors are often used to tune the variance and correlation length of the prior distribution [7].

The definition of Bayesian consistency is not interpreted correctly. Below formula (31) the authors claim that as the number of observation points goes to infinity then q_est should converge toward q_true. This is not what Bayesian consistency is. In fact, there are several cases in which it is impossible to recover the true parameter exactly no matter how many observation points are used, simply because there is an intrinsic loss of information in the continuous map from the parameter to the state variable.

It is not clear what is the solution approach in the second example. Are the authors employing a Kalman filter or are they solving for a MAP point and then invoking the Laplace approximation to the posterior to estimate the posterior PDF?

Given that the main contribution of the paper is the implementation in UFL/Firedrake of a point-wise observation operator, it would be good to numerically demonstrate the scalability of the implementation with respect to the number of observation points and the number of MPI processes for both 2D and 3D geometries.

The overall quality of the exposition is not suitable for publication.
There are several inconsistencies in the mathematical notation that make the manuscript difficult to follow. For example, sometimes the authors use J^point and sometimes simply J to denote the point-wise misfit, and similarly sometimes use J^field and sometimes J’ to denote the interpolation-based data fidelity.

Section 2 is a mix between the very basic theory of finite elements and more advanced concepts like dual basis. However, the section lacks of mathematical rigor in the notation and there is an imbalance of content. In particular, the description of the foundation of finite elements is too long and well-known (e.g. Fig 1 could be removed), while the most advanced most relevevant to this work (the dual basis and the Charlet triplet) lack of details.

The presentation of the paper is very informal (e.g. the use of “our”, to refer to “our” finite element space, “our” finite element function, “our” inverse problem) and there are several typos in the mathematical formulas (e.g. there appear to be an extra semicolon in Eq (11) and (12); or inconsistency between \tilde{x} in Eq (13) and the text below where \hat{x} is used.

In consideration of the major issues noted above I do not believe that the manuscript is suitable for publication in the present form.
Also, given the fact that in general the use of point-wise observations is well understood by the inverse problem and data assimilation communities, I am wondering if the manuscript should be more focused on the true (and remarkable) contribution that is the implementation of point-wise observation within UFL/Firedrake. As such, a manuscript focused on the implementation details and algorithms as well as the assessment of its numerical performances may be better suited to a software journal such as SoftwareX or TOMs.

References:

[1] Umberto Villa, Noemi Petra, and Omar Ghattas. "HIPPYlib: an extensible software framework for large-scale inverse problems governed by PDEs: part I: deterministic inversion and linearized Bayesian inference." ACM Transactions on Mathematical Software (TOMS) 47, no. 2 (2021): 1-34.
[2] Kim, Ki-Tae, Umberto Villa, Matthew Parno, Youssef Marzouk, Omar Ghattas, and Noemi Petra. "hIPPYlib-MUQ: A Bayesian inference software framework for integration of data with complex predictive models under uncertainty." ACM Transactions on Mathematical Software (2023).
[3] Villa, Umberto, Noemi Petra, and Omar Ghattas. "hIPPYlib: An extensible software framework for large-scale inverse problems." Journal of Open Source Software 3, no. 30 (2018).
[4] Alghamdi, Amal, Marc A. Hesse, Jingyi Chen, and Omar Ghattas. "Bayesian poroelastic aquifer characterization from InSAR surface deformation data. Part I: Maximum a posteriori estimate." Water Resources Research 56, no. 10 (2020): e2020WR027391.
[5] Olalekan Babaniyi, Ruanui Nicholson, Umberto Villa, and Noémi Petra. "Inferring the basal sliding coefficient field for the Stokes ice sheet model under rheological uncertainty." The Cryosphere 15, no. 4 (2021): 1731-1750.
[6] Andrew M Stuart "Inverse problems: a Bayesian perspective." Acta numerica 19 (2010): 451-559.
[7] Daniela Calvetti and Erkki Somersalo. "Hypermodels in the Bayesian imaging framework." Inverse Problems 24, no. 3 (2008): 034013.
Citation: https://doi.org/10.5194/egusphere-2023-984-RC2
AC1: 'Comment on egusphere-2023-984', Reuben Nixon-Hill, 16 Feb 2024

Response to reviewer 1: please see attached supplement.

Citation: https://doi.org/10.5194/egusphere-2023-984-AC1
AC2: 'Comment on egusphere-2023-984', Reuben Nixon-Hill, 16 Feb 2024

Response to reviewer 2: please see attached supplement.

Citation: https://doi.org/10.5194/egusphere-2023-984-AC2

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Reuben Nixon-Hill on behalf of the Authors (26 Mar 2024) Author's response Author's tracked changes Manuscript

ED: Referee Nomination & Report Request started (08 Apr 2024) by Qiang Wang

RR by Umberto Villa (12 Apr 2024)

ED: Publish subject to technical corrections (04 May 2024) by Qiang Wang

AR by Reuben Nixon-Hill on behalf of the Authors (17 May 2024) Manuscript

Journal article(s) based on this preprint

12 Jul 2024

Consistent point data assimilation in Firedrake and Icepack

Reuben W. Nixon-Hill, Daniel Shapero, Colin J. Cotter, and David A. Ham

Geosci. Model Dev., 17, 5369–5386, https://doi.org/10.5194/gmd-17-5369-2024,https://doi.org/10.5194/gmd-17-5369-2024, 2024

Short summary

Reuben W. Nixon-Hill, Daniel Shapero, Colin J. Cotter, and David A. Ham

Viewed

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 294 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
292	0	2	294	3	3

HTML: 292
PDF: 0
XML: 2
Total: 294
BibTeX: 3
EndNote: 3

Views and downloads (calculated since 27 Jun 2023)

Month	HTML	PDF	XML
Jun 2023	31	0	31
Jul 2023	40	2	42
Aug 2023	31	0	31
Sep 2023	17	0	17
Oct 2023	28	0	28
Nov 2023	6	0	6
Dec 2023	29	0	29
Jan 2024	9	0	9
Feb 2024	40	0	40
Mar 2024	10	0	10
Apr 2024	14	0	14
May 2024	14	0	14
Jun 2024	14	0	14
Jul 2024	9	0	9
Aug 2024	0
Sep 2024	0

Cumulative views and downloads (calculated since 27 Jun 2023)

Month	HTML	PDF	XML
Jun 2023	31	0	31
Jul 2023	40	2	42
Aug 2023	31	0	31
Sep 2023	17	0	17
Oct 2023	28	0	28
Nov 2023	6	0	6
Dec 2023	29	0	29
Jan 2024	9	0	9
Feb 2024	40	0	40
Mar 2024	10	0	10
Apr 2024	14	0	14
May 2024	14	0	14
Jun 2024	14	0	14
Jul 2024	9	0	9
Aug 2024	0
Sep 2024	0

Viewed (geographical distribution)

Since the preprint corresponding to this journal article was posted outside of Copernicus Publications, the preprint-related metrics are limited to HTML views.

Total article views: 283 (including HTML, PDF, and XML) Thereof 283 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 03 Sep 2024

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint

Final revised paper

Short summary

Scientists often use models to study complex processes, like the movement of ice sheets, and compare them to measurements for estimating hard-to-measure quantities. We highlight an approach that ensures accurate results from point data sources (such as height measurements) by evaluating the numerical solution at true point locations. This method improves accuracy, can aid communication between scientists, and is well suited for integration with specialised software that automates the processes.


Total:	0
HTML:	0
PDF:	0
XML:	0

Consistent Point Data Assimilation in Firedrake and Icepack

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Suggestions for revision or reasons for rejection

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)