the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Fractional Empirical Orthogonal Functions for Geophysical Fields with Anomalous Transport: Theory and Validation
Abstract. Empirical Orthogonal Function (EOF) analysis and its rotated variant (REOF) are foundational tools in the geosciences for decomposing spatiotemporal variability. However, the standard methodology implicitly assumes Gaussian statistics and exponentially decaying correlations, assumptions that are violated in many geophysical systems exhibiting anomalous diffusion, heavy-tailed distributions, and long-range spatial correlations. We develop a theoretical framework for fractional EOF (fEOF) analysis that extends the standard methodology by incorporating the fractional Laplacian operator into the covariance structure. The governing dynamics are formulated using the Riemann–Liouville fractional time derivative of order μ > 0, which is not restricted to the interval (0,1] and thereby accommodates both subdiffusive and superdiffusive transport regimes within a single formalism. The resulting fractional covariance operator naturally captures power-law correlations characteristic of anomalous transport in geophysical flows. We prove that the eigenvalue spectrum of the fractional covariance operator exhibits enhanced power-law decay λm(α) ∼ m−(1+α+β/d), where the spatial fractional order α ∈ (0,2) provides a tunable control parameter independent of the underlying spectral slope β. The temporal evolution of fractional principal components follows Mittag-Leffler relaxation, interpolating between stretched exponential and power-law regimes. We validate the theoretical predictions through three independent approaches: (i) exact analytical results for fractional Brownian surfaces across three Hurst exponents (H = 0.3, 0.5, 0.7), confirming eigenvalue steepening to within 6 % of theoretical predictions with finite-domain corrections identified; (ii) spectral analysis of fields generated by the space-time fractional diffusion equation across spectral slopes β = 2, 3, 4, recovering predicted exponents to within 2–6 %; and (iii) Monte Carlo experiments over 50 realizations demonstrating that the eigenvalue scaling is distribution-independent, holding identically for Gaussian and heavy-tailed Student-t fields while achieving 5- to 8-fold reductions in the number of modes required for 95 % variance capture. The sensitivity analysis across fractional orders α ∈ [0, 1.8] confirms the predicted linear steepening relation να = ν0 + α to within 3 % throughout. The framework applies to a broad class of geophysical fields exhibiting anomalous transport, including oceanic tracer dispersion, flood inundation dynamics, atmospheric constituent spreading, and soil moisture redistribution. Connections to Okubo's empirical oceanic diffusion scaling and the Forecasting Inundation Extents using REOF (FIER) framework are discussed as illustrative applications.
- Preprint
(1473 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2026-753', Anonymous Referee #1, 29 Apr 2026
- AC1: 'Reply on RC1', Farrukh Chishtie, 15 Jun 2026
-
RC2: 'Comment on egusphere-2026-753', Anonymous Referee #2, 18 May 2026
The author proposes in this manuscript a linear generalization of the Empirical Orthogonal Functions (EOF) technique, namely the Fractional Empirical Orthogonal Functions (fEOF), for dimensionality reduction. An application linked to the fractional diffusion equation is also proposed. Some background is provided in Sec. 2 and the mathematical details of the dimensionality reduction technique itself are presented in Sec. 3. Section 4 applies the derived formulas on synthetic low-dimensional random datasets, Sec. 5 further exposes some properties of fEOF and Sec. 6 discusses some potential applications in geophysics. Section 7 summarizes the work and further discusses some technical points.
As noted in the introduction of the manuscript, geophysical fluids are subject to anomalous diffusion, which translates to power laws of the form ~k-β with β≠2. Because of this, the usual EOF technique selects the largest-scale Fourier modes (and the explained variances are linked to the power spectrum of the field). The main idea of the manuscript is essentially to compute EOFs based on a modified covariance matrix Cα (eq. (15)), which is the usual covariance matrix left- and right-multiplied by a fractional power of the Laplacian, i.e. by L-α/2 for α>0 (the case α=0 is the usual EOF technique). As shown by the author, the spectrum of eigenvalues of Cα can be made steeper by increasing α, making the largest eigenvalues even larger with respect to the small ones (see also below).
I think there is a major flaw in the technique proposed in the manuscript, and I therefore recommend not publishing this manuscript.
Main comment:
The eigenvalues of Cα are interpreted as the variances explained by the associated eigenvectors (cf. M95 number in Table 1), leading to the conclusion that the eigenvectors of Cα are more efficient to represent the field x. This is not true, the explained variances of the field x are the eigenvalues of the usual covariance matrix. Actually, the author has not shown or argued that this generalization of the covariance matrix and the associated eigenvalues are interesting objects for dimensionality reduction. It is not because the spectrum of eigenvalues of Cα is steeper for α>0 that it is meaningful to base a dimensionality reduction method on it.
Further intuition about the technique proposed by the author is gained when understanding the properties of C, Cα and L-α/2 in a Fourier basis. It is indeed expected in the geosciences community that eigenvectors of the covariance matrix are close to Fourier modes. From what I found in the literature, eigenvectors are actually exactly Fourier modes if the field is periodic and spatially homogeneous (see note at the end of the comment). The L-α/2 operator multiplies each Fourier mode by the norm of its wavevector to the power -α (eq. (10)), so L-α/2 is also diagonal in a basis of Fourier modes. C and L-α/2 commute as matrices, so that Cα = L-αC and C and Cα have the same eigenvectors (which are Fourier modes):
Cαϕ = L-αCϕ = λL-αϕ = λk-2αϕ
where
- k is the norm of the associated wavevector ϕ
- λ is the eigenvalue of ϕ with respect to C (and is related to the power spectrum of the field evaluated at the wavevector of ϕ, see line 193).
The eigenvalues of Cα are λk-2α, consistently with eq. (17), and the eigenvectors are the same as those of C, i.e. Fourier modes (this is supported by Fig. 5 of the manuscript). Since C and Cα have the same eigenvectors, why would the EOF technique based on Cα be more efficient to represent the field x ? (I personally think that there is no useful information in Cα and its eigenvalues which is not already present in C and its eigenvalues.)
The proposed technique can be seen yet in another way: the fractional covariance matrix Cα computed from x is equal to the usual covariance matrix C (=C0) computed with the field L-α/2x, so that the eigenvalues of Cα are actually the explained variances of the usual EOFs of L-α/2x. Thinking in a Fourier basis, the field L-α/2x is the original field where each Fourier mode has been weighted by k-α. Multiplying by k-α has the effect to reduce the contribution to x of small scales with respect to the large scales (for α>0), and thus steepens the power spectrum (or equivalently, the spectrum of eigenvalues of the covariance). In these terms, my critique of the technique is that x and L-α/2x are simply different (but related) fields, with different (but related) properties. Why would we base a dimensionality reduction technique on L-α/2x ? It is not because L-α/2x has better properties than x that it is meaningful to base the dimensionality reduction on it.
Conclusion of the main comment:
I apologize if I missed the point of the proposed dimensionality reduction method, or any argument supporting it, but I currently do not see the relevance of the proposed for dimensionality reduction. Therefore, I recommend not publishing this manuscript. The author needs to motivate (with either heuristic or formal arguments) the consideration of eigenvalues of Cα for dimensionality reduction or, equivalently, to motivate the consideration of the covariance of L-α/2x to reduce the dimension of x.
I actually see two points which, according to me, totally prevent any use of the proposed dimensionality reduction technique:
- The parameter α is totally unconstrained, so the spectrum of eigenvalues of Cα can be made arbitrarily steep, and M95 (Table 1) can made equal to 1 if α is increased sufficiently. Any field could then be represented with 1 mode. I think that the high level of arbitrariness here shows that the proposed technique does not provide a more efficient representation of the physical processes governing the considered field.
- EOF/PCA is known to be the most efficient linear dimensionality reduction method in terms of the Mean Square Error (L2 norm), while the proposed method here is indeed linear and using the L2 norm (since it uses explained variances). If another technique is more efficient, it will either be in terms of another metric, or the method itself will be nonlinear.
Other major comments:
The space-time fractional diffusion equation is also mentioned in the manuscript. However, it is not clear to me what is the purpose of introducing this equation in this context. Is it to propose a prognostic model for geophysical fields ? The fractional diffusion equation has already been used in geophysical applications, by Kavvas et al. (2020) for example. What is the difference between the latter work and the application proposed in this manuscript ?
The caption of Fig. 5 claims that the eigenvectors of Cα are different from the usual EOFs. I think however that they are the same (see computation above). Only small differences are visible in Fig. 5, which would be due to the finite number of samples used for numerical computations (as much as there is a ~5% accuracy on the measured steepening of the eigenvalues). A more detailed comparison is needed to support the method.
I also think that the current organization of the manuscript is too much oriented on mathematics. Section 2.2 in particular feels like a textbook introduction on fractional calculus, introducing equations which are not used in the following (eqs. (6), (11) and (13), and it is not clear to me whether eqs. (7), (8) and (9) are useful later on). To be the most accessible to the geoscientific community, I think that fractional operators should be introduced by working definitions instead of analytical equations. Equations (10) and (14) fill this role for the fractional Laplacian, but it is for example not explained how to compute fractional time derivatives of time-gridded fields.
To further connect with the works in the geoscientific community, the techniques proposed in this manuscript should be actually applied to real datasets (Sec. 6 only suggests geophysical applications). Such a work would show how the proposed fractional EOFs encode the physics more efficiently.
Note on the link between the eigenvectors of the covariance matrix and Fourier modes:
Eigenvectors of the covariance matrix are Fourier modes if the field is periodic and spatially homogeneous. For vectors (1D), this is because the covariance matrix is a circulant matrix, for which eigenvectors are Fourier modes (Gray (2006), see also Senn et al. (2026)). When computing eigenmodes of d-dimensional fields, the d-dimensional arrays are flattened to vectors. The covariance matrix is in this case block-circulant with circulant blocks, and the eigenvectors are d-dimensional Fourier modes when reshaped back in the original shape (Henriques et al., 2013).
References:
Gray RM (2006), "Toeplitz and Circulant Matrices: A Review". Foundations and Trends in Communications and Information Theory, Vol. 2 No. 3 pp. 155–239, doi:https://doi.org/10.1561/0100000006
Henriques, J. F., Carreira, J., Caseiro, R., & Batista, J. (2013). Beyond hard negative mining: Efficient detector learning via block-circulant decomposition. In proceedings of the IEEE International Conference on Computer Vision (pp. 2760-2767).
Kavvas, M. L., Tu, T., Ercan, A., and Polsinelli, J.: Fractional governing equations of transient groundwater flow in unconfined aquifers with multi-fractional dimensions in fractional time, Earth Syst. Dynam., 11, 1–12, https://doi.org/10.5194/esd-11-1-2020, 2020
Senn, G., Tjelmeland, H., Glatt-Holtz, N., Walker, M., & Holbrook, A. (2026). Bayesian Semi-Blind Deconvolution at Scale. arXiv preprint arXiv:2601.09677
Citation: https://doi.org/10.5194/egusphere-2026-753-RC2 - AC2: 'Reply on RC2', Farrukh Chishtie, 15 Jun 2026
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 570 | 240 | 60 | 870 | 73 | 65 |
- HTML: 570
- PDF: 240
- XML: 60
- Total: 870
- BibTeX: 73
- EndNote: 65
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
General comment:
The paper contributes to an extensively used method (EOF) in numerous disciplines related to geophysics. The classical method relies on Gaussian statistics and exponentially decaying correlations, which are known limiting factors. In that optic, the use of fractional derivation to introduce some global behavior is an interesting idea.
The development of the theory from Riemann–Liouville derivation is theoretically sound, clear, and really to the point for applications. The limitations of the method appear clearly, and the results are well highlighted.
The paper remains general while also incorporating ideas from the authors’ expertise (flood/risk assessment?), which provide some applications that are enjoyable to read even for people outside of the field. Part 6, for example, gives cases where the application of FEOF over EOF might be valuable.
I personally enjoyed the paper; however, my main remark is that I find the order somewhat confusing at times:
Specific comments: