the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Spatio-temporal pattern analysis of MOPITT total column CO using varimax rotation and singular spectrum analysis
Abstract. In this paper, we apply varimax empirical orthogonal function (EOF) analysis to MOPITT total column CO retrievals to answer the question of whether or not it is possible to disentangle the dominant CO sources associated with inferred modes of variability at regional to global scales. Additionally, we write this manuscript with the intention to highlight the strengths and weaknesses of EOF analysis, specifically their usage in the field of atmospheric chemistry. We analyzed daily MOPITT Version 8 Level 3 joint (TIR-NIR) products from 2005 to 2018, aggregated every 8 days on a 1° by 1° grid. Our findings show that EOF patterns of MOPITT CO are consistent across various regional subdomains, demonstrating that these spatial patterns are independent of the chosen domain. A comparison of the eigenvalue spectrum reveals that unrotated EOF analysis yields three distinct modes, while varimax rotation reduces these to two. The power spectra of the principal components indicate that the first two unrotated modes are primarily driven by annual and semi-annual cycles, while the third mode reflects seasonal variations occurring over roughly three months. To further isolate these modes, we employed singular spectrum analysis (SSA) at each grid point to generate long-term, seasonal, and residual EOF patterns. The power spectrum analysis of the principal components shows that the long-term EOFs replicate the original two dominant modes, while the seasonal EOFs reveal significant variations over 2 to 3 months, and the residual modes exhibit time scales of 2 months or shorter. By plotting the mean skewness field we show the dataset is non-Gaussian, leading us to conclude each principal component is time-dependent despite being uncorrelated. The periodic decay observed in the temporal auto-correlation function for each time series suggests a classification of wide-sense cyclostationary or wide-sense polycyclostationary behavior. We find the non-stationarity of each time series together with the temporal dependence of modes leads us to conclude that EOF analysis alone cannot fully disentangle individual CO sources. Consequently, we recommend exploring cyclostationary EOF analysis in future studies. These limitations must be carefully considered when interpreting EOF patterns of other composition datasets with similar characteristics.
- Preprint
(3016 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2024-3440', Anonymous Referee #1, 11 Jun 2025
Interactive comment on “Spatio-temporal pattern analysis of MOPITT total column CO using varimax rotation and singular spectrum analysis” by John McKinnon et al.
General Comments:
In this study, McKinnon et al. analyze the total columns of CO retrievals from MOPITT by applying varimax empirical orthogonal function (EOF). The study evaluates if the dominant CO sources associated with inferred modes of the variability at the regional to global scales can be disentangled, and identifies the strengths and weaknesses of the EOF analysis. This paper is well-structured, informative, and highlights important science questions, but will need to be strengthened with more references throughout the manuscript (especially in the Introduction section), in addition to addressing some information gaps, and making some figure-related and technical changes before it can be accepted. This manuscript also has sentences that would need to be broken down for simplicity, along with the numbering of all equations used, for clarity, and would need to be scrubbed for missing punctuation/commas/grammar.
Specific Comments:
- L1 – L3 Please state the objective more clearly and define MOPITT.
- In general, the manuscript would need to be supported throughout with more references, particularly in the Introduction section, such as L27, L59, L63, L83, L1234-127 (for TES, AIRS, IASI, TROPOMI, and CrIS), L133, L152, L171, L409 etc.
- L149 – Please define NH and SH.
- L152 – Please split the sentence after the term “burning”.
- L172 – “measurable in breath” – This is unclear.
- L207 – “ENSO … AAO” Please expand each acronym and add citations.
- “Southern Hemisphere” and “SH” (and “Northern Hemisphere” and “NH”) have been used interchangeably throughout the manuscript. Please define at the first occurrence and use the abbreviated form thorough the rest of the paper.
- Section 2/Methods lacks enough citations.
- L213 - Please provide a brief description of the term “EOF” before advancing to Section 2.4.1.
- Please add equation numbers or reference numbers for all matrices and equations shown in the paper for the ease of reference/discussion in the manuscript.
- L228 – Can the “F” matrix be identified differently to not confuse it with the original F matrix?
- Please explain the “SVD” acronym in L253.
- L259 – How is this approach different from the one shown in the manuscript?
- L267 – Please define “PCA”.
- L268 – Please explain the motivation behind taking this approach.
- L271 – Please start a new sentence.
- L272 – What are the potential drawbacks/ uncertainties associated with this approach?
- L276 – Please remove the period next to where N* is defined.
- L277 – “…not accurate for the case of Non-Gaussian red noise”. Please explain why.
- L280 – 282 – Is there a way to identify which is the case for sure?
- L285 – How do these impact the results? What are the introduced uncertainties and can they be quantified?
- L324 – Please start a new sentence.
- L358 – Please split the sentence from “for instance…”.
- L367 - 394 – Please explain this part in points for clarity.
- L395 – Please start a new sentence from “however…”.
- L424 – Please write the equation outside of the paragraph for clarity.
- L459 – “leads us to believe there …” This may be speculative. Please consider changing this to “suggests the possibility of”.
- L462 – What could be the other possible causes?
Figures: (Each number below identifies the figure number.)
- Please change the y axis magnitude to “molecules/cm2”.
- Please state the value (molecules/cm2?) shown by the colorbar next to it.
- Please add subplot numbers/names such as a), b), etc., instead of identifying them as “top left”, “top right”, etc. The values in the figures could be grid-binned, and shown using a consistent colorbar (ranging from -0.03 to 0.015?) for the bottom left and the two figures on the right. Please state the name of the value/variable/unit next to each colorbar.
- Please include subplot names/numbers, as listed for Figure 3. L451 – Can the data be grid-binned for better visualization?
- Please consider using brighter colors. The black and blue solid lines blend into each other at some points.
- Please add subplot names/numbers, grid-bin the data, use a consistent colorbar where possible, and list the name of the variable being identified by the colorbar next to it, same as the comments for Figure 3.
- Please add subplot numbers/names.
- Please add subplot numbers/names.
- Please grid-bin the data if possible, and list the name of the variable being identified by the colorbar next to it.
- Please add subplot numbers/names.
Technical Comments:
- L3 – Please change “Additionally, ……intention” to “This study aims to”.
- L27 – Ozone has been defined as “O3”, but the term “Ozone” has been used throughout the paper (L28, L39, etc.). Please change this to O3. This is also true for CO and other species.
- There are formatting errors with the citations in the paper. Please correct these. E.g., L36, L37 (missing parenthesis), L47, L49, etc. L72 – Are the citations arranged alphabetically? L77 – Please add a comma between the two citations. L123-124 – Please check the citation format.
- L262 – “mth” Please use superscript “mth”.
- L85 – Please remove “we are interested”. Please change “include the following” to “are the following”.
- L89 – “will use” --> use
- L91 – Please remove “next section on Methods”, and please “section” after “Methods”.
- Please check and add commas where necessary in L67, L93, etc.
- L95 – performs --> performed, verifies --> verified
- L97 – Please remove “current”, and please change “will be on” to “is”.
- L140 – “so that we” --> ” to”, “be able to study” --> “study”
- L186 – “has shown” --> “showed”
- L215 – Are the citations listed alphabetically or in the order of publication year?
- L240 – “,The” --> “,the”
- L293 – “very immediately” --> “an immediate”
- L349 – “plain English” --> “simpler terms”, “is named because” --> “is named so, as”
- L484 – “we have said” --> “discussed”
- L524 – Please remove the term “First”.
- L531- “Because” --> “Since”
Citation: https://doi.org/10.5194/egusphere-2024-3440-RC1 -
AC1: 'Reply on RC1', John McKinnon, 04 Aug 2025
Author’s Response to Referee 1 Commentary
The authors acknowledge the lack of references in the introduction as well as in the methods section and agree to add in numerous citations throughout the manuscript to address these shortcomings. We ensure that all our quantitative statements, or observations which carry significant importance which are not our own have been cited throughout our analysis. We add discussion (with citations) to address all specific comments which point out information gaps. We address all specific comments which have scientific questions that are intended to facilitate discussion and add citations as appropriate. The authors agree to fix all formatting issues with existing citations, and to add equation numbers throughout the manuscript. We break down statements throughout the manuscript in cases where a statement is a run on sentence, is unclear, or requires additional explanation. We also address all other grammatical and technical issues which have been pointed out. The authors would like to thank Referee 1 for giving feedback that is extremely clear and easy to follow. Referee 1’s comments about our gaps in literature highlight an important weakness in our manuscript and we believe our motivation will be much stronger by addressing them. We especially thank Referee 1 for asking us questions regarding uncertainties and the assumptions used in our approach as we believe this will address a crucial gap in our data analysis.
General Changes Throughout Manuscript
- Equation numbers have been added to all existing equations, and matrices are defined using new equations on separate lines for ease of reference and discussion.
- Many new citations are being added throughout the introduction, as well as in the methods section.
- Acronyms for Northern Hemisphere, Southern Hemisphere, and different species are defined at their first occurrence and then referred only by their acronym from then on.
- Citation formatting errors are being double checked against AMT standard and corrected accordingly.
- Grammatical and technical fixes are being implemented throughout in accordance with reviewer recommendations. Some sentences have been split or broken into simpler statements where the authors were requested, to aid with the flow and clarity of language.
- A new Section 2.4.5 in methods is being added to very briefly mention advanced methods of EOF analysis including cyclostationary EOFs so that our usage of them in results has better continuity and is not jarring.
Changes Addressing “Specific Comments” with References to Line/Page Numbers
The following changes are implemented to address the “specific comments” from Referee 1 with line/page references. Please note that the line numbers refer to the original version of the manuscript
Lines 1-3 Pg 1
We define MOPITT and more clearly describe the objectives of the study.
Line 27 Pg 2
Citations added to describe how the concentration level of NOx affects ozone production.
Line 59 Pg 2-3
Citations added to emphasize retrieval inaccuracies and the lack of monitoring stations across Africa.
Line 63 Pg 3
Citations added to bolster arguments that changes in local trends are difficult to separate from background trends.
Line 83 Pg 3
Citations added to emphasize that the separation of secondary CO from surface emission aids source attribution and inverse modeling studies.
Lines 124-127 Pg 4-5
Citations added for other satellite instruments (TES, AIRS, IASI, TROPOMI, and CrIS)
Line 133 Pg 5
Citations added regarding potential discontinuities of IASI-A retrievals, the smaller number of measurements for TES, and the cloud clearing algorithm used by AIRS.
Line 152 Pg 6
Citations added to emphasize high contribution of anthropogenic emissions from China due to coal reliance and biofuel burning.
Line 171 Pg 7
Discussion added with citations to emphasize the relative contribution to CO due to oxidation from CH4 as compared to the oxidation from NMHCs. Citations added regarding the temperature dependence of isoprene and terpenes. Discussion added with citations to clarify how the large annual contribution and high reactivity of isoprene makes it the most dominant hydrocarbon emission. The term “measurable in breath” has been removed and clarified (with citations) by describing the biological processes in animals where we can measure VOC emission.
Line 207 Pg 8
Acronyms for all 4 climate indices are expanded, with references included for each.
Line 213 Pg 8
We clarify the term “EOF” and add a brief description.
Line 228 Pg 9
We redefine the scaled matrix F’ to be the matrix J to avoid confusion with the original F matrix.
Line 253 Pg 10
We clarify the term “SVD” and add a brief description
Line 267 Pg 10
We clarify the term “PCA” and add a brief description
Line 268 Pg 10
We discuss, with supporting citations, how our motivation for using asymptotic approximation is due to simplicity, unlike similar methods that rely on statistical analysis and resampling the dataset.
Line 272 Pg 10-11
We discuss, with supporting citations, the strengths and weaknesses of using an asymptotic expansion as compared with other methods.
Line 277 Pg 11
Significant discussion has been added, together with supporting citations, to explain why our estimation for the effective sample size is only valid for the case of red noise Gaussian processes where the true sample size N is very large.
Line 280-282 Pg 11
Discussion is added, with references, regarding tests that can be used to determine if eigenvalues are degenerate or potentially represent white noise.
Line 285 Pg 11
Discussion is added, with references, to better describe specifically how the list of drawbacks for EOF analysis affect results, and how best to mitigate them.
Line 409 Pg 16
Citations added to emphasize that one potential benefit of varimax rotation is to make spatial patterns simpler and easier to interpret.
Line 462 Pg 19
Discussion is added, with references, to bolster our argument for why we do not see a connection between Residual mode 1 and oceanic variability from oceanic cycles. We mention potential issues including dependance of modes, thermal gradients near the surface, and variability from other small scale oceanic processes.
Changes to Figures
Fig 1 (Referee #1)
We change the y axis magnitude to “molecules/cm2”.
Fig 2 (Referee #1)
Please label our colorbar with the proper units molecules/cm2).
Fig 3 (Referee #1)
We add subplot names to our figures, and grid bin our data using a consistent colorbar. We label our colorbar with proper units.
Fig 4 (Referee #1)
We add subplot names to our figures.
Fig 5 (Referee #1)
We alter our color scheme so that the lines do not blend in as much.
Fig 6 (Referee #1)
We add subplot names to our figures, and grid-bin them using a consistent colorbar. We label our colorbar with proper units.
Fig 7 (Referee #1)
We add subplot names to our figures.
Fig 8 (Referee #1)
We add subplot names to our figures.
Fig 9 (Referee #1)
We grid-bin the data and we label our colorbar with proper units.
Fig 10 (Referee #1)
We add subplot names to our figures.
-
RC2: 'Comment on egusphere-2024-3440', Anonymous Referee #2, 21 Jun 2025
The manuscript provides a general tutorial for EOF and singular spectrum analysis, with an application to 8-day aggregates of near-global (60S to 60N latitude) carbon monoxide columns retrieved from the MOPITT instrument from 2005-2018. The authors demonstrate consistency in EOF patterns derived across different domains to confirm that the patterns revealed are a robust property of the dataset. Varimax rotation suggests two dominant EOFs in contrast to three emerging from the raw analysis (annual, semi-annual and 3-month variations). Singular spectrum analysis is then applied to the principal components to identify different time scales of variations. The authors initially hypothesize that the spatiotemporal patterns will reflect individual CO sources but find that this is not the case, and instead recommend future exploration of cyclostationary EOF analysis.
General Comments:
- The tutorial-style sections may be more suited to a technical note. Given that the techniques themselves are described in textbooks such as Wilks, 2011 (note there is now a 4th edition published in 2019), is AMT a good fit for this work? Note that the NCAR Climate Data Guide provides some basic introduction, which may be worth pointing to as well: https://climatedataguide.ucar.edu/climate-tools/empirical-orthogonal-function-eof-analysis-and-rotated-eof-analysis
- A stronger case could be up front for why an EOF analysis is expected to reveal source-specific signals in total column CO, such as the anthropogenic and natural variability noted in Question 1, or in global background in Question 2. How do the patterns from sources contrast to other factors contributing to spatiotemporal variations, such as weather variability or OH? To what extent do spatiotemporal patterns overlap between fires and biogenic VOC contributions to CO?
- Do the authors have a recommendation as to whether to apply Varimax rotation?
- The manuscript would benefit from additional editing throughout to improve the flow and clarity of messaging. Given the interest in identifying patterns form specific sources, it would be helpful to have a table summarizing current understanding of CO emissions from specific sources.
Specific comments:
line 87. Need to define global background. Is it all non-anthropogenic CO, and if so, how is this different from the natural CO in question 1?
Lines 90-100. The discussion is a nice summary but misses a key early application to air quality in the U.S., including: Eder, B. K., J. M. Davis, and P. Bloomfield, A characterization of the spatiotemporal variability of non-urban ozone concentrations over the eastern United States, Atmos. Environ., 27, A(16), 2645–2668, 1993.
Line 136. Given the lifetime of CO, why is this resolution a problem? Wouldn’t daily data be better suited to identify daily variations such as those due to fires?
Line 141. How is the data de-trended? Linear fit? At each satellite pixel? Does this remove the signal from long-term trends in anthropogenic or other sources?
Figure 2. Do these patterns change over time?
Lines 185-188. This is an example of discussion in need of editing. A recent review includes discussion of OH sources and sinks: Fiore et al. (2024), Climate and Tropospheric Oxidizing Capacity, Annual Reviews of Earth and Planetary Sciences, 52:321-349, https://doi.org/10.1146/annurev-earth-032320-090307
Lines 250-251. The choice here depends on the goals of the analysis. Examples of why one is better suited would be helpful here.
Lines 403-405. Is the data also area-weighted? Some discussion as to why standardizing is preferred here as opposed to giving more weight to regions with larger sources and thus larger absolute variations would be useful.
Lines 441-442. Some discussion is needed as to the spatiotemporal variations in the sources in contrast to weather-driven variations.
Line 459. What is oceanic variability in CO?
Line 476. Why is there a ~3-year time scale when the lifetime of CO is on the order of a couple months?
Figure 9. What processes might drive the sharp gradients in skewness? It would be stronger to frame the findings in terms of any hypotheses as to what drives the variations in CO emerging from this analysis. For example, what drives the annual/semiannual cycle noted in lines 534-535?
Appendix A1. Line 589. Normalization procedures can vary by software package.
Citation: https://doi.org/10.5194/egusphere-2024-3440-RC2 -
AC2: 'Reply on RC2', John McKinnon, 04 Aug 2025
Author’s Response to Referee 2 Commentary
The authors acknowledge that many EOF tutorials already exist in the literature and as such agree to reduce the amount of technical detail in the methods portion of our manuscript, moving these aspects to the appendix instead. The authors add in a table to our introduction section on source variability with an accompanying discussion to help us summarize our current understanding of the variability from specific sources of CO. We add additional discussion in our introduction to make a better case up front for why we had expected EOF analysis to have better success in being able to separate sources into independent signals. We also add in our Results section 3.2 a clearer recommendation for why in our specific case, it was better to use our unrotated modes rather than to use our varimax modes. We agree to incorporate all changes which have been requested in the specific and technical comments. The authors would like to thank Referee 2 for asking many general questions regarding our manuscript including its structure, motivation, and our approach to data analysis. We especially appreciate Referee 2’s questions about area weighting in our data and our time scales, as we believe answering these questions will address a vital gap in our methods. We also appreciate that Referee 2 was kind enough to give us 2 specific citations to help us address gaps in our literature.
General Changes Throughout Manuscript
- We add a table with discussion to Section 2.2 of our introduction to aid in summarizing current understanding CO variability from specific sources and to understand what scales of time are most relevant.
- We add additional discussion in our introduction to make a better case up front for why we had expected EOF analysis to have success in being able to separate sources into independent signals.
- We reorganize our Methods section, putting the more technical details in our appendix to reduce the emphasis of our paper being a tutorial on EOF analysis.
Changes Addressing “Specific Comments” with References to Line/Page Numbers
The following changes are implemented to address the “specific comments” from Referee 2, with line/page references. For each change we indicate in parenthesis which referee is being addressed in our proposed changes. Please note that the line numbers refer to the original version of the manuscript.
Line 87 Pg. 3
We clarify that in our specific context the global background refers to a mathematical, non-physical quantity and simply refers to an average value.
Lines 90-100 Pg 3-4
We add discussion regarding the application of EOFs to air quality monitoring in the US. We use citations including Eder, B. et. al. (1993), who used EOFs to study spatiotemporal variability of non-urban ozone.
Line 136 Pg. 5
We add discussion that daily data would not be better suited because MOPITT does not have daily global coverage. We must ensure global coverage in the analysis because EOF analysis fails to produce patterns if there are gaps in the data.
Line 141 Pg. 5
Discussion is added regarding the fact that our data was detrended by taking the globally averaged time series and then applying a Fourier fit to it. Because the trends in the total column data are a non-linear combination of the trends from individual sources, it is not immediately clear what sources may have had their long-term trends removed.
Lines 185-188 Pg 7
We edit our discussion with appropriate citations including Fiore et al. (2024) to incorporate up-to-date information on the sinks and sources of OH.
Lines 250-251 Pg 9-10
We add discussion regarding the fact that for our specific application, it makes very little difference in whether we choose to normalize our covariance since we are only using one observation field. If we were to compare EOFs of different datasets, then we should normalize the covariance.
Results 3.1 Pg 15-16
We respond to a comment that initially refers to Figure 2, however because it refers to multiple patterns, we assume Referee 2 had intended to ask about how the EOF patterns may change over time in Figure 3. We clarify that the EOF patterns did not show significant changes when we segmented the data into yearly sets to compare them from year to year. It’s possible this could be because the data had long-term trends removed.
Lines 403-405 Pg 15-16
We clarify that our data is area weighted, and we include a new equation to indicate the weighting in Methods 2.1. We discuss, using citations, that this choice is to ensure that the EOFs are invariant to the choice of grid and each pixel contributes equally to the total variance.
Results 3.2 Pg 16-19
We add discussion to clarify that for our study, varimax rotation did not improve the representation of modes and that for further analysis with our seasonal decomposition we used our unrotated EOF modes. We note that decision should not be done in general for climate data. Our decision is because our unrotated modes were reproducible across multiple domains and had better representation with respect to variance.
Lines 441-442 Pg 17
We add discussion to indicate that we have no reason to believe the patterns fit the behavior of source variability *or* that of weather patterns either large scale or local scale.
Line 459 Pg 18-19
We add discussion in the introduction as well as to results Section 3.2 to describe sources of oceanic CO including photoproduction from colored dissolved organic matter (CDOM), direct production from phytoplankton, and dark production from CDOM in the absence of sunlight.
Line 476 Pg 20-21
We clarify, using citations, that the use of time scales for CO makes sense even when the scales are longer than its lifetime. Some variability of CO is driven by large scale weather patterns, for example mixing time is about 6 months and hemispheric exchanges can happen across 1-2 years. It also makes sense from a perspective of studying the impact of climate drivers
Line 534 Pg 25
We add discussion as to what physical processes drive the sharp gradients in skewness related to Figure 9. The annual cycle should be driven by seasonal changes in its sources and sinks and will differ depending on hemisphere and region. For example, in the northern hemisphere it’s primarily driven by changes in fossil fuel burning usage i.e. it’s largest in winter and smallest in summer because of changes in heating demands. In the southern hemisphere biomass plumes drive variability in central Africa and over part of Australia while biogenic and VOC emission mostly drives changes in South America. The large banding effect over latitude in the SH variability is probably because of a combination of regional transport as well as long term variability due to the North/South movement of the ITCZ which affects the strength of hemispheric exchanges through the Hadley circulation.
Appendix A1. Line 589. Pg 27
Here we clarify the reason for our normalization and mention other potential normalizations which could be used.
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
301 | 69 | 17 | 387 | 20 | 38 |
- HTML: 301
- PDF: 69
- XML: 17
- Total: 387
- BibTeX: 20
- EndNote: 38
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1