the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Global variability of high nutrient low chlorophyll regions using neural networks and wavelet coherence analysis
Abstract. We examine 20-years of monthly global ocean color data and modelling outputs of nutrients using self-organizing map analysis (SOM) to identify characteristic spatial and temporal patterns of High Nutrient Low Chlorophyll (HNLC) regions and their association with different climate modes. Analyzing the properties of the probability distribution function of the global nitrate to chlorophyll ratio (NO3:Chl), we estimate that NO3:Chl>17 (mmol NO3/mg Chl) is a good indicator of the distribution limit of this unproductive biome that extends over ~25 % of the ocean. Trends in satellite-derived surface chlorophyll (0.6±0.4 to 2±0.4 % yr-1) suggest that HNLC regions in polar and subpolar areas have experienced an increase in phytoplankton biomass over the last decades. However, much of this variation is produced by a foremost climate-driven transition occurring after the year 2010, which resulted in a reduction in the extension of polar HNLC regions and an increase in their productivity. Chlorophyll variations at HNLC regions respond to all three major climate variability signals (Sea Surface Temperature, SST; El Niño Southern Oscillation, ENSO; and Meridional Overturning Circulation, MOC) and their annual and semiannual variabilities are coherent with seasonal temperature variations. At larger scales, ENSO driven variability (2–4 yr) and decadal-scale processes of heat uptake and redistribution by ocean circulation influence the HNLC extension. Our results are indicative of the long-term changes in phytoplankton biomass and productivity in the ocean and suggest global coupling in the functioning of distant biogeochemical regions.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(1710 KB)
-
Supplement
(811 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1710 KB) - Metadata XML
-
Supplement
(811 KB) - BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
CC1: 'Comment on egusphere-2022-827', Yonggang Liu, 21 Sep 2022
Comments on “Global variability of high nutrient low chlorophyll regions using neural networks and wavelet coherence analysis” by Basterretxea et al.
It is encouraging to see the growing number of innovative Self-Organizing Map (SOM) applications in the ocean science community since its early applications in oceanography more than a decade ago (e.g., Liu and Weisberg, 2005; Liu et al., 2006). The SOM is a powerful machine learning technique that can be widely used in oceanographic research in the future. In order for readers to better understand this, it would be good to provide a brief literature review of SOM applications in oceanography, at least in Supplement part of the paper.
In the main text, it is important to put your SOM application in the context of similar or relevant SOM applications that have already been published in peer-reviewed journals. For example, the SOM has been used to identify the regions of sea level variability, and the characteristic time series further analyzed using a joint wavelet power spectral analysis (Liu et al., 2016). The current work extended in that direction by using coherent wavelet analysis to reveal relationship between the nitrate/chlorophyll and the climate indices. It is important to properly accredit previous relevant work.
Line 166, “For typical satellite datasets, the SOM can be applied to both space and time domains”. It would be good to add the following sentences to properly put the current analysis in the proper context of relevant research of SOM applications:
“By applying the SOM in the spatial domain, one can extract characteristic spatial patterns of the input data. If transposing the input data matrix and applying the SOM in the time domain, one can extract characteristic temporal patterns, i.e., the characteristic time series. Since each of these time series represents the temporal variability of a particular region, this method can be used to identify regions of differentiated variability on a map. The SOM, when applied to both space and time domains of the same data, provides a powerful tool for diagnosing ocean processes from such different perspectives. Thus, it was called "dual SOM" analysis by Liu et al. (2016). In this study, we only focus on the second type.”
Line 172, “patterns R1 to R5” should be changed to “regions R1 to R5”.
Line 198, it would be good to add a sentence as follows:
“Joint SOM-wavelet power spectral analysis was demonstrated by Liu et al. (2016) in the study of characteristic time series of sea level variations in different regions of Gulf of Mexico. Here in this study, we expand it further to joint SOM-wavelet coherence analysis.”
Wavelet power spectrum may be biased towards low frequency. The bias issue has been rectified by Liu et al. (2007). This bias rectification has been taken care of in some wavelet coherence analysis software packages, but it is not clear whether it is rectified in the version of software that was used in this analysis. It is important to clarify this in the main text (Line 210).
References are cited in the text (Dong et al., 2006, Dong and Sutton, 2007, Timmermann et al., 2007) but are not seen in the reference list.
References:
Liu, Y., X.S. Liang, and R.H. Weisberg (2007), Rectification of the bias in the wavelet power spectrum,Journal of Atmospheric and Oceanic Technology, 24(12), 2093-2102, https://doi.org/10.1175/2007JTECHO511.1
Liu, Y., and R.H. Weisberg (2005), Patterns of ocean current variability on the West Florida Shelf using the self-organizing map, Journal of Geophysical Research, 110, C06003, http://dx.doi.org/10.1029/2004JC002786.
Liu, Y., R.H. Weisberg, and C.N.K. Mooers (2006), Performance evaluation of the self-organizing map for feature extraction. Journal of Geophysical Research, 111, C05018, http://dx.doi.org/10.1029/2005jc003117.
Liu, Y., R.H., Weisberg, S. Vignudelli, and G.T. Mitchum (2016), Patterns of the Loop Current system and regions of sea surface height variability in the eastern Gulf of Mexico revealed by the self-organizing maps, Journal of Geophysical Research Oceans, 121, 2347-2366, http://dx.doi.org/10.1002/2015JC011493.
Citation: https://doi.org/10.5194/egusphere-2022-827-CC1 -
AC1: 'Reply on CC1', Gotzon Basterretxea, 16 Jan 2023
It is encouraging to see the growing number of innovative Self-Organizing Map (SOM) applications in the ocean science community since its early applications in oceanography more than a decade ago (e.g., Liu and Weisberg, 2005; Liu et al., 2006). The SOM is a powerful machine learning technique that can be widely used in oceanographic research in the future. In order for readers to better understand this, it would be good to provide a brief literature review of SOM applications in oceanography, at least in Supplement part of the paper.
>SOM methods have been used in the field of oceanography for some time. Although we have not strictly carried out a literature review, following the reviewer's recommendations, we have extended the technical aspects of the SOM both in the text and in the supplementary material.
In the main text, it is important to put your SOM application in the context of similar or relevant SOM applications that have already been published in peer-reviewed journals. For example, the SOM has been used to identify the regions of sea level variability, and the characteristic time series was further analyzed using a joint wavelet power spectral analysis (Liu et al., 2016). The current work extended in that direction by using coherent wavelet analysis to reveal the relationship between the nitrate/chlorophyll and the climate indices. It is important to properly accredit previous relevant work.
>We appreciate this suggestion. We have incorporated a reference to Liu et al. 2016 in the M&M section. In any case, these studies are also referenced in the supplementary material.
Line 166, “For typical satellite datasets, the SOM can be applied to both space and time domains”. It would be good to add the following sentences to properly put the current analysis in the proper context of relevant research of SOM applications: “By applying the SOM in the spatial domain, one can extract characteristic spatial patterns of the input data. If transposing the input data matrix and applying the SOM in the time domain, one can extract characteristic temporal patterns, i.e., the characteristic time series. Since each of these time series represents the temporal variability of a particular region, this method can be used to identify regions of differentiated variability on a map. The SOM, when applied to both space and time domains of the same data, provides a powerful tool for diagnosing ocean processes from such different perspectives. Thus, it was called "dual SOM" analysis by Liu et al. (2016). In this study, we only focus on the second type.”
> Changed following the reviewers’ suggestion
Line 172, “patterns R1 to R5” should be changed to “regions R1 to R5”.
> Thank you. It has been corrected.
Line 198, it would be good to add a sentence as follows: “Joint SOM-wavelet power spectral analysis was demonstrated by Liu et al. (2016) in the study of characteristic time series of sea level variations in different regions of Gulf of Mexico. Here in this study, we expand it further to joint SOM-wavelet coherence analysis.”
>The proposed sentence has been added.
Wavelet power spectrum may be biased towards low frequency. The bias issue has been rectified by Liu et al. (2007). This bias rectification has been taken care of in some wavelet coherence analysis software packages, but it is not clear whether it is rectified in the version of software that was used in this analysis. It is important to clarify this in the main text (Line 210).
>We use the matlab algorithm by Grinsted et. al (2004). However, we have introduced a comment on the low frequency bias reported by Liu et al. (2007).
References are cited in the text (Dong et al., 2006, Dong and Sutton, 2007, Timmermann et al., 2007) but are not seen in the reference list.
> All references have been reviewed
Citation: https://doi.org/10.5194/egusphere-2022-827-AC1
-
AC1: 'Reply on CC1', Gotzon Basterretxea, 16 Jan 2023
-
RC1: 'Comment on egusphere-2022-827', Anonymous Referee #1, 25 Oct 2022
This paper applies several novel statistical methods to assess the extent and variability of the ocean's three major "high-nutrient / low-chlorophyll" (HNLC) regions. Its novel contributions include articulating a new definition of HNLC in terms of the ratio NO3/Chl, and identifying new linkages between HNLC extent and some climate variability indices.
I think this is a good paper that is publishable with revisions. The English is mostly good although there are some quirks. However, I lean toward major rather than minor revision for this reason: some papers that combine Results and Discussion cry out for separation of the two, and this is one of them. I think it would be much stronger if it were rewritten in the standard I-M-R-D format. Start with: What did we learn and what is the evidence? (Results) and then: What does it mean in the context of the existing literature and potential future research? (Discussion). At present, results of this research are mixed up with speculation and literature review in a way that detracts from the paper's core messages and makes it difficult for the reader to identify what exactly the research conducted actually demonstrates. There are also passages in the Methods that I think more properly belong in the Discussion (e.g., 124-129).
I also do not think that the statistical methods are adequately explained. On 182 we have "Statistically significant trends were considered those exceeding the 95% confidence level." This would seem to be a straightforward significance test of simple linear regression. But even here some more detail is needed, e.g., what is the decorrelation time scale and therefore the effective sample size? (see e.g., www.sciencedirect.com/science/article/pii/B978012387782600003X) When we get to the CWT (which term appears only twice and is defined differently each time), we are simply told that "The thick black contour designates the 95% confidence level" (Figure 6 caption). The text says nothing about how the confidence level is calculated (see also Figure 7). It is stated in the Supplement that "Monte Carlo simulations based on two uniform white noise time series are used to determine the significance level", but more detail is required and this should be at least mentioned in the main text. Ultimately, the variation in the total HNLC extent is found to be on the order of 5% (446). How do we know this is significant and not just noise? The coherence across regions and with the climate indices suggests that it is, but the lack of clarity regarding methods, and the sometimes too-good-to-be true correlations (see next paragraph) detract from the presentation.
In 3.3 we have "The relationship observed in interannual variations in HNLC areas suggest a global scale coupling between the equator and the poles. Good inverse correlation (r=-0.99, n=20) is observed between the interannual variations in the extension of EEP and the SO, and a weaker thought significant relationship exists between the SNP and the EEP (325 r= -0.75, n=20). Therefore, as the extension of HNLC in polar regions contracts (biomass increases), the equatorial region expands and vice versa. All three regions exhibit a shift in their extension after 2010 (Fig. 5)." I find a correlation of -0.99 difficult to credit. But when I look at Figure 5 the SO and EEP interannual time-series do indeed look like mirror images of one another. But how is this possible? What physical process could account for it? This is the kind of result that readers will dismiss without more attention to detail. But the discussion following this result is vague and speculative. I think the authors need to work harder at explaining how such a tight coupling could exist and convincing the reader that it is not just an artifact of their analysis method.
In Figure 7 we see an abrupt decline in the MOC around 2010 and then recovery to a level around 17 Sv that is both lower and more stable than before 2010. The authors seem to attribute a great deal of influence on oceanographic processes in all of the HNLC regions to this apparent "regime shift" (e.g., 404-406, 412-414). But given all of the higher frequency variability that is present in both periods, are the means for before and after 2010 even significantly different? This seems like a question one would wish to ask before attributing far-reaching effects to this rather modest decline. Here again the mixing up of presentation of the results with discussion and speculation undermines the credibility of some of the claims made. I am particularly skeptical regarding claims that the MOC affects the extent of the SNP HNLC (406), and to a lesser degree the EEP one, and find the discussion of the underlying mechanisms to be quite speculative.
The "three major climate variability signals" (20) or "three main forcings" (95) of SST, ENSO, and MOC seems like a list that combines apples and oranges. In global mean SST, the biggest component of variability beyond the annual cycle is ENSO. So why does SST need to be included in this list? Anyway it appears that mean SST as an independent variable controlling HNLC extent is never actually discussed in the paper anyway; the cross-wavelet coherence results in Figure 7 are for ENSO and MOC. So why is it given such a prominent place in the Abstract and Introduction? This could confuse the reader about what their overall purpose is. (It is also probably an exaggeration to state that they "quantify the ... dynamic relationship between the observed Chl variability and three main forcings" (95). They do quantify the cross-wavelet coherence of NO3/Chl with the climate indices, but the discussion of the underlying physical processes is quite speculative.)
The model-data comparison for NO3 could be expanded on a bit, e.g., "we found good agreement between nitrate in situ data and model results (r2=0.98)" (123). I think there should be Supplemental figures or tables that show the space/time domain of these comparisons, and break them down a bit more by region. To reproduce the gross spatial pattern of surface NO3, or especially the surface-to-deep gradient, is a very weak test of model skill. If one throws together data from all depths and from HNLC and nutrient-depleted subtropical waters, of course you will get a strong correlation. If one looked only at e.g., surface concentrations in the SNP, one would get a very different result. How about including a Supplemental table that shows the correlation coefficients for the three major HNLC regions, for surface concentrations only?
Some details (Note that I have not listed here the numerous passages of Discussion that are vague or excessively speculative, but the authors should take note that there are many of these and try to trim them down (or shore them up with detail) as they restructure the paper overall.)
32 "and, therefore, of the withdrawal of atmospheric CO2" Withdrawal by what process on what time scale? HNLC regions per se do not affect atmospheric CO2, unless their extent is altered by changes in external supply of iron as suggested by Martin 1990 (see also 329)
53 "oligoelements" unnecessary jargon
60 "coarsely known aspects" not clear what this means
73 "it is arguable if these ephemeral systems share structural and functioning similitudes with the large HNLC regions" it is uncertain whether these ephemeral systems share structural and functional similarities with the large HNLC regions
82 "reporting a positive North Pacific Gyre Oscillation (NPGO) and nutrient correlation" reporting that surface nutrient concentration was correlated with the North Pacific Gyre Oscillation (NPGO)
89 change "nutrient outputs" to "nutrient concentrations"
112 "a good indicator to describe the value overall phytoplankton trend" a good indicator of the magnitude of the overall phytoplankton trend?
131 change "climatological indices" to "climate indices"
218 change "the pauperized subtropical gyres" to "low-latitude oceanic waters"
220-224 I think this discussion neglects Eastern Boundary Currents, which represent one of the largest areas of consistently high Chl + high NO3
225 delete "i.e."
231 delete "values"
243 "has remained elusive since ... requires coherent information" something missing here
245 I actually think Figure S2 could be in the main text. It would help the reader to understand what the authors are doing
246 change "corresponding" to "correspond"
252 change "therein" to "there"
260 add a comma after "ratios"
262 change "ice sheet" to "sea ice"
265 "exhibits a differentiated dynamic" I can't tell what this means.
268 "phenological variations" I don't think this term is useful or necessary here
269 "This region is also subjected to zonal variations" How about "This region has distinctive eastern and western regions"?
275 "in which ocean productivity ... importance of advection of Fe" something missing here
286 "trend robustness is provided by the coherence in the time series obtained using SOM" I can't tell what this means
290 "not exclusive of oceanic Fe-limited waters, since it has been also observed in the highly productive Patagonian shelf" Not clear what they are trying to say here. The Patagonian shelf is not oceanic and is not Fe-limited.
313 add ", respectively" after "100% in April and 70% in July"
321 "The extension of the HNLC region in the boreal winter is the boreal winter is 25%" ???
325 change "thought" to "though"
343 " As shown in Figures 6 a and b, the temporal variability of both the characteristic NO3:Chl ratios and SST at each region peaks at 12-month periodicity, being this seasonal modulation more intense and temporally consistent in the case of temperature at high latitudes and weaker in the equator." This is very poor scientific writing. The result being presented is rather mundane: the most obvious detectable periodicity is the annual cycle, and seasonality is stronger at higher latitudes. Please rewrite.
346 "transference from annual to semiannual periods since 2010" Not sure what the right word is here but I am fairly sure "transference" is not it. How about "display a semiannual mode, which accounts for a larger fraction of variance after 2010"?
349 change "in" to "on"
353 delete "value"
353 change "phytoplankton uptake" to "phytoplankton biomass"
354 "Semiannual cycles" I try to avoid referring to variability at periods other than annual as "cycles" (excepting Milankovitch frequencies of course, but this paper is concerned with subannual to decadal scales) (see also 355, 433)
368 change "Contrastingly" to "Conversely" or "By contrast"
396 change "phase out" to "out of phase"
396 "suggesting a meridional propagation of the MOC effect" vague
415 add a comma after "AMOC"
416 change "more unclear" to "less clear"
417 sea ice or glacier ice?
419 delete ", also based on remotely sensed Chl,"
424 change "this effect is unlikely" to "this effect is unlikely to be important at the time scales considered here"
442 "retrieved from the increasingly improved and longer and longer time series of remote sensing observations" retrieved from time series of remote sensing observations of increasing duration and quality
445 delete "through complex processes"
Figure 1 - the white contour lines are difficult to see in some places
Figure 2 - the red lines that indicate linear trends don't look like straight lines to me, but it's hard to tell
Figure 5 - what exactly the y axis represents is not clearly explained; the meaning of the different colored bars is fairly obvious but should still be stated
English/formatting
One quirk of English usage that appears over and over is using the word "extension" instead of "extent". There are 27 in total and I think "extent" is more appropriate in virtually every case. Another is using "at" in place of "in" a region, e.g., "at SO", "at the EEP". I would write "in the XXX" in all cases. (Interestingly, it used to be fairly common to use "at" wrt cities, as in "I attended the AGU meeting at San Francisco". But this fell out of common use a long time ago.)
There are numerous references missing from the reference list, e.g., Garnesson/Grarnesson et al., 2019 (spelling varies); Green et al., 2017; Ibanhez et al., 2017; Kumar et al., 1995; Martínez-García et al., 2009; Qui, 2002 (probably Qiu). This is NOT an exhaustive list. I have doubts about whether Takeda 2011 is a traceable reference (searching on the doi turned up only stale links).
The reference format is inconsistent in the sense that multiple references within a parenthesis are sometimes arranged alphabetically, sometimes chronologically.
Citation: https://doi.org/10.5194/egusphere-2022-827-RC1 -
AC2: 'Reply on RC1', Gotzon Basterretxea, 16 Jan 2023
We would like to sincerely acknowledge RC1 for the outstanding review of this manuscript. It is rare these days to receive such detailed and appropriate feedback. His/her comments have undoubtedly contributed to a significant improvement of the manuscript. We highly appreciate his/her dedication. In the attached document, we include detailed answers to the comments.
-
AC2: 'Reply on RC1', Gotzon Basterretxea, 16 Jan 2023
-
RC2: 'Comment on egusphere-2022-827', Anonymous Referee #2, 13 Nov 2022
This paper relies on the spatio-temporal variability of the HNLC regions identified by SOM over the three major ocean areas using satellite-derived chlorophyll-a and modeling outputs of nutrients. The authors have performed the NO3:Chl as an indicator of the distribution limit of HNLC. They have demonstrated the linkages between HNLC extent and some climate-driven factors and teleconnections.
As a first very general comment, I would say that this is a valuable case study that can be published with some major corrections. The authors presented a lot of data and analysis procedures that need accurate processing schemas and precise interpretation, which they handled well.
The introduction section is well written and presents an adequate understanding of the presented work. The methodologies are adequate, but need some improvements. Some supplementary materials may be inserted into the main text because of their essential investigations and frequent references to them (e.g., Fig. S2).
The spatio-temporal variability of HNCL/NO3:Chl should be presented more precisely and quantitatively. For example, maps of monthly climatology, charts and maps of inter-annual cycles, and spatial-temporal cross section such as Hovemoler chart may present valuable results.
Some of the findings of the results section have not been well demonstrated in this study (e.g., section 3.4.1 Influence of SST variations – the global power spectra of CWT are needed to explain the intra-annual and inter-annual cycles). Some contents of Figures need to be better framed/explained. Some words would be expressed in accurate form. e.g., it is not clear in many places in the text that the “wind” word is mean wind speed, or wind vectors, or wind stress, or wind components (zonal and meridional).
I think a discussion section is required to explain the performance and limitations of the presented methodologies and results, which are not seen in this manuscript.
Bioregionalization analysis was used in previous studies to classify the global oceans (e.g., Longhurst, A. (2007), Ecological Geography of the Sea, Academic Press, London). Could the author highlight the need to new ocean’s regionalization which are not accessible from available global regionalization? And why SOM and not the other classification methods such as k-means?
I think the findings of nutrient model are not presented well. Some additional information is required.
The Mixed Layer Depth (MLD) is one of the main oceanographic indicators that can be used for interpretation of nutrients and phytoplankton variabilities. Does the nutrient model consider the MLD? If yes, please indicate in the text. And if not, I think it is required to consider the global ocean MLD in your work.
Looking to be constructive, in addition to the overall comments above, which should be taken into account in a possible review, I would like to point out the following Remarks.
1- In the 2.1 Ocean color data:
The GlobColour data are presented in 25 km spatial resolution globally. The authors have mentioned that the composites have a 0.25º spatial resolution. We know that the spatial resolution of 25 km and 0.25° are not the same specially at higher latitudes. If the 0.25° is true, please explain the spatial interpolation methods. If not, please correct.
2- In the section: 2.2 Nitrate data
The authors have made some essential assumptions that need to be approved precisely. May be explain more in the Results.
3- Please provide more information about the setup of the SOM algorithm (which are available not in the text nor supplementary material), in particular about the initial configuration: linear or random initialization, sheet or toroidal network, etc. Which neighbor function (Gaussian, Ep, et) is used to update the neighbors of the excited neuron (BMU) after each iteration during the training process.? Did the authors check the sensitivity of the SOM pattern to linear and random initialization?
4- The output of the SOM is also not well defined beyond being a map or topology; for example, how are the errors computed? The number of neurons is chosen not only depending on the topological errors (or topographic errors), but also on quantization errors. How different are the patterns when the number of neurons is, for example, 9?
5- Figure 6 and 7. The arrows are too small.
6- I recommend to show the time-series anomaly of HNLC and teleconnections at the top of the Fig. 6 and indicate the significant inter-annual cycles.
7- It is hard for readers to infer the shift after 2010 (Fig. 5). I think more visualization/explanations of data are required.
8- There are some abruptions in the significant annual cycles (1.5 year, from year 2006 to 2010) in the Fig. 7 which need to be explained. I suggest perform the global spectral cycles graphs.
9- The authors have considered SST and the teleconnections as a factor controlling HNLC variability. Are there any environmental factors such as precipitation and wind stress that may affect HNLC variability? Please discuss.
- AC3: 'Reply on RC2', Gotzon Basterretxea, 16 Jan 2023
Interactive discussion
Status: closed
-
CC1: 'Comment on egusphere-2022-827', Yonggang Liu, 21 Sep 2022
Comments on “Global variability of high nutrient low chlorophyll regions using neural networks and wavelet coherence analysis” by Basterretxea et al.
It is encouraging to see the growing number of innovative Self-Organizing Map (SOM) applications in the ocean science community since its early applications in oceanography more than a decade ago (e.g., Liu and Weisberg, 2005; Liu et al., 2006). The SOM is a powerful machine learning technique that can be widely used in oceanographic research in the future. In order for readers to better understand this, it would be good to provide a brief literature review of SOM applications in oceanography, at least in Supplement part of the paper.
In the main text, it is important to put your SOM application in the context of similar or relevant SOM applications that have already been published in peer-reviewed journals. For example, the SOM has been used to identify the regions of sea level variability, and the characteristic time series further analyzed using a joint wavelet power spectral analysis (Liu et al., 2016). The current work extended in that direction by using coherent wavelet analysis to reveal relationship between the nitrate/chlorophyll and the climate indices. It is important to properly accredit previous relevant work.
Line 166, “For typical satellite datasets, the SOM can be applied to both space and time domains”. It would be good to add the following sentences to properly put the current analysis in the proper context of relevant research of SOM applications:
“By applying the SOM in the spatial domain, one can extract characteristic spatial patterns of the input data. If transposing the input data matrix and applying the SOM in the time domain, one can extract characteristic temporal patterns, i.e., the characteristic time series. Since each of these time series represents the temporal variability of a particular region, this method can be used to identify regions of differentiated variability on a map. The SOM, when applied to both space and time domains of the same data, provides a powerful tool for diagnosing ocean processes from such different perspectives. Thus, it was called "dual SOM" analysis by Liu et al. (2016). In this study, we only focus on the second type.”
Line 172, “patterns R1 to R5” should be changed to “regions R1 to R5”.
Line 198, it would be good to add a sentence as follows:
“Joint SOM-wavelet power spectral analysis was demonstrated by Liu et al. (2016) in the study of characteristic time series of sea level variations in different regions of Gulf of Mexico. Here in this study, we expand it further to joint SOM-wavelet coherence analysis.”
Wavelet power spectrum may be biased towards low frequency. The bias issue has been rectified by Liu et al. (2007). This bias rectification has been taken care of in some wavelet coherence analysis software packages, but it is not clear whether it is rectified in the version of software that was used in this analysis. It is important to clarify this in the main text (Line 210).
References are cited in the text (Dong et al., 2006, Dong and Sutton, 2007, Timmermann et al., 2007) but are not seen in the reference list.
References:
Liu, Y., X.S. Liang, and R.H. Weisberg (2007), Rectification of the bias in the wavelet power spectrum,Journal of Atmospheric and Oceanic Technology, 24(12), 2093-2102, https://doi.org/10.1175/2007JTECHO511.1
Liu, Y., and R.H. Weisberg (2005), Patterns of ocean current variability on the West Florida Shelf using the self-organizing map, Journal of Geophysical Research, 110, C06003, http://dx.doi.org/10.1029/2004JC002786.
Liu, Y., R.H. Weisberg, and C.N.K. Mooers (2006), Performance evaluation of the self-organizing map for feature extraction. Journal of Geophysical Research, 111, C05018, http://dx.doi.org/10.1029/2005jc003117.
Liu, Y., R.H., Weisberg, S. Vignudelli, and G.T. Mitchum (2016), Patterns of the Loop Current system and regions of sea surface height variability in the eastern Gulf of Mexico revealed by the self-organizing maps, Journal of Geophysical Research Oceans, 121, 2347-2366, http://dx.doi.org/10.1002/2015JC011493.
Citation: https://doi.org/10.5194/egusphere-2022-827-CC1 -
AC1: 'Reply on CC1', Gotzon Basterretxea, 16 Jan 2023
It is encouraging to see the growing number of innovative Self-Organizing Map (SOM) applications in the ocean science community since its early applications in oceanography more than a decade ago (e.g., Liu and Weisberg, 2005; Liu et al., 2006). The SOM is a powerful machine learning technique that can be widely used in oceanographic research in the future. In order for readers to better understand this, it would be good to provide a brief literature review of SOM applications in oceanography, at least in Supplement part of the paper.
>SOM methods have been used in the field of oceanography for some time. Although we have not strictly carried out a literature review, following the reviewer's recommendations, we have extended the technical aspects of the SOM both in the text and in the supplementary material.
In the main text, it is important to put your SOM application in the context of similar or relevant SOM applications that have already been published in peer-reviewed journals. For example, the SOM has been used to identify the regions of sea level variability, and the characteristic time series was further analyzed using a joint wavelet power spectral analysis (Liu et al., 2016). The current work extended in that direction by using coherent wavelet analysis to reveal the relationship between the nitrate/chlorophyll and the climate indices. It is important to properly accredit previous relevant work.
>We appreciate this suggestion. We have incorporated a reference to Liu et al. 2016 in the M&M section. In any case, these studies are also referenced in the supplementary material.
Line 166, “For typical satellite datasets, the SOM can be applied to both space and time domains”. It would be good to add the following sentences to properly put the current analysis in the proper context of relevant research of SOM applications: “By applying the SOM in the spatial domain, one can extract characteristic spatial patterns of the input data. If transposing the input data matrix and applying the SOM in the time domain, one can extract characteristic temporal patterns, i.e., the characteristic time series. Since each of these time series represents the temporal variability of a particular region, this method can be used to identify regions of differentiated variability on a map. The SOM, when applied to both space and time domains of the same data, provides a powerful tool for diagnosing ocean processes from such different perspectives. Thus, it was called "dual SOM" analysis by Liu et al. (2016). In this study, we only focus on the second type.”
> Changed following the reviewers’ suggestion
Line 172, “patterns R1 to R5” should be changed to “regions R1 to R5”.
> Thank you. It has been corrected.
Line 198, it would be good to add a sentence as follows: “Joint SOM-wavelet power spectral analysis was demonstrated by Liu et al. (2016) in the study of characteristic time series of sea level variations in different regions of Gulf of Mexico. Here in this study, we expand it further to joint SOM-wavelet coherence analysis.”
>The proposed sentence has been added.
Wavelet power spectrum may be biased towards low frequency. The bias issue has been rectified by Liu et al. (2007). This bias rectification has been taken care of in some wavelet coherence analysis software packages, but it is not clear whether it is rectified in the version of software that was used in this analysis. It is important to clarify this in the main text (Line 210).
>We use the matlab algorithm by Grinsted et. al (2004). However, we have introduced a comment on the low frequency bias reported by Liu et al. (2007).
References are cited in the text (Dong et al., 2006, Dong and Sutton, 2007, Timmermann et al., 2007) but are not seen in the reference list.
> All references have been reviewed
Citation: https://doi.org/10.5194/egusphere-2022-827-AC1
-
AC1: 'Reply on CC1', Gotzon Basterretxea, 16 Jan 2023
-
RC1: 'Comment on egusphere-2022-827', Anonymous Referee #1, 25 Oct 2022
This paper applies several novel statistical methods to assess the extent and variability of the ocean's three major "high-nutrient / low-chlorophyll" (HNLC) regions. Its novel contributions include articulating a new definition of HNLC in terms of the ratio NO3/Chl, and identifying new linkages between HNLC extent and some climate variability indices.
I think this is a good paper that is publishable with revisions. The English is mostly good although there are some quirks. However, I lean toward major rather than minor revision for this reason: some papers that combine Results and Discussion cry out for separation of the two, and this is one of them. I think it would be much stronger if it were rewritten in the standard I-M-R-D format. Start with: What did we learn and what is the evidence? (Results) and then: What does it mean in the context of the existing literature and potential future research? (Discussion). At present, results of this research are mixed up with speculation and literature review in a way that detracts from the paper's core messages and makes it difficult for the reader to identify what exactly the research conducted actually demonstrates. There are also passages in the Methods that I think more properly belong in the Discussion (e.g., 124-129).
I also do not think that the statistical methods are adequately explained. On 182 we have "Statistically significant trends were considered those exceeding the 95% confidence level." This would seem to be a straightforward significance test of simple linear regression. But even here some more detail is needed, e.g., what is the decorrelation time scale and therefore the effective sample size? (see e.g., www.sciencedirect.com/science/article/pii/B978012387782600003X) When we get to the CWT (which term appears only twice and is defined differently each time), we are simply told that "The thick black contour designates the 95% confidence level" (Figure 6 caption). The text says nothing about how the confidence level is calculated (see also Figure 7). It is stated in the Supplement that "Monte Carlo simulations based on two uniform white noise time series are used to determine the significance level", but more detail is required and this should be at least mentioned in the main text. Ultimately, the variation in the total HNLC extent is found to be on the order of 5% (446). How do we know this is significant and not just noise? The coherence across regions and with the climate indices suggests that it is, but the lack of clarity regarding methods, and the sometimes too-good-to-be true correlations (see next paragraph) detract from the presentation.
In 3.3 we have "The relationship observed in interannual variations in HNLC areas suggest a global scale coupling between the equator and the poles. Good inverse correlation (r=-0.99, n=20) is observed between the interannual variations in the extension of EEP and the SO, and a weaker thought significant relationship exists between the SNP and the EEP (325 r= -0.75, n=20). Therefore, as the extension of HNLC in polar regions contracts (biomass increases), the equatorial region expands and vice versa. All three regions exhibit a shift in their extension after 2010 (Fig. 5)." I find a correlation of -0.99 difficult to credit. But when I look at Figure 5 the SO and EEP interannual time-series do indeed look like mirror images of one another. But how is this possible? What physical process could account for it? This is the kind of result that readers will dismiss without more attention to detail. But the discussion following this result is vague and speculative. I think the authors need to work harder at explaining how such a tight coupling could exist and convincing the reader that it is not just an artifact of their analysis method.
In Figure 7 we see an abrupt decline in the MOC around 2010 and then recovery to a level around 17 Sv that is both lower and more stable than before 2010. The authors seem to attribute a great deal of influence on oceanographic processes in all of the HNLC regions to this apparent "regime shift" (e.g., 404-406, 412-414). But given all of the higher frequency variability that is present in both periods, are the means for before and after 2010 even significantly different? This seems like a question one would wish to ask before attributing far-reaching effects to this rather modest decline. Here again the mixing up of presentation of the results with discussion and speculation undermines the credibility of some of the claims made. I am particularly skeptical regarding claims that the MOC affects the extent of the SNP HNLC (406), and to a lesser degree the EEP one, and find the discussion of the underlying mechanisms to be quite speculative.
The "three major climate variability signals" (20) or "three main forcings" (95) of SST, ENSO, and MOC seems like a list that combines apples and oranges. In global mean SST, the biggest component of variability beyond the annual cycle is ENSO. So why does SST need to be included in this list? Anyway it appears that mean SST as an independent variable controlling HNLC extent is never actually discussed in the paper anyway; the cross-wavelet coherence results in Figure 7 are for ENSO and MOC. So why is it given such a prominent place in the Abstract and Introduction? This could confuse the reader about what their overall purpose is. (It is also probably an exaggeration to state that they "quantify the ... dynamic relationship between the observed Chl variability and three main forcings" (95). They do quantify the cross-wavelet coherence of NO3/Chl with the climate indices, but the discussion of the underlying physical processes is quite speculative.)
The model-data comparison for NO3 could be expanded on a bit, e.g., "we found good agreement between nitrate in situ data and model results (r2=0.98)" (123). I think there should be Supplemental figures or tables that show the space/time domain of these comparisons, and break them down a bit more by region. To reproduce the gross spatial pattern of surface NO3, or especially the surface-to-deep gradient, is a very weak test of model skill. If one throws together data from all depths and from HNLC and nutrient-depleted subtropical waters, of course you will get a strong correlation. If one looked only at e.g., surface concentrations in the SNP, one would get a very different result. How about including a Supplemental table that shows the correlation coefficients for the three major HNLC regions, for surface concentrations only?
Some details (Note that I have not listed here the numerous passages of Discussion that are vague or excessively speculative, but the authors should take note that there are many of these and try to trim them down (or shore them up with detail) as they restructure the paper overall.)
32 "and, therefore, of the withdrawal of atmospheric CO2" Withdrawal by what process on what time scale? HNLC regions per se do not affect atmospheric CO2, unless their extent is altered by changes in external supply of iron as suggested by Martin 1990 (see also 329)
53 "oligoelements" unnecessary jargon
60 "coarsely known aspects" not clear what this means
73 "it is arguable if these ephemeral systems share structural and functioning similitudes with the large HNLC regions" it is uncertain whether these ephemeral systems share structural and functional similarities with the large HNLC regions
82 "reporting a positive North Pacific Gyre Oscillation (NPGO) and nutrient correlation" reporting that surface nutrient concentration was correlated with the North Pacific Gyre Oscillation (NPGO)
89 change "nutrient outputs" to "nutrient concentrations"
112 "a good indicator to describe the value overall phytoplankton trend" a good indicator of the magnitude of the overall phytoplankton trend?
131 change "climatological indices" to "climate indices"
218 change "the pauperized subtropical gyres" to "low-latitude oceanic waters"
220-224 I think this discussion neglects Eastern Boundary Currents, which represent one of the largest areas of consistently high Chl + high NO3
225 delete "i.e."
231 delete "values"
243 "has remained elusive since ... requires coherent information" something missing here
245 I actually think Figure S2 could be in the main text. It would help the reader to understand what the authors are doing
246 change "corresponding" to "correspond"
252 change "therein" to "there"
260 add a comma after "ratios"
262 change "ice sheet" to "sea ice"
265 "exhibits a differentiated dynamic" I can't tell what this means.
268 "phenological variations" I don't think this term is useful or necessary here
269 "This region is also subjected to zonal variations" How about "This region has distinctive eastern and western regions"?
275 "in which ocean productivity ... importance of advection of Fe" something missing here
286 "trend robustness is provided by the coherence in the time series obtained using SOM" I can't tell what this means
290 "not exclusive of oceanic Fe-limited waters, since it has been also observed in the highly productive Patagonian shelf" Not clear what they are trying to say here. The Patagonian shelf is not oceanic and is not Fe-limited.
313 add ", respectively" after "100% in April and 70% in July"
321 "The extension of the HNLC region in the boreal winter is the boreal winter is 25%" ???
325 change "thought" to "though"
343 " As shown in Figures 6 a and b, the temporal variability of both the characteristic NO3:Chl ratios and SST at each region peaks at 12-month periodicity, being this seasonal modulation more intense and temporally consistent in the case of temperature at high latitudes and weaker in the equator." This is very poor scientific writing. The result being presented is rather mundane: the most obvious detectable periodicity is the annual cycle, and seasonality is stronger at higher latitudes. Please rewrite.
346 "transference from annual to semiannual periods since 2010" Not sure what the right word is here but I am fairly sure "transference" is not it. How about "display a semiannual mode, which accounts for a larger fraction of variance after 2010"?
349 change "in" to "on"
353 delete "value"
353 change "phytoplankton uptake" to "phytoplankton biomass"
354 "Semiannual cycles" I try to avoid referring to variability at periods other than annual as "cycles" (excepting Milankovitch frequencies of course, but this paper is concerned with subannual to decadal scales) (see also 355, 433)
368 change "Contrastingly" to "Conversely" or "By contrast"
396 change "phase out" to "out of phase"
396 "suggesting a meridional propagation of the MOC effect" vague
415 add a comma after "AMOC"
416 change "more unclear" to "less clear"
417 sea ice or glacier ice?
419 delete ", also based on remotely sensed Chl,"
424 change "this effect is unlikely" to "this effect is unlikely to be important at the time scales considered here"
442 "retrieved from the increasingly improved and longer and longer time series of remote sensing observations" retrieved from time series of remote sensing observations of increasing duration and quality
445 delete "through complex processes"
Figure 1 - the white contour lines are difficult to see in some places
Figure 2 - the red lines that indicate linear trends don't look like straight lines to me, but it's hard to tell
Figure 5 - what exactly the y axis represents is not clearly explained; the meaning of the different colored bars is fairly obvious but should still be stated
English/formatting
One quirk of English usage that appears over and over is using the word "extension" instead of "extent". There are 27 in total and I think "extent" is more appropriate in virtually every case. Another is using "at" in place of "in" a region, e.g., "at SO", "at the EEP". I would write "in the XXX" in all cases. (Interestingly, it used to be fairly common to use "at" wrt cities, as in "I attended the AGU meeting at San Francisco". But this fell out of common use a long time ago.)
There are numerous references missing from the reference list, e.g., Garnesson/Grarnesson et al., 2019 (spelling varies); Green et al., 2017; Ibanhez et al., 2017; Kumar et al., 1995; Martínez-García et al., 2009; Qui, 2002 (probably Qiu). This is NOT an exhaustive list. I have doubts about whether Takeda 2011 is a traceable reference (searching on the doi turned up only stale links).
The reference format is inconsistent in the sense that multiple references within a parenthesis are sometimes arranged alphabetically, sometimes chronologically.
Citation: https://doi.org/10.5194/egusphere-2022-827-RC1 -
AC2: 'Reply on RC1', Gotzon Basterretxea, 16 Jan 2023
We would like to sincerely acknowledge RC1 for the outstanding review of this manuscript. It is rare these days to receive such detailed and appropriate feedback. His/her comments have undoubtedly contributed to a significant improvement of the manuscript. We highly appreciate his/her dedication. In the attached document, we include detailed answers to the comments.
-
AC2: 'Reply on RC1', Gotzon Basterretxea, 16 Jan 2023
-
RC2: 'Comment on egusphere-2022-827', Anonymous Referee #2, 13 Nov 2022
This paper relies on the spatio-temporal variability of the HNLC regions identified by SOM over the three major ocean areas using satellite-derived chlorophyll-a and modeling outputs of nutrients. The authors have performed the NO3:Chl as an indicator of the distribution limit of HNLC. They have demonstrated the linkages between HNLC extent and some climate-driven factors and teleconnections.
As a first very general comment, I would say that this is a valuable case study that can be published with some major corrections. The authors presented a lot of data and analysis procedures that need accurate processing schemas and precise interpretation, which they handled well.
The introduction section is well written and presents an adequate understanding of the presented work. The methodologies are adequate, but need some improvements. Some supplementary materials may be inserted into the main text because of their essential investigations and frequent references to them (e.g., Fig. S2).
The spatio-temporal variability of HNCL/NO3:Chl should be presented more precisely and quantitatively. For example, maps of monthly climatology, charts and maps of inter-annual cycles, and spatial-temporal cross section such as Hovemoler chart may present valuable results.
Some of the findings of the results section have not been well demonstrated in this study (e.g., section 3.4.1 Influence of SST variations – the global power spectra of CWT are needed to explain the intra-annual and inter-annual cycles). Some contents of Figures need to be better framed/explained. Some words would be expressed in accurate form. e.g., it is not clear in many places in the text that the “wind” word is mean wind speed, or wind vectors, or wind stress, or wind components (zonal and meridional).
I think a discussion section is required to explain the performance and limitations of the presented methodologies and results, which are not seen in this manuscript.
Bioregionalization analysis was used in previous studies to classify the global oceans (e.g., Longhurst, A. (2007), Ecological Geography of the Sea, Academic Press, London). Could the author highlight the need to new ocean’s regionalization which are not accessible from available global regionalization? And why SOM and not the other classification methods such as k-means?
I think the findings of nutrient model are not presented well. Some additional information is required.
The Mixed Layer Depth (MLD) is one of the main oceanographic indicators that can be used for interpretation of nutrients and phytoplankton variabilities. Does the nutrient model consider the MLD? If yes, please indicate in the text. And if not, I think it is required to consider the global ocean MLD in your work.
Looking to be constructive, in addition to the overall comments above, which should be taken into account in a possible review, I would like to point out the following Remarks.
1- In the 2.1 Ocean color data:
The GlobColour data are presented in 25 km spatial resolution globally. The authors have mentioned that the composites have a 0.25º spatial resolution. We know that the spatial resolution of 25 km and 0.25° are not the same specially at higher latitudes. If the 0.25° is true, please explain the spatial interpolation methods. If not, please correct.
2- In the section: 2.2 Nitrate data
The authors have made some essential assumptions that need to be approved precisely. May be explain more in the Results.
3- Please provide more information about the setup of the SOM algorithm (which are available not in the text nor supplementary material), in particular about the initial configuration: linear or random initialization, sheet or toroidal network, etc. Which neighbor function (Gaussian, Ep, et) is used to update the neighbors of the excited neuron (BMU) after each iteration during the training process.? Did the authors check the sensitivity of the SOM pattern to linear and random initialization?
4- The output of the SOM is also not well defined beyond being a map or topology; for example, how are the errors computed? The number of neurons is chosen not only depending on the topological errors (or topographic errors), but also on quantization errors. How different are the patterns when the number of neurons is, for example, 9?
5- Figure 6 and 7. The arrows are too small.
6- I recommend to show the time-series anomaly of HNLC and teleconnections at the top of the Fig. 6 and indicate the significant inter-annual cycles.
7- It is hard for readers to infer the shift after 2010 (Fig. 5). I think more visualization/explanations of data are required.
8- There are some abruptions in the significant annual cycles (1.5 year, from year 2006 to 2010) in the Fig. 7 which need to be explained. I suggest perform the global spectral cycles graphs.
9- The authors have considered SST and the teleconnections as a factor controlling HNLC variability. Are there any environmental factors such as precipitation and wind stress that may affect HNLC variability? Please discuss.
- AC3: 'Reply on RC2', Gotzon Basterretxea, 16 Jan 2023
Peer review completion
Journal article(s) based on this preprint
Viewed
HTML | XML | Total | Supplement | BibTeX | EndNote | |
---|---|---|---|---|---|---|
439 | 189 | 23 | 651 | 47 | 5 | 6 |
- HTML: 439
- PDF: 189
- XML: 23
- Total: 651
- Supplement: 47
- BibTeX: 5
- EndNote: 6
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Gotzon Basterretxea
Joan S. Font-Muñoz
Ismael Hernández-Carrasco
Sergio Sañudo-Wilhelmy
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(1710 KB) - Metadata XML
-
Supplement
(811 KB) - BibTeX
- EndNote
- Final revised paper