the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Four-dimensional temperature, salinity and mixed layer depth in the Gulf Stream, reconstructed from remote sensing and in situ observations with neural networks
Abstract. Despite the ever-growing amount of ocean’s data, the interior of the ocean remains under sampled in regions of high variability such as the Gulf Stream. In this context, neural networks have been shown to be effective for interpolating properties and understanding ocean processes. We introduce OSnet (Ocean Stratification network), a new ocean reconstruction system aimed at providing a physically consistent analysis of the upper ocean stratification. The proposed scheme is a bootstrapped multilayer perceptron trained to predict simultaneously temperature and salinity (T-S) profiles down to 1000 m and the Mixed Layer Depth (MLD) from surface data covering 1993 to 2019. OSnet is trained to fit sea surface temperature and sea level anomalies onto all historical in-situ profiles in the Gulf Stream region. To achieve vertical coherence of the profiles, the MLD prediction is used to adjust a posteriori the vertical gradients of predicted T-S profiles, thus increasing the accuracy of the solution and removing vertical density inversions. The prediction is generalized on a 1/4◦ daily grid, producing four-dimensional fields of temperature and salinity, with their associated confidence interval issued from the bootstrap. OSnet profiles have root mean square error comparable with the observation-based Armor3D weekly product and the physics-based ocean reanalysis Glorys12. The maximum of uncertainty is located north of the Gulf Stream, between the shelf and the current, where the thermohaline variability is large. The OSnet reconstructed field is coherent even in the pre-ARGO years, demonstrating the good generalization properties of the network. It reproduces the warming trend of surface temperature, the seasonal cycle of surface salinity and mesoscale structures of temperature, salinity and MLD. While OSnet delivers an accurate interpolation of the ocean’s stratification, it is also a tool to study how the interior of the ocean’s behaviour reflects on surface data. We can compute the relative importance of each input for each T-S prediction and analyse how the network learns which surface feature influences most which property and at which depth. Our results are promising and demonstrate the power of machine learning methods to improve the prediction of ocean interior properties from observations of the ocean surface.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(3651 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(3651 KB) - Metadata XML
- BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-25', Anonymous Referee #1, 26 Apr 2022
Review of “Four-dimensional temperature, salinity and mixed layer depth in the Gulf Stream, reconstructed from remote sensing and in situ observations with neural networks”
by Pauthenet et al. submitted to egusphereSummary:
This paper presents an approach for predicting the vertical profiles of temperature and salinity over the top 1000 m from satellite surface observations by training an empirical machine learning model using in-situ profiles in the western North Atlantic Ocean. The paper emphasizes the treatment of the mixed layer depth and, specifically, a procedure to remove negative stratification from the profiles.
Overall, I found the paper to be an interesting contribution with sufficient novelty to be valuable. The ultimate impact of the work remains to be seen, but I think the paper will be worthy of publication after revision.
My major concerns are as follows:
- I find it surprising and confusing that the paper does not carefully separate capability to model the 4-D climatological annual-cycle from capability to model 4-D anomalies from this climatological annual cycle. Perhaps such an approach is superior, and the methods are fine as they are, but the evaluation should clearly separate errors in the climatology from errors in the anomalies therefrom. I think the paper would be stronger if it included more explicit and quantitative evaluation of model performance on anomalies from the climatology (Nonetheless, I like the illustrative examples).
- Relatedly, given that the method predicts the climatological annual cycle, I think the paper would be stronger if results were compared to a climatology obtained by objective mapping or optimal interpolation, e.g. updated Roemmich and Gilson 2009 gridded Argo climatology or the mean of the CORA gridded product.
- In the training, it seems that the selection of cross-validation data does not account for spatial and temporal autocorrelation. It is not clear that the testing data are independent of the training data. Perhaps this is ok, given that you’re trying to predict or map the climatology. But, the paper would be stronger if more explicit effort was made to train and test on truly independent data (at least with regard to modelling the anomalies).
- Confidence intervals or uncertainty. I’m a bit confused about how these are calculated and thus how to interpret them. The paper would be stronger if this was clearer.
- The main quantitative metric used is root-mean-square-error in physical units. I appreciate that this is physically intuitive, but this may obfuscate the generic statistical properties of the predictions. The paper would be stronger if normalized error metric were included, e.g. some sort of relative error and correlation.
- The word “coherence” is used a lot to refer to a desirable property of the 4-D gridded fields. Is this related to the frequency/waveform of the signal?? I’m not sure I understand exactly what is meant by coherence and why it is a valuable property of the predicted field. For example, in some cases, it may be that "smoothness" is unrealistic, e.g. in MLD predictions from GLORYS. Is coherence related to smoothness?
- Be more specific about what properties of a gridded T/S dataset make it useful for interpreting local oceanographic measurements or for process studies. I’m not sure what you mean? Low error? Correlation with real variability
- There are several areas where minor typographical and grammar issues need to be corrected.
Citation: https://doi.org/10.5194/egusphere-2022-25-RC1 - AC1: 'Reply on RC1', Etienne Pauthenet, 22 Jun 2022
-
RC2: 'Comment on egusphere-2022-25', Michel CREPON, 29 May 2022
Review of the paper submitted to egusphere 2022
Four dimensional temperature, salinity and mixed layer depth in the Gulf Stream, reconstructed from remote sensing and in-situ observations with neural networks
By E. Pauthenet et al
The paper aims at providing four-dimensional temperature, salinity, and mixed layer depth in the Gulf Stream, from sea surface satellite observations (SST and altimetry). Interpolations of surface data at depth are done with a NN trained on 67767 vertical profiles. In the operational phase, satellite data are associated with vertical profiles (Temperature, Salinity, Density and MLD) through the NN. The authors also present a procedure based on density stability to improve the MLD estimation. The subject is of scientific interest due to the lack of vertical profiles in the ocean with respect to satellite surface data. The procedure presented (OSnet) seems efficient to associate sea surface satellite data with their vertical profiles. But I found the paper difficult to read and poorly structured. It can be published after the following corrections and the rewriting of some sections.
Major comments
The paper is quite long and can shorten by 30%. I suspect it presents the results of Ph.d. work of an enthusiastic student who would like to present all the details of his work and has some difficulties extracting the major conclusions.
The readers of Ocean Sciences are physicists and most of them are not familiar with neural networks. Section 3.1 must be rewritten with care.
I recommend specifying that the use of a NN can be decomposed into two phases well separated:
- a learning phase in which the weights of the neurons are estimated from a learning data base.
- an operational phase consisting in retrieving the profiles from the satellite data (input data base)
The learning data must be described with care: mention the origin of the profiles, which is unclear in the present form. The input data must be justified. It appears that there is some redundancy among them: are MDTs and SLAs independent data? I do not think that geostrophic currents content added information with respect to SLA. How do you compute geostrophic current anomalies? Are they seasonal anomalies or anomalies with respect to whole observation period? Information included in SLA are also included in the geostrophic currents. These remarks are comforted by section 4.4 which shows that some variables do not play an important role and can be neglected. Section 4.4 could be suppressed if the input variables are chosen adequately in section 3 by a simple physical reasoning or by doing an EOF on the input data.
Can you comment?
The procedure for improving the MLD developed in section 3.3 is an important feature of this work, but it is hard to understand. Can you reformulate it in a simpler manner? How do you estimate the parameter lambda in the K estimation? A simpler procedure would be to apply a median filter onto the density profiles for removing the hydrostatic instability.
Can you discuss this?
The significance of the sentence printed in lines 199-200 is difficult to understand.
I have appreciated the scientific content of appendix A which aims at removing the density inversion with a physical constrained loss function, which is an original contribution of OSnet
The OSnet procedure has the characteristic of a multi-entry data base. It interpolates the profiles but does not model the physical laws connecting satellite observations and the associated vertical profiles. An original procedure using hidden Markov chain, which models these physical laws has been recently developed for retrieving vertical Chl-a vertical profiles from ocean color satellite observations (Charantonis et al, 2015, Puissant et al, 2021). Can you say some words about the philosophy of these two methods, their advantages, and disadvantages?
Minor comments
Most of the figures are very small. It is difficult to extract information from them. As an example, in Figure 4, it is difficult to identify the different profiles from each other. Besides, the significance of the two horizontal dotted lines must be mentioned in the figure legend
In figure 2, what are the units for the T rmse, S rmse, sigma0 rmse?
In figure 5, why the density distributions of SST and SSS are so different.
Section 4.2 : horizontal maps of T and S (Figures 6, 7) are not very useful since the authors focus their interest on the four dimensional representation of these two variables. Besides the figures are very small. I suggest replacing them by vertical sections.
Section 4.5 is interesting. OSnet is able to reproduce the SST due to global change. It could be used to process ocean data in climate study contexts. But I do not understand the sentence (lines 413-414) “The long term…. Based on loess” What do you mean by loess?
Section 5.1 justifies the use of OSnet for providing T S profiles at any location. Do be too modest! I would change line 353 as “One major feature of OSnet is the possibility…”. The detection by OSnet of the big warm eddy crossing the mooring is impressive. Some problem, the position of the mooring W3 presented in the map figuring in little cartoon at the left top of figure 14 does not correspond to the coordinates mentioned in the figure legend!
English must be corrected by a native English-speaking person:
There are many English mistakes
Examples: line 91 “is shown on figure 1a”; line 2007 “is shown on figure 4”; line 299 “The figures 10 and 11…….”, instead of “Figures 10 and 11……..”,
line 23 “the ocean’s surface is observed …….” Instead of “the ocean surface has been observed………”
Too many uses of the possessive case: line 23 “the ocean surface has been observed ….. “. In modern English, possessive case is mainly dedicated to persons.
Data is a plural noun (singular: datum)
Conclusion
This paper is useful contribution to ocean data sampling. It can be published after the above corrections are done. I also suggest 30% concatenation of the text which is too long with unnecessary presentations.
Citation: https://doi.org/10.5194/egusphere-2022-25-RC2 - AC2: 'Reply on RC2', Etienne Pauthenet, 22 Jun 2022
Interactive discussion
Status: closed
-
RC1: 'Comment on egusphere-2022-25', Anonymous Referee #1, 26 Apr 2022
Review of “Four-dimensional temperature, salinity and mixed layer depth in the Gulf Stream, reconstructed from remote sensing and in situ observations with neural networks”
by Pauthenet et al. submitted to egusphereSummary:
This paper presents an approach for predicting the vertical profiles of temperature and salinity over the top 1000 m from satellite surface observations by training an empirical machine learning model using in-situ profiles in the western North Atlantic Ocean. The paper emphasizes the treatment of the mixed layer depth and, specifically, a procedure to remove negative stratification from the profiles.
Overall, I found the paper to be an interesting contribution with sufficient novelty to be valuable. The ultimate impact of the work remains to be seen, but I think the paper will be worthy of publication after revision.
My major concerns are as follows:
- I find it surprising and confusing that the paper does not carefully separate capability to model the 4-D climatological annual-cycle from capability to model 4-D anomalies from this climatological annual cycle. Perhaps such an approach is superior, and the methods are fine as they are, but the evaluation should clearly separate errors in the climatology from errors in the anomalies therefrom. I think the paper would be stronger if it included more explicit and quantitative evaluation of model performance on anomalies from the climatology (Nonetheless, I like the illustrative examples).
- Relatedly, given that the method predicts the climatological annual cycle, I think the paper would be stronger if results were compared to a climatology obtained by objective mapping or optimal interpolation, e.g. updated Roemmich and Gilson 2009 gridded Argo climatology or the mean of the CORA gridded product.
- In the training, it seems that the selection of cross-validation data does not account for spatial and temporal autocorrelation. It is not clear that the testing data are independent of the training data. Perhaps this is ok, given that you’re trying to predict or map the climatology. But, the paper would be stronger if more explicit effort was made to train and test on truly independent data (at least with regard to modelling the anomalies).
- Confidence intervals or uncertainty. I’m a bit confused about how these are calculated and thus how to interpret them. The paper would be stronger if this was clearer.
- The main quantitative metric used is root-mean-square-error in physical units. I appreciate that this is physically intuitive, but this may obfuscate the generic statistical properties of the predictions. The paper would be stronger if normalized error metric were included, e.g. some sort of relative error and correlation.
- The word “coherence” is used a lot to refer to a desirable property of the 4-D gridded fields. Is this related to the frequency/waveform of the signal?? I’m not sure I understand exactly what is meant by coherence and why it is a valuable property of the predicted field. For example, in some cases, it may be that "smoothness" is unrealistic, e.g. in MLD predictions from GLORYS. Is coherence related to smoothness?
- Be more specific about what properties of a gridded T/S dataset make it useful for interpreting local oceanographic measurements or for process studies. I’m not sure what you mean? Low error? Correlation with real variability
- There are several areas where minor typographical and grammar issues need to be corrected.
Citation: https://doi.org/10.5194/egusphere-2022-25-RC1 - AC1: 'Reply on RC1', Etienne Pauthenet, 22 Jun 2022
-
RC2: 'Comment on egusphere-2022-25', Michel CREPON, 29 May 2022
Review of the paper submitted to egusphere 2022
Four dimensional temperature, salinity and mixed layer depth in the Gulf Stream, reconstructed from remote sensing and in-situ observations with neural networks
By E. Pauthenet et al
The paper aims at providing four-dimensional temperature, salinity, and mixed layer depth in the Gulf Stream, from sea surface satellite observations (SST and altimetry). Interpolations of surface data at depth are done with a NN trained on 67767 vertical profiles. In the operational phase, satellite data are associated with vertical profiles (Temperature, Salinity, Density and MLD) through the NN. The authors also present a procedure based on density stability to improve the MLD estimation. The subject is of scientific interest due to the lack of vertical profiles in the ocean with respect to satellite surface data. The procedure presented (OSnet) seems efficient to associate sea surface satellite data with their vertical profiles. But I found the paper difficult to read and poorly structured. It can be published after the following corrections and the rewriting of some sections.
Major comments
The paper is quite long and can shorten by 30%. I suspect it presents the results of Ph.d. work of an enthusiastic student who would like to present all the details of his work and has some difficulties extracting the major conclusions.
The readers of Ocean Sciences are physicists and most of them are not familiar with neural networks. Section 3.1 must be rewritten with care.
I recommend specifying that the use of a NN can be decomposed into two phases well separated:
- a learning phase in which the weights of the neurons are estimated from a learning data base.
- an operational phase consisting in retrieving the profiles from the satellite data (input data base)
The learning data must be described with care: mention the origin of the profiles, which is unclear in the present form. The input data must be justified. It appears that there is some redundancy among them: are MDTs and SLAs independent data? I do not think that geostrophic currents content added information with respect to SLA. How do you compute geostrophic current anomalies? Are they seasonal anomalies or anomalies with respect to whole observation period? Information included in SLA are also included in the geostrophic currents. These remarks are comforted by section 4.4 which shows that some variables do not play an important role and can be neglected. Section 4.4 could be suppressed if the input variables are chosen adequately in section 3 by a simple physical reasoning or by doing an EOF on the input data.
Can you comment?
The procedure for improving the MLD developed in section 3.3 is an important feature of this work, but it is hard to understand. Can you reformulate it in a simpler manner? How do you estimate the parameter lambda in the K estimation? A simpler procedure would be to apply a median filter onto the density profiles for removing the hydrostatic instability.
Can you discuss this?
The significance of the sentence printed in lines 199-200 is difficult to understand.
I have appreciated the scientific content of appendix A which aims at removing the density inversion with a physical constrained loss function, which is an original contribution of OSnet
The OSnet procedure has the characteristic of a multi-entry data base. It interpolates the profiles but does not model the physical laws connecting satellite observations and the associated vertical profiles. An original procedure using hidden Markov chain, which models these physical laws has been recently developed for retrieving vertical Chl-a vertical profiles from ocean color satellite observations (Charantonis et al, 2015, Puissant et al, 2021). Can you say some words about the philosophy of these two methods, their advantages, and disadvantages?
Minor comments
Most of the figures are very small. It is difficult to extract information from them. As an example, in Figure 4, it is difficult to identify the different profiles from each other. Besides, the significance of the two horizontal dotted lines must be mentioned in the figure legend
In figure 2, what are the units for the T rmse, S rmse, sigma0 rmse?
In figure 5, why the density distributions of SST and SSS are so different.
Section 4.2 : horizontal maps of T and S (Figures 6, 7) are not very useful since the authors focus their interest on the four dimensional representation of these two variables. Besides the figures are very small. I suggest replacing them by vertical sections.
Section 4.5 is interesting. OSnet is able to reproduce the SST due to global change. It could be used to process ocean data in climate study contexts. But I do not understand the sentence (lines 413-414) “The long term…. Based on loess” What do you mean by loess?
Section 5.1 justifies the use of OSnet for providing T S profiles at any location. Do be too modest! I would change line 353 as “One major feature of OSnet is the possibility…”. The detection by OSnet of the big warm eddy crossing the mooring is impressive. Some problem, the position of the mooring W3 presented in the map figuring in little cartoon at the left top of figure 14 does not correspond to the coordinates mentioned in the figure legend!
English must be corrected by a native English-speaking person:
There are many English mistakes
Examples: line 91 “is shown on figure 1a”; line 2007 “is shown on figure 4”; line 299 “The figures 10 and 11…….”, instead of “Figures 10 and 11……..”,
line 23 “the ocean’s surface is observed …….” Instead of “the ocean surface has been observed………”
Too many uses of the possessive case: line 23 “the ocean surface has been observed ….. “. In modern English, possessive case is mainly dedicated to persons.
Data is a plural noun (singular: datum)
Conclusion
This paper is useful contribution to ocean data sampling. It can be published after the above corrections are done. I also suggest 30% concatenation of the text which is too long with unnecessary presentations.
Citation: https://doi.org/10.5194/egusphere-2022-25-RC2 - AC2: 'Reply on RC2', Etienne Pauthenet, 22 Jun 2022
Peer review completion
Journal article(s) based on this preprint
Data sets
Gulf Stream Daily Temperature, Salinity and Mixed Layer Depth fields from Ocean Stratification network (OSnet). Pauthenet, Etienne; Bachelot, Loïc; Tréguier, Anne-Marie; Balem, Kevin; Maze, Guillaume; Roquet, Fabien; Fablet, Ronan; Tandeo, Pierre https://doi.org/10.5281/zenodo.6011144
Model code and software
OSnet Gulf Stream Etienne Pauthenet, Loïc Bachelot, Guillaume Maze, Kevin Balem https://github.com/euroargodev/OSnet-GulfStream
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
480 | 192 | 22 | 694 | 9 | 9 |
- HTML: 480
- PDF: 192
- XML: 22
- Total: 694
- BibTeX: 9
- EndNote: 9
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Etienne Pauthenet
Loïc Bachelot
Kevin Balem
Guillaume Maze
Anne-Marie Tréguier
Fabien Roquet
Ronan Fablet
Pierre Tandeo
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(3651 KB) - Metadata XML