Complex fault system revealed from 3-D seismic reflection data with deep learning and fault network analysis

Wrona, Thilo; Pan, Indranil; Bell, Rebecca; Jackson, Christopher A.-L.; Gawthorpe, Robert; Fossen, Haakon; Osagiede, Edoseghe; Brune, Sascha

doi:https://doi.org/10.5194/egusphere-2022-1190

Preprints

https://doi.org/10.5194/egusphere-2022-1190

Preprints

28 Nov 2022

| 28 Nov 2022

Complex fault system revealed from 3-D seismic reflection data with deep learning and fault network analysis

Thilo Wrona, Indranil Pan, Rebecca Bell, Christopher A.-L. Jackson, Robert Gawthorpe, Haakon Fossen, Edoseghe Osagiede, and Sascha Brune

Abstract. Understanding where normal faults are is critical to an accurate assessment of seismic hazard, the successful exploration for and production of natural (including low-carbon) resources, and for the safe subsurface storage of CO2. Our current knowledge of normal fault systems is largely derived from seismic reflection data imaging intra-continental rifts and continental margins. However, exploitation of these data is limited by interpretation biases, data coverage and resolution, restricting our understanding of fault systems. Applying supervised deep learning to one of the largest offshore 3-D seismic reflection data sets from the northern North Sea allows us to image the complexity of the rift-related fault system. The derived fault score volume allows us to extract almost 8000 individual normal faults of different geometries, which together form an intricate network characterised by a multitude of splays, junctions and intersections. Combining tools from deep learning, computer vision and network analysis allows us to map and analyse the fault system in great detail and a fraction of the time required by conventional interpretation methods. As such, this study shows how we can efficiently identify and analyse fault systems in increasingly large 3-D seismic data sets.

Received: 04 Nov 2022 – Discussion started: 28 Nov 2022

Download & links

Preprint (PDF, 28458 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (28458 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

21 Nov 2023

Complex fault system revealed by 3-D seismic reflection data with deep learning and fault network analysis

Thilo Wrona, Indranil Pan, Rebecca E. Bell, Christopher A.-L. Jackson, Robert L. Gawthorpe, Haakon Fossen, Edoseghe E. Osagiede, and Sascha Brune

Solid Earth, 14, 1181–1195, https://doi.org/10.5194/se-14-1181-2023,https://doi.org/10.5194/se-14-1181-2023, 2023

Short summary

Thilo Wrona et al.

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2022-1190', Lukas Mosser, 22 Jan 2023

Dear Authors, dear Editor,

Thank you for the opportunity to review this manuscript.

I have read it with great interest, and find the manuscript well written, comprehensive, and an important study showing the potential of machine learning-enabled large-scale studies of geological structures in application to CO2 sequestration. While the proposed study is centred around subsurface storage of CO2, I am certain that similar methodologies could be applied in other areas e.g. in near-surface geophysics.

I will first outline a brief summary of the main findings, and then provide a few high-level questions/comments which I hope will further improve the manuscript. For this, I will mainly focus on the machine learning aspect, and fault extraction aspect of the presented work, as this is where I see the area that I am most confident in my own understanding. Nevertheless, I may comment on the geological aspect, and highlight here that my comments are well-intended, but misinformed due to a lack of expertise in a field in which I am confident the authors and other reviewers are well-versed.

The presented manuscript proposes the application of deep neural networks and automated fault extraction to perform rapid large-scale analysis of normal faults from 3D seismic data. The study area is focused on the North Sea covering roughly 35000 km2 of the northern North Sea rift zone. Deep neural networks have been trained on 2D patches extracted. The authors use an automated fault extraction method to ultimately analyse fault length, strike, density, and continuity. In their discussion, they highlight the advantages of deep learning-based fault interpretation and also relate this to manual, human interpretation. The authors show comprehensively how these technologies can in combination generate new insights into geological systems that would otherwise be extremely tedious to perform by manual interpretation while providing a repeatable process that can be automated.

The manuscript is accompanied by a number of well-laid-out figures that support the findings described in the text.

In summary, I do not see any major issues with the manuscript and it should therefore be accepted subject to minor revisions and comments.

Line 29: Would the same study be possible for other types of faults? Have only normal faults been labelled in the data labelling process for training the deep neural network? Would the network be able to distinguish normal faults from reverse faults for example?

Line 46 and 47: I feel the wording around “proof of concept” and “has yet to” could be improved. There is a lot of groundwork that needs to be done to understand the ability of these deep neural networks to become a reliable source of knowledge. Just because there has not been a study to evaluate the insights gained does not mean there wasn’t potential to do so coming from numerous works where the ability of the networks to detect faults have been established, some of which have even been published including Model weights e.g. Wu et. al.

Line 50: I am not sure I like the use of “<0.1% of data volume”. I have made this statement myself before, but I believe rather than focusing on the reduced volume we should focus on data quality. Apart from the fact that seismic data have strong lateral correlations making additional neighbouring data less diverse, we can also consider other criteria: How many of the relevant types of faults have been mapped, and how many noise modalities have been incorporated? Nothing to change here for now, but potentially something to address in the future?

Line 88: Accuracy is a metric that is not well suited for class imbalanced problems such as fault detection. Have you considered using the F1 score?

Line 89: Did you monitor the training or validation loss?

Line 93: Have you considered also publishing the weights?

Line 108: The faults identified at threshold < 0.3 may be small faults but also misclassifications, the same goes for larger faults where the threshold may be >0.3.

How did you determine an appropriate threshold, given that you also filter small faults during the extraction phase?

Line 174: Should we not add the training time, model validation and QC, and labelling time as well to make the comparison fair? In this case the model was created specifically for this dataset, so the cost would not amortize over the application to many other datasets.

Line 180: The fatbox toolbox was mentioned earlier, do you need to repeat it?

Line 185: It is mentioned later that the fault score should not be equated with what I assume is a calibrated fault probability, yet here you mention that it is possible to determine how likely a fault is to occur. From a miscalibrated model this judgement can be misrepresented. See

Runhai Feng, Dario Grana, and Niels Balling, (2021), "Uncertainty quantification in fault detection using convolutional neural networks," 86: M41-M48. https://doi.org/10.1190/geo2020-0424.1

Lukas Mosser and Ehsan Zabihi Naeini, (2022), "A comprehensive study of calibration and uncertainty quantification for Bayesian convolutional neural networks — An application to seismic data," 87: IM157-IM176. https://doi.org/10.1190/geo2021-0318.1

for examples of how this can be ameliorated. Including validations against synthetic data and corresponding metrics (Mosser & Naeini 2022).

Line 188: Wouldn’t you have to determine this empirically if your model does not differentiate based on strike, shape, or size? Some indications of this are the inability to detect faults that are oblique or parallel to inline crossline detection. In this case this is a given by design since the model is a 2D network. Validations using synthetic fault geometries would certainly help support or validate such assumptions.

Line 214: The statement “we can use (the fault score) as a proxy for how likely it is to encounter a fault” and the subsequent statement of “Bayesian neural networks … able to predict true fault probabilities“ are a bit contradictory. Would you agree that using the fault score as a proxy can only be done if the corresponding scores can be calibrated against independent data? I am not sure how you see the fault score being used in a quantitative manner, or whether you mean that the fault score can be used in a qualitative manner to indicate the presence of a fault which should not be mistaken for a true probability. Examples of such approaches are highlighted e.g. in Mosser & Naeini 2022 showing miscalibrations of U-Nets trained with balanced loss functions.

Line 221: Agreed, 3D fault extraction libraries would make for a great addition to the open-source software domain.

Regarding Figure 3, could the authors address why they choose to highlight only faults with a probability > 0.5 in the colour map, as opposed to figure 4?

Do the authors have any recommendations on how to choose filter sizes for the Gaussian blur? Is there a reason behind using a Gaussian blur to preprocess the fault score maps? Why would thresholding not be sufficient?

In figure 8, we can clearly identify some major faults not picked by the network and extraction in the lower right corner of the image. Is the seismic data that was used the same? It is addressed generally in the text, but I couldn’t see if it was the same seismic dataset.

Have the authors considered processing the dataset in the main fault strike orientations i.e. NE-SW and NW-SE instead of inline direction? This could help better identification of oblique faults which are otherwise not well-imaged.

How was the Fault density square area measured, and how was the size of the averaging element chosen for Figure 10 D?

Citation: https://doi.org/10.5194/egusphere-2022-1190-RC1
- AC2: 'Reply on RC1', Thilo Wrona, 21 Jun 2023
  
  Our reply is in the supplement, because the formating is completely changed by the online text box.
  
  Citation: https://doi.org/10.5194/egusphere-2022-1190-AC2
RC2:
'Comment on egusphere-2022-1190', Heather Bedle, 27 Mar 2023

Overall this is a brief and concise synopsis of using machine learning techniques to map faults in the North Sea. I found that the manuscript iteself was quite brief, referring to other works by the main author for details on methods, algorithms and such. Normally, this would be acceptable, if the paper focused on new insights into the geology or understanding of the geologic system being studied, or a new methodology to showcase - but, despite the rapid mapping of the faults in the area, minimal discussion was presented on what these new geologic results revealed or how they changed the overall understanding of the system. I think the contribution of this method should be discussed in more detail, whether it is an improvement on the methods that they used (that were previously published) or an improved understanding of the geologic setting. As written, in seems to be an incomplete case study of an existing method... but I suspect that the authors can easily add much more to really communicate the uniqueness and new insights that their method provides.

Citation: https://doi.org/10.5194/egusphere-2022-1190-RC2
- AC1:
  'Reply on RC2', Thilo Wrona, 21 Jun 2023
  We thank the reviewer for the feedback. The reviewer is right that the manuscript sits between a geophysical methods paper and a geological case study. It is supposed to showcase the ability of deep learning to perform high-resolution, regional fault mapping in 3-D seismic reflection datasets. In comparison to previous studies, the key geophysical advances are:
  The scale and quality of the seismic dataset
  
  The speed and accuracy of the fault prediction
  
  The fault extraction and analysis workflow
  
  The key geological advance is to reveal the incredible complexity of this fault system (L297-302):
  ‘This fault network shows large variations in fault length, strike and density, with extremely complex splays, junctions and intersections between these faults (Figs. 7-11). As such, our work goes far beyond typical seismic interpretations in previous case studies, which covered only a fraction of the rift (e.g. Duffy et al., 2015; Deng et al., 2017; Tillmans et al., 2021), or regional studies that mapped <100 of the largest faults using primarily sparse, 2-D seismic sections (e.g. Fig. 1B; Fazlikhani et al., 2017; Phillips et al., 2019).’
  As such, we hope that this work will spark a new wave of studies investigating the evolution of such an incredibly complex fault system.
  
  Citation: https://doi.org/10.5194/egusphere-2022-1190-AC1

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2022-1190', Lukas Mosser, 22 Jan 2023

Dear Authors, dear Editor,

Thank you for the opportunity to review this manuscript.

I have read it with great interest, and find the manuscript well written, comprehensive, and an important study showing the potential of machine learning-enabled large-scale studies of geological structures in application to CO2 sequestration. While the proposed study is centred around subsurface storage of CO2, I am certain that similar methodologies could be applied in other areas e.g. in near-surface geophysics.

I will first outline a brief summary of the main findings, and then provide a few high-level questions/comments which I hope will further improve the manuscript. For this, I will mainly focus on the machine learning aspect, and fault extraction aspect of the presented work, as this is where I see the area that I am most confident in my own understanding. Nevertheless, I may comment on the geological aspect, and highlight here that my comments are well-intended, but misinformed due to a lack of expertise in a field in which I am confident the authors and other reviewers are well-versed.

The presented manuscript proposes the application of deep neural networks and automated fault extraction to perform rapid large-scale analysis of normal faults from 3D seismic data. The study area is focused on the North Sea covering roughly 35000 km2 of the northern North Sea rift zone. Deep neural networks have been trained on 2D patches extracted. The authors use an automated fault extraction method to ultimately analyse fault length, strike, density, and continuity. In their discussion, they highlight the advantages of deep learning-based fault interpretation and also relate this to manual, human interpretation. The authors show comprehensively how these technologies can in combination generate new insights into geological systems that would otherwise be extremely tedious to perform by manual interpretation while providing a repeatable process that can be automated.

The manuscript is accompanied by a number of well-laid-out figures that support the findings described in the text.

In summary, I do not see any major issues with the manuscript and it should therefore be accepted subject to minor revisions and comments.

Line 29: Would the same study be possible for other types of faults? Have only normal faults been labelled in the data labelling process for training the deep neural network? Would the network be able to distinguish normal faults from reverse faults for example?

Line 46 and 47: I feel the wording around “proof of concept” and “has yet to” could be improved. There is a lot of groundwork that needs to be done to understand the ability of these deep neural networks to become a reliable source of knowledge. Just because there has not been a study to evaluate the insights gained does not mean there wasn’t potential to do so coming from numerous works where the ability of the networks to detect faults have been established, some of which have even been published including Model weights e.g. Wu et. al.

Line 50: I am not sure I like the use of “<0.1% of data volume”. I have made this statement myself before, but I believe rather than focusing on the reduced volume we should focus on data quality. Apart from the fact that seismic data have strong lateral correlations making additional neighbouring data less diverse, we can also consider other criteria: How many of the relevant types of faults have been mapped, and how many noise modalities have been incorporated? Nothing to change here for now, but potentially something to address in the future?

Line 88: Accuracy is a metric that is not well suited for class imbalanced problems such as fault detection. Have you considered using the F1 score?

Line 89: Did you monitor the training or validation loss?

Line 93: Have you considered also publishing the weights?

Line 108: The faults identified at threshold < 0.3 may be small faults but also misclassifications, the same goes for larger faults where the threshold may be >0.3.

How did you determine an appropriate threshold, given that you also filter small faults during the extraction phase?

Line 174: Should we not add the training time, model validation and QC, and labelling time as well to make the comparison fair? In this case the model was created specifically for this dataset, so the cost would not amortize over the application to many other datasets.

Line 180: The fatbox toolbox was mentioned earlier, do you need to repeat it?

Line 185: It is mentioned later that the fault score should not be equated with what I assume is a calibrated fault probability, yet here you mention that it is possible to determine how likely a fault is to occur. From a miscalibrated model this judgement can be misrepresented. See

Runhai Feng, Dario Grana, and Niels Balling, (2021), "Uncertainty quantification in fault detection using convolutional neural networks," 86: M41-M48. https://doi.org/10.1190/geo2020-0424.1

Lukas Mosser and Ehsan Zabihi Naeini, (2022), "A comprehensive study of calibration and uncertainty quantification for Bayesian convolutional neural networks — An application to seismic data," 87: IM157-IM176. https://doi.org/10.1190/geo2021-0318.1

for examples of how this can be ameliorated. Including validations against synthetic data and corresponding metrics (Mosser & Naeini 2022).

Line 188: Wouldn’t you have to determine this empirically if your model does not differentiate based on strike, shape, or size? Some indications of this are the inability to detect faults that are oblique or parallel to inline crossline detection. In this case this is a given by design since the model is a 2D network. Validations using synthetic fault geometries would certainly help support or validate such assumptions.

Line 214: The statement “we can use (the fault score) as a proxy for how likely it is to encounter a fault” and the subsequent statement of “Bayesian neural networks … able to predict true fault probabilities“ are a bit contradictory. Would you agree that using the fault score as a proxy can only be done if the corresponding scores can be calibrated against independent data? I am not sure how you see the fault score being used in a quantitative manner, or whether you mean that the fault score can be used in a qualitative manner to indicate the presence of a fault which should not be mistaken for a true probability. Examples of such approaches are highlighted e.g. in Mosser & Naeini 2022 showing miscalibrations of U-Nets trained with balanced loss functions.

Line 221: Agreed, 3D fault extraction libraries would make for a great addition to the open-source software domain.

Regarding Figure 3, could the authors address why they choose to highlight only faults with a probability > 0.5 in the colour map, as opposed to figure 4?

Do the authors have any recommendations on how to choose filter sizes for the Gaussian blur? Is there a reason behind using a Gaussian blur to preprocess the fault score maps? Why would thresholding not be sufficient?

In figure 8, we can clearly identify some major faults not picked by the network and extraction in the lower right corner of the image. Is the seismic data that was used the same? It is addressed generally in the text, but I couldn’t see if it was the same seismic dataset.

Have the authors considered processing the dataset in the main fault strike orientations i.e. NE-SW and NW-SE instead of inline direction? This could help better identification of oblique faults which are otherwise not well-imaged.

How was the Fault density square area measured, and how was the size of the averaging element chosen for Figure 10 D?

Citation: https://doi.org/10.5194/egusphere-2022-1190-RC1
- AC2: 'Reply on RC1', Thilo Wrona, 21 Jun 2023
  
  Our reply is in the supplement, because the formating is completely changed by the online text box.
  
  Citation: https://doi.org/10.5194/egusphere-2022-1190-AC2
RC2:
'Comment on egusphere-2022-1190', Heather Bedle, 27 Mar 2023

Overall this is a brief and concise synopsis of using machine learning techniques to map faults in the North Sea. I found that the manuscript iteself was quite brief, referring to other works by the main author for details on methods, algorithms and such. Normally, this would be acceptable, if the paper focused on new insights into the geology or understanding of the geologic system being studied, or a new methodology to showcase - but, despite the rapid mapping of the faults in the area, minimal discussion was presented on what these new geologic results revealed or how they changed the overall understanding of the system. I think the contribution of this method should be discussed in more detail, whether it is an improvement on the methods that they used (that were previously published) or an improved understanding of the geologic setting. As written, in seems to be an incomplete case study of an existing method... but I suspect that the authors can easily add much more to really communicate the uniqueness and new insights that their method provides.

Citation: https://doi.org/10.5194/egusphere-2022-1190-RC2
- AC1:
  'Reply on RC2', Thilo Wrona, 21 Jun 2023
  We thank the reviewer for the feedback. The reviewer is right that the manuscript sits between a geophysical methods paper and a geological case study. It is supposed to showcase the ability of deep learning to perform high-resolution, regional fault mapping in 3-D seismic reflection datasets. In comparison to previous studies, the key geophysical advances are:
  The scale and quality of the seismic dataset
  
  The speed and accuracy of the fault prediction
  
  The fault extraction and analysis workflow
  
  The key geological advance is to reveal the incredible complexity of this fault system (L297-302):
  ‘This fault network shows large variations in fault length, strike and density, with extremely complex splays, junctions and intersections between these faults (Figs. 7-11). As such, our work goes far beyond typical seismic interpretations in previous case studies, which covered only a fraction of the rift (e.g. Duffy et al., 2015; Deng et al., 2017; Tillmans et al., 2021), or regional studies that mapped <100 of the largest faults using primarily sparse, 2-D seismic sections (e.g. Fig. 1B; Fazlikhani et al., 2017; Phillips et al., 2019).’
  As such, we hope that this work will spark a new wave of studies investigating the evolution of such an incredibly complex fault system.
  
  Citation: https://doi.org/10.5194/egusphere-2022-1190-AC1

Peer review completion

AR: Author's response | RR: Referee report | ED: Editor decision | EF: Editorial file upload

AR by Thilo Wrona on behalf of the Authors (21 Jun 2023) Author's response Author's tracked changes

EF by Sarah Buchmann (23 Jun 2023) Manuscript

ED: Referee Nomination & Report Request started (27 Jun 2023) by Michal Malinowski

RR by Anonymous Referee #2 (05 Jul 2023)

ED: Publish subject to minor revisions (review by editor) (06 Jul 2023) by Michal Malinowski

AR by Thilo Wrona on behalf of the Authors (12 Jul 2023) Author's response Author's tracked changes Manuscript

ED: Publish as is (11 Aug 2023) by Michal Malinowski

ED: Publish as is (03 Oct 2023) by Susanne Buiter (Executive editor)

AR by Thilo Wrona on behalf of the Authors (05 Oct 2023) Manuscript

Journal article(s) based on this preprint

21 Nov 2023

Complex fault system revealed by 3-D seismic reflection data with deep learning and fault network analysis

Thilo Wrona, Indranil Pan, Rebecca E. Bell, Christopher A.-L. Jackson, Robert L. Gawthorpe, Haakon Fossen, Edoseghe E. Osagiede, and Sascha Brune

Solid Earth, 14, 1181–1195, https://doi.org/10.5194/se-14-1181-2023,https://doi.org/10.5194/se-14-1181-2023, 2023

Short summary

Thilo Wrona et al.

Viewed

Total article views: 587 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
392	176	19	587	9	9

HTML: 392
PDF: 176
XML: 19
Total: 587
BibTeX: 9
EndNote: 9

Views and downloads (calculated since 28 Nov 2022)

Month	HTML	PDF	XML	Total
Nov 2022	48	11	3	62
Dec 2022	87	32	2	121
Jan 2023	46	13	2	61
Feb 2023	30	15	0	45
Mar 2023	42	16	2	60
Apr 2023	23	11	0	34
May 2023	15	4	0	19
Jun 2023	28	12	6	46
Jul 2023	15	7	1	23
Aug 2023	15	13	1	29
Sep 2023	17	28	0	45
Oct 2023	18	11	1	30
Nov 2023	8	3	1	12
Dec 2023	0
Jan 2024	0

Cumulative views and downloads (calculated since 28 Nov 2022)

Month	HTML	PDF	XML	Total
Nov 2022	48	11	3	62
Dec 2022	87	32	2	121
Jan 2023	46	13	2	61
Feb 2023	30	15	0	45
Mar 2023	42	16	2	60
Apr 2023	23	11	0	34
May 2023	15	4	0	19
Jun 2023	28	12	6	46
Jul 2023	15	7	1	23
Aug 2023	15	13	1	29
Sep 2023	17	28	0	45
Oct 2023	18	11	1	30
Nov 2023	8	3	1	12
Dec 2023	0
Jan 2024	0

Viewed (geographical distribution)

Total article views: 559 (including HTML, PDF, and XML) Thereof 559 with geography defined and 0 with unknown origin.

Country	#	Views	%

Cited

Latest update: 14 Jan 2024

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (28458 KB)
Metadata XML

Short summary

We need to understand where faults are to (1) assess their seismic hazard, (2) to explore for natural resources and (3) to store CO2 safely in the subsurface. Currently we still map faults manually using seismic data i.e. acoustic images of the subsurface. Mapping these images is however difficult and time-consuming. Here we show how to use deep learning and network analysis to accelerate and simplify fault mapping.

Complex fault system revealed from 3-D seismic reflection data with deep learning and fault network analysis

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Suggestions for revision or reasons for rejection

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)

Cited

2 citations as recorded by crossref.


Total:	0
HTML:	0
PDF:	0
XML:	0