the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0
Abstract. The clean energy transition demands a significant increase in exploration for critical minerals, particularly rare earth elements (REEs), beyond the well-explored surface deposits. Discovery rates have been declining for decades, escalating the need for new exploration methodologies. Deep learning (DL) utilizes multi-layer neural networks to automatically model high-level abstractions of data, extracting information relevant to the target task, thereby positioning itself as a potentially powerful tool for mineral prediction. But the non-linear and highly heterogeneous characteristics of complex exploration data sets paired with sparse, imbalanced training data and a lack of interpretability represent significant challenges. In this study, we have developed DEEP-SEAM v1.0, a novel explainable semi-supervised DL framework for prospectivity mapping of REE mineralisation in Northern Curnamona Province, South Australia. This framework proposes a comprehensive data preprocessing pipeline and introduces a semi-supervised anomaly detection DL model, termed the Deviation Network (DevNet). DevNet leverages a limited number of positive samples alongside a large number of unlabelled samples to effectively establish the mapping between multi-source exploration data and mineralisation probability. The results indicate that prospective mineralisation areas exhibit a strong spatial coupling with known REE deposits, with high-probability mineralisation areas primarily concentrated in faulted regions, felsic granites, and Mesoproterozoic strata. To address concerns about the poor interpretability of DL models, we incorporate a post-hoc model interpretation technique known as the SHapley Additive exPlanations method. The method facilitates an improved understanding of the decision-making mechanisms and logic underlying DevNet. By comparing DEEP-SEAM’s decisions with our understanding of mineral systems, we not only enhance the model’s transparency and interpretability but also strengthen the reliability and credibility of the predicted prospective mineralisation areas.
- Preprint
(6238 KB) - Metadata XML
- BibTeX
- EndNote
Status: open (until 15 Nov 2025)
- RC1: 'Comment on egusphere-2025-3283', Anonymous Referee #1, 10 Oct 2025 reply
-
CEC1: 'Comment on egusphere-2025-3283 - No compliance with the policy of the journal', Juan Antonio Añel, 11 Oct 2025
reply
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlIn your "Code and data availability" section (and in the corresponding methods section in the manuscript) you link sites to access the data that you use in your work, which are not suitable repositories according to our policy (for example the South Australian Resources Information Gateway). Therefore, the current situation with your manuscript is irregular. Please, publish all the data that you have used to produce your manuscript (input and output) in one of the appropriate repositories according to our policy, and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible, as we can not accept manuscripts in Discussions that do not comply with our policy.
Also, you must include a modified 'Code and Data Availability' section in a potentially reviewed manuscript, containing the information of the new repositories.
I must note that if you do not fix this problem, we cannot accept your manuscript for publication in our journal.
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2025-3283-CEC1 -
AC1: 'Reply on CEC1', Zijing Luo, 13 Oct 2025
reply
Dear Editor Añel,
Thank you for your feedback regarding our manuscript's compliance with the Code and Data Policy.
I would like to clarify that all input and output data necessary to reproduce our study have been deposited in Zenodo. The repository can be accessed at: https://doi.org/10.5281/zenodo.17098677
The repository contains:
1. Input_Data_Layers/ - All processed and clipped data layers used as model inputs (derived from the South Australian Resources Information Gateway)
2. Output_Features_Generated/ - All model outputs and results
3. Code/ - Complete code to reproduce the resultsWhile the original raw data were downloaded from the South Australian Resources Information Gateway (as cited in our manuscript), all processed input data specific to our study area (e.g., clipped and projected using ArcGIS), along with all model outputs, are available in the above repository. Users can directly run our code using the provided input data to reproduce our results without needing to access the original data sources.
We acknowledge that our original "Code and Data Availability" section may not have clearly communicated this. We will revise this section in our manuscript to explicitly state the repository information and clarify the relationship between the original data sources and our processed datasets.
Please let us know if any additional information or modifications are needed.
Best regards,
ZijingCitation: https://doi.org/10.5194/egusphere-2025-3283-AC1
-
AC1: 'Reply on CEC1', Zijing Luo, 13 Oct 2025
reply
-
RC2: 'Comment on egusphere-2025-3283', Anonymous Referee #2, 12 Oct 2025
reply
This manuscript describes a generalisable method for using a variety of digital representations of geology to locate mineralisation. The application is to hydrothermal REE deposits in the Curnamona Province, Australia. The method uses deep learning for prediction to locate unknown mineralisation, and Shapley values to explain which input features contribute most to the predciction.
The submission is well-written and structured. The figures are a decent quality.
There are some issues that need rectifying before this work can be published.
Generalisability of the method. The authors can provide some discussion to how DEEP-SEAM can be used for other mineralisation types. This will help the manuscript fit within the scope of the journal.
Data reduction of geochemical assay. RPCA is used, and is a better choice than PCA. However, the results in Figure A1 tell me that RPCA is not that effective for data reduction. Six principal components are required to explain > 70% of the variance. I wonder whether a non-linear method such as UMAP may be a better option. UMAP can act like an auto-encoder.
Abundance of magnetic data derivatives versus other data. The number of magnetic layers used indicates the authors think that REE deposits are best imaged using magnetics, however this is not well-explained in the paper. What is the 2d correlation between these layers – i.e. are the same patterns presented in each, which can add bias?
Further, the justification of the various features with respect to REE mineralisation isn’t well established. Aside from the comment regarding magnetics, an example is using a DEM. More detail is needed on which “geological processes and environmental conditions influence mineralisation” are revealed by DEM. I’m assuming the DEM represents erosion and rock competency, but you may be thinking of others. This is important when justifying feature engineering (L258-259).
Each of the geophysical methods image different depths of the crust. This is partly due to the physics of the method itself, but mostly due to station spacing and line spacing. Wider line spacing means deeper minimum depths of imaging. Please explain this in context with the geophysical survey parameters.
The most interesting part of the study is finding areas on unknown mineralisation. However, there is limited analysis of the newly identified prospective areas. Examples are location ‘X’ (SW of E), the area extending SW of ‘E’ and a blob SE of ‘F’. Why would anyone go to these locations to acquire a tenement? Some geological analysis is needed.
The discussion needs to explore geological reasons how shap values align with geological understanding. Explore the mechanisms – what geological features contributing to mineralisation are magnetic? Why are others radioactive (e.g. lin469 to 472)? Are there any that are both magnetic and radioactive?
The results section opens with statements describing the interpretation of results for the results have been presented. E.g. “This study efficiently integrates multi-source exploration data and transforms them into a mineral prospectivity map utilising a semi-supervised DL model. Additionally, it provides interpretability for understanding the model’s prediction process”
and
“The DL-based mineral prospectivity model, DevNet, trained with optimised parameter configurations, effectively captures the complex mapping relationships between multidimensional features and mineralisation probability. The mineralisation probability is obtained by normalizing the anomaly scores output (the original value of the model output) by DevNet.”
The readers need to make up their minds by viewing the results first before the authors can make these statements (which are really discussion items).
I have minor comments listed below.
Intro
Not sure about anomaly detection being relevant to MPM. AD looks at a univariate signal, while MPM is a data fusion exercise. Anomalous prospectivity values - Just needs a clearer explanantion
Do occurrences, prospects and deposits have the same weight within your model?
Negative samples are drawn > 5km from known deposits. Establish that this is reasonable (all deposits are known? Then why do prospectivity analysis?)
L 439 Success rate high accuracy – against what benchmarks/other studies?
You use features which image the subsurface (magnetics and gravity) and those limited to the surface (DEM, ASTER and, at the scale of this study, radiometrics). It would be interesting to look at the SHAP values and see whether surface versus deeper signatures are more useful.
Figure 10. Std based features have border effects. Does this impact the model training and degrade results? Can these border effects be removed?
Citation: https://doi.org/10.5194/egusphere-2025-3283-RC2
Data sets
An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0 Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, R. Dietmar Müller https://doi.org/10.5281/zenodo.17098677
Model code and software
An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0 Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, R. Dietmar Müller https://doi.org/10.5281/zenodo.17098677
Interactive computing environment
An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0 Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, R. Dietmar Müller https://doi.org/10.5281/zenodo.17098677
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 1,049 | 45 | 16 | 1,110 | 6 | 4 |
- HTML: 1,049
- PDF: 45
- XML: 16
- Total: 1,110
- BibTeX: 6
- EndNote: 4
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
The manuscript by Luo et at al. entitled “An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0” (egusphere-2025-3283) describes a novel explainable semi-supervised deep learning framework for prospectivity mapping (MPM) of REE mineralisation in Northern Curnamona Province, South Australia.
The newly developed DEEP-SEAM v1.0 tool and associated case study should be of interest to readers of EGUsphere and the broader community of (mathematical) geoscientists but the manuscript has some shortcomings that I believe will require major revisions before it can be considered for publication.
My main issues are as follows:
Introduction
Geological Setting and REE Mineralisation
Materials and Methods
Results and Discussion
Additional minor comments are provided in the attached PDF.