An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0

Luo, Zijing; Farahbakhsh, Ehsan; Hore, Stephen; Müller, R. Dietmar

doi:10.5194/egusphere-2025-3283

Preprints

https://doi.org/10.5194/egusphere-2025-3283

Preprints

15 Sep 2025

| 15 Sep 2025

An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0

Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, and R. Dietmar Müller

Abstract. The clean energy transition demands a significant increase in exploration for critical minerals, particularly rare earth elements (REEs), beyond the well-explored surface deposits. Discovery rates have been declining for decades, escalating the need for new exploration methodologies. Deep learning (DL) utilizes multi-layer neural networks to automatically model high-level abstractions of data, extracting information relevant to the target task, thereby positioning itself as a potentially powerful tool for mineral prediction. But the non-linear and highly heterogeneous characteristics of complex exploration data sets paired with sparse, imbalanced training data and a lack of interpretability represent significant challenges. In this study, we have developed DEEP-SEAM v1.0, a novel explainable semi-supervised DL framework for prospectivity mapping of REE mineralisation in Northern Curnamona Province, South Australia. This framework proposes a comprehensive data preprocessing pipeline and introduces a semi-supervised anomaly detection DL model, termed the Deviation Network (DevNet). DevNet leverages a limited number of positive samples alongside a large number of unlabelled samples to effectively establish the mapping between multi-source exploration data and mineralisation probability. The results indicate that prospective mineralisation areas exhibit a strong spatial coupling with known REE deposits, with high-probability mineralisation areas primarily concentrated in faulted regions, felsic granites, and Mesoproterozoic strata. To address concerns about the poor interpretability of DL models, we incorporate a post-hoc model interpretation technique known as the SHapley Additive exPlanations method. The method facilitates an improved understanding of the decision-making mechanisms and logic underlying DevNet. By comparing DEEP-SEAM’s decisions with our understanding of mineral systems, we not only enhance the model’s transparency and interpretability but also strengthen the reliability and credibility of the predicted prospective mineralisation areas.

Received: 09 Jul 2025 – Discussion started: 15 Sep 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, and R. Dietmar Müller

Status: final response (author comments only)

RC1:
'Comment on egusphere-2025-3283', Anonymous Referee #1, 10 Oct 2025
The manuscript by Luo et at al. entitled “An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0” (egusphere-2025-3283) describes a novel explainable semi-supervised deep learning framework for prospectivity mapping (MPM) of REE mineralisation in Northern Curnamona Province, South Australia.
The newly developed DEEP-SEAM v1.0 tool and associated case study should be of interest to readers of EGUsphere and the broader community of (mathematical) geoscientists but the manuscript has some shortcomings that I believe will require major revisions before it can be considered for publication.
My main issues are as follows:
Introduction
The Introduction requires significant shortening by at least 33%. In its current format, it is too ‘wordy’ and presents a somewhat cumbersome read in that it contains too much technical detail, which would be better suited to the Materials and Methods section.

Geological Setting and REE Mineralisation
In my opinion, this section is the weakest part of your manuscript and one that requires significant strengthening. I understand that the focus of your study is on technology development and demonstrating the capabilities of the new newly developed DEEP-SEAM v1.0 tool. However, presenting a stronger more comprehensive targeting model would help to better position your manuscript and tool as well as generate greater impact and interest.

I strongly recommend the following: (1) Restructure and expand your Table 1 to include additional information relating to host rocks (types and ages), host structures and ore controls, mineralisation styles, ore and gangue mineralogy, alteration types and assemblages, mineralisation ages, grade/tonnage information, etc. (2) The restructuring and expansion of Table 1 will go a long way to better demonstrate that the REE and U(-REE) and Cu(-REE) occurrences your work is based on form part of a coherent group of mineral deposit (i.e., part of a single mineral system). As a reader who is unfamiliar with these occurrences your manuscript does little to convince me of their similar origin. This is a critical issue as mixing mineral systems in MPM may give you garbage results. (3) Combine or link the improved Table 1 with Table 2 in a way that you clearly demonstrate which data map which expressions of the targeted mineral system. (4) The study area is relatively small, in particular for an MPM study. Why did you not include the entire Mt Babbage and Mt Painter inliers, or at least the parts that are well exposed? That's not clear to me. Superimposing artificial, man-made boundaries on mineral systems should be avoided at all costs. At the very least, a clear explanation is required here as to the design of and rationale for the study area and its boundaries. Ideally, you should expand the study area to include the entire prospective tract. This should be easily possible given the great open-source data provided by the geological surveys of South Australia and New South Wales.

Materials and Methods
Here you briefly refer to a “conceptual mineralisation model”, which, however, is nowhere presented in the manuscript. This point links back to my above comments regarding the need for including a comprehensive targeting model that links mineralising processes to mappable ingredients of the REE-U-Cu system and the datasets available for mapping the system footprints. See paper by McCuaig et al. (2010) for more information (https://www.sciencedirect.com/science/article/pii/S0169136810000612)

How did you determine buffer zone sizes?

Results and Discussion
Parts of the Discussion (highlighted in the attached commented PDF) deal with geology and methodology. These parts should be presented in the relevant Geology and Methods sections.

Edge effects in Figure 10 (p. 29) must be discussed.

Additional minor comments are provided in the attached PDF.
Citation: https://doi.org/10.5194/egusphere-2025-3283-RC1
- AC3: 'Reply on RC1', Zijing Luo, 16 Dec 2025
  
  Comment 1.1 — The manuscript by Luo et at al. entitled “An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0” (egusphere-2025-3283) describes a novel explainable semi-supervised deep learning framework for prospectivity mapping (MPM) of REE mineralisation in Northern Curnamona Province, South Australia. The newly developed DEEP-SEAM v1.0 tool and associated case study should be of interest to readers of EGUsphere and the broader community of (mathematical) geoscientists but the manuscript has some shortcomings that I believe will require major revisions before it can be considered for publication.
  Response: Thank you for your thoughtful feedback on our manuscript. We greatly appreciate your comprehensive review and the helpful suggestions you provided in the attached PDF. We have carefully addressed each of your comments and made corresponding revisions to enhance the quality of our paper. Note: As advised not to upload the revised manuscript during the response process, please refer to the attached document for more detailed information regarding the specific revisions made in response to your comments.
  Comment 1.2 — The Introduction requires significant shortening by at least 33%. In its current format, it is too ‘wordy’ and presents a somewhat cumbersome read in that it contains too much technical detail, which would be better suited to the Materials and Methods section.
  Response: Thank you for pointing this out. We agree and have relocated the technical details to the Materials and Methods section, making the Introduction more concise.
  Comment 1.3 — In my opinion, this section ‘Geological Setting and REE Mineralisation’ is the weakest part of your manuscript and one that requires significant strengthening. I understand that the focus of your study is on technology development and demonstrating the capabilities of the new newly developed DEEP-SEAM v1.0 tool. However, presenting a stronger more comprehensive targeting model would help to better position your manuscript and tool as well as generate greater impact and interest.
  Response: We have substantially revised and expanded this section by incorporating additional details on the regional geological context. We added Lines 144-168: “ Many of the Mount Painter region granites are extremely enriched in heat producing elements (U, Th, K), making this region notable for the significant heat generated by radiogenic decay of these elements. These High Heat Producing Granites (HHPG) serve as heat sources for hydrothermal circulation and have initiated convective sub-surface fluid migration, establishing a region of high geothermal gradient metamorphism (Kovacs, 2005; Neumann et al., 2000; Sandiford et al., 2002). The region has experienced a prolonged history of granitic intrusions and consequential localized radiogenic heating, spanning from the early Mesoproterozoic through the Delamerian Orogeny to the Late Palaeozoic. The Mesoproterozoic appears to have been particularly favourable for REE mineralisation, primarily attributed to: the generation of REE-enriched magmas and relatively stable tectonic environments that enhanced the preservation potential of formed deposits. The close spatial association and genetic relationship between Mesoproterozoic stratigraphic units and known mineral occurrences further substantiate the critical significance of this geological period for regional REE mineralisation. This extended geological history has led to multiple phases and pulses of magmatic and hydrothermal activity. Mineralisation processes have been driven by two primary mechanisms: (1) initial stages involving the introduction of felsic magmatism, including direct fluid activity and heat originating from magmatic processes, and (2) subsequent stages driven by ongoing radiogenic heat generation from the uranium- and thorium-rich granites and the associated mobilisation of hydrothermal fluids (Hoatson et al., 2011). Throughout these processes, structural controls played a crucial role in ore formation, with fracture systems serving as primary conduits for mineralising fluids and magmas. These coupled magmatic and radiogenic heating processes have initiated extensive sodic, potassic, and chloritic alteration of the granites, locally transforming initial lithologies into gneisses and schists. The sustained hydrothermal activity has facilitated the formation of diverse mineral deposit types, including hydrothermal vein-type, breccia-hosted, and skarn-type deposits containing Au, Cu, U, Sn, and REEs, with REE mineralisation being particularly associated with both the initial felsic magmatic events and the long-term radiogenic heating effects of the HHPG.”
  Comment 1.4 — Restructure and expand your Table 1 to include additional information relating to host rocks (types and ages), host structures and ore controls, mineralisation styles, ore and gangue mineralogy, alteration types and assemblages, mineralisation ages, grade/tonnage information, etc.
  Response：We have prepared an expanded Table 1 (please refer to the attached table) that incorporates the requested information.
  Comment 1.5 — The restructuring and expansion of Table 1 will go a long way to better demonstrate that the REE and U(-REE) and Cu(-REE) occurrences your work is based on form part of a coherent group of mineral deposit (i.e., part of a single mineral system). As a reader who is unfamiliar with these occurrences your manuscript does little to convince me of their similar origin. This is a critical issue as mixing mineral systems in MPM may give you garbage results.
  Response： We appreciate this important comment and have modified Table 1 as requested. However, we would like to respectfully clarify a fundamental aspect of our study design that addresses the reviewer's concern about mixing mineral systems. We do not propose that all REE, U(-REE), and Cu(-REE) occurrences originated from a single, unified mineral system. Rather, the study area represents a geologically complex terrain characterized by: (1) diverse sedimentary lithologies and granitic rocks; (2) multiple episodes of radiogenic heat-driven fluid circulation; (3) varying degrees of metamorphism and structural deformation; and (4) complex fluid migration and recycling processes. Given this geological complexity, REE mineralization events cannot be attributed to a single metallogenic episode at a specific time. Importantly, our proposed MPM framework—particularly the deviation network method—is specifically designed to accommodate such multi-stage, multi-source metallogenic settings. Unlike traditional approaches that may indeed produce "garbage results" when different mineral systems are mixed, the deviation network method can effectively identify and discriminate different types of anomalous behaviors under partial constraints from known anomaly information (as explained in the methodology section). This approach does not conflate different mineral systems; rather, it systematically decouples and recognizes the related yet distinct signatures (remote sensing, geochemical, and geophysical) associated with multi-stage, multi-source mineralization processes. The delineation results of our prospective mineralization areas and their corresponding interpretations demonstrate the efficacy and validity of the proposed framework in handling geologically complex, multi-system terrains.
  Regarding the data foundation: most of these 7 mineralization occurrences were targets obtained during mineral exploration in the 1970s. We have made considerable efforts to extract all available information from the limited historical exploration reports to provide the best foundation for understanding the relationship between mineralization characteristics and our input datasets. We acknowledge that such descriptions are not yet fully comprehensive, and we have expanded Table 1 to better present the available information. Further characterization of these occurrences remains an ongoing priority for future work.
  Comment 1.6 — Combine or link the improved Table 1 with Table 2 in a way that you clearly demonstrate which data map which expressions of the targeted mineral system.
  Response：Table 2 is discussed with additional evidence demonstrating how our adopted datasets effectively support mineral prospectivity mapping. We added (1) Lines 201-205 Magnetic responses to REE mineralisation can be highly variable and depend on the nature of host rocks. White (2005) interprets the Yerila Granite (Table 1) in the study area not as a true granite (although parts of the protolith may have been deformed granite) but as a metasomatic allanite-rich rock, where the highest REE enrichment (Th, U, Zr, Y, and REEs) occurs in calcsilicate rocks that exhibit relatively distinct magnetic signatures compared to surrounding lithologies. (2) Lines 229-236：A compelling example of this relationship is demonstrated at REE Location D (Fig. 1), which represents a Cu-U-REE mineral system associated with hydrothermal fluid flow during the Ordovician (Zivak, 2024). The mineralisation is hosted within multiply deformed, pyrite-rich metasedimentary and metavolcanic rocks, where REEs and uranium are highly concentrated, predominantly in monazite and, to a lesser extent, in apatite (which locally contains up to 11% total REEs). Critically, both the radioactive elements and REEs were co-transported by hydrothermal fluids sourced from granitoid rocks along the eastern extremity of the Paralana Fault (Marshall, 1979). This genetic link between U-Th-K and REE enrichment provides a robust basis for using radiometric data as a proxy for REE prospectivity.. (3) Lines 247-257：Remote sensing is an important method for mineral prospecting, utilising hyperspectral and multispectral data to detect hydrothermally altered minerals with diagnostic spectral absorption characteristics (Pour and Hashim, 2011). Metasomatism plays a crucial geochemical role in the concentration of REEs, U, and Th, often producing diagnostic mineralogical assemblages detectable through remote sensing (Khoshnoodi et al., 2016). For example, Location A (Fig. 1), defined by the Yerila Granite, demonstrates intense metasomatic enrichment that can be captured by remote sensing data. The Yerila Granite (~1560 Ma, Moolawatana Suite) is geochemically unique, with extremely high REE contents and exceptional radioactivity, making it possibly the most radioactive granite in Australia (White, 2005; Sheard, 2009). This enrichment is attributed to metasomatic processes that produced diagnostic REE-bearing minerals including allanite, monazite, yttrotitanite, and zircon, accompanied by K-feldspar alteration and fluorite mineralisation (White, 2005). These metasomatic assemblages and associated alteration minerals exhibit distinct spectral signatures in the visible-near infrared (VNIR) and shortwave infrared (SWIR) regions. (4) Lines 265- 274： For example, the high immediately to the west of REE Location G (Fig. 1), which is centred on Radium Ridge, demonstrates a clear relationship between topography and mineralisation. Radium Ridge is a prominent topographic feature extending approximately 3 km in an east-west direction, characterised by steep slopes on its southern flank and gentler gradients on the northern side (Sullivan et al., 1945). The steepest sections of the ridge result from differential weathering resistance of a silicified zone, which hosts small hydrothermal Fe-rich U and REE prospects along the ridge crest (Sullivan et al., 1945). This exemplifies how DEM data can capture the topographic expressions that result from structurally-controlled mineralisation and lithological contrasts caused by differential weathering and silicification processes, thereby serving as an indirect indicator of potential mineralised zones.
  Comment 1.7 — The study area is relatively small, in particular for an MPM study. Why did you not include the entire Mt Babbage and Mt Painter inliers, or at least the parts that are well exposed? That's not clear to me. Superimposing artificial, man-made boundaries on mineral systems should be avoided at all costs. At the very least, a clear explanation is required here as to the design of and rationale for the study area and its boundaries. Ideally, you should expand the study area to include the entire prospective tract. This should be easily possible given the great open-source data provided by the geological surveys of South Australia and New South Wales.
  Response: Thank you for your thoughtful comment regarding the scope of the study area. We acknowledge that the current study area is relatively limited in size, particularly for an MPM study. This was a deliberate design choice, driven primarily by the availability and quality of geochemical data, which are critical for accurately modeling REE mineralisation indicators. The constraint stems from the uneven distribution and sparsity of geochemical sampling points across the broader Mt Painter and Mt Babbage inliers. While open-source datasets offer excellent coverage for certain data types (e.g., geological maps and geophysical surveys), the geochemical datasets—essential for our DEEP-SEAM v1.0 tool's predictive modeling—present significant challenges. For instance:
  Stream sediment and soil geochemical data provide limited useful elements relevant to REE prospectivity.
  Rock geochemical datasets, while more comprehensive, suffer from partial element measurement gaps and highly irregular spatial distribution, with large tracts (please refer to the attached fig.1) lacking any sampling points altogether.
  To mitigate these issues without introducing undue bias or inflating prediction uncertainties, we implemented a geospatial interpolation approach: for each grid point, we extracted geochemical samples within a 1.5 km radius, computed a weighted average using the inverse square of the distance as weights, and substituted the median elemental value where no nearby samples were available. This method allowed us to generate reliable geochemical layers for the core study area. However, extending this approach to the entire inliers would have required extrapolating over vast unsampled regions, potentially leading to artifacts, increased model uncertainty, and reduced interpretability of the results. By confining the study area to regions with sufficient data density, we aimed to avoid superimposing unreliable predictions on the broader mineral system and to ensure the validity of our targeting model.
  Comment 1.8 — Here you briefly refer to a “conceptual mineralisation model”, which, however, is nowhere presented in the manuscript. This point links back to my above comments regarding the need for including a comprehensive targeting model that links mineralising processes to mappable ingredients of the REE-U-Cu system and the datasets available for mapping the system footprints. See paper by McCuaig et al. (2010) for more information (https://www.sciencedirect.com/science/article/pii/S0169136810000612)
  Response: In order to establish clear connections between mineralising processes → mineral system ingredients → mappable signatures → datasets, we have enhanced the "Geological Setting and REE Mineralisation" section with additional content, restructured Table 1, and strengthened the mineralisation-related justification for the datasets presented in Table 2.
  Comment 1.9 — How did you determine buffer zone sizes?
  Response: Thank you for your questions about buffer zone determination and the negative sampling issues you raised in the PDF. The buffer zone sizes in our study were determined through systematic optimization involving iterative model training and performance evaluation. We tested multiple buffer sizes for each application and selected optimal values based on model performance metrics. This approach ensures both reproducibility and optimal model performance. As noted in your PDF comments, we deliberately selected negative samples from regions distal to known mineralized zones (typically >5 km away) to enhance model discrimination. This strategy follows established practice, as demonstrated by Zuo and Carranza (2011), who showed that proximal locations often exhibit similar multivariate spatial signatures to deposit sites, potentially creating biased training data and reducing model sensitivity. Our approach also follows the "near-mine exploration" principle, which recognizes that mineralization probability increases near known occurrences, thus justifying the exclusion of ambiguous near-deposit areas from negative sample pools.
  Comment 1.10 — Parts of the Discussion (highlighted in the attached commented PDF) deal with geology and methodology. These parts should be presented in the relevant Geology and Methods sections.
  Response: Revised. These contents were moved to the appropriate positions.
  Comment 1.11 — Edge effects in Figure 10 (p. 29) must be discussed.
  Response: We have conducted a comprehensive examination of the spatial statistical feature extraction process within buffer zones. While most input layers demonstrate normal statistical behavior, we acknowledge that a small number of layers exhibit pronounced standard deviation anomalies at the study area boundaries, manifesting as edge effects. This phenomenon primarily stems from statistical bias introduced by buffer truncation. We discuss this issue from three perspectives: data characteristics, model mechanisms, and prediction outcomes. From a data characteristics perspective, evidence layers exhibiting high-frequency spatial variations experience significant statistical bias when truncated at buffer edges due to insufficient sample points, resulting in amplified standard deviations (for instance, the AlOH_Group_Composition_Standard_Deviation layer shown in Fig. 10 in manuscript). Conversely, evidence layers with regional distribution characteristics maintain statistical consistency at boundaries, as their gradual variations are less susceptible to local truncation effects (such as the AlOH_Group_Content_Standard_Deviation layer illustrated in the attached Fig. 2b). This disparity represents a classical manifestation of "edge effects" in spatial statistics, constrained by study area boundary conditions and fixed buffer radius settings. However, this issue is confined to a limited number of high-variability layers rather than being a systematic problem. Regarding model mechanisms, our framework employs the DevNet model based on multi-source exploration data for mineralization anomaly extraction. The model's core strengths lie in its multi-layer feature fusion strategy and anomaly learning mechanism, which enhance resistance to local noise including edge effects. DevNet establishes a robust "normality baseline" by learning the distribution patterns of extensive unlabeled samples and their differences from limited known mineralization samples in latent space. During training, the model is guided by prior mineralization information to learn anomaly patterns associated with mineralization rather than focusing on local anomalies and noise at boundaries, thereby effectively diluting the influence of edge effects in input data. In terms of prediction outcomes, both the prediction results and model interpretability analysis demonstrate that despite the significant presence of edge effects in certain evidence layers (such as AlOH_Group_Composition_Standard_Deviation) that play important roles in the model, the final mineral prospectivity map shows no apparent boundary artifacts such as edge-concentrated high values or anomalous gradients. This indirectly confirms that edge effects are progressively attenuated through multi-source data fusion and model learning processes, without compromising overall prediction accuracy and reliability. In summary, while edge effects objectively exist in a minority of input layers, their impact on final mineralization anomaly extraction reliability remains minimal due to the selective influence of data characteristics, DevNet's noise resistance mechanisms, and the stability of prediction results. We will consider extending buffer zones or implementing boundary correction algorithms for further optimization in future work.
  Comment 1.12 — Additional minor comments are provided in the attached PDF.
  Response: Thank you for your detailed review and the additional comments in the attached PDF. We have carefully addressed all suggested revisions in the revised manuscript. Regarding the additional questions outlined in the attachment.
  
  Citation: https://doi.org/10.5194/egusphere-2025-3283-AC3
CEC1:
'Comment on egusphere-2025-3283 - No compliance with the policy of the journal', Juan Antonio Añel, 11 Oct 2025

Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".

https://www.geoscientific-model-development.net/policies/code_and_data_policy.html
In your "Code and data availability" section (and in the corresponding methods section in the manuscript) you link sites to access the data that you use in your work, which are not suitable repositories according to our policy (for example the South Australian Resources Information Gateway). Therefore, the current situation with your manuscript is irregular. Please, publish all the data that you have used to produce your manuscript (input and output) in one of the appropriate repositories according to our policy, and reply to this comment with the relevant information (link and a permanent identifier for it (e.g. DOI)) as soon as possible, as we can not accept manuscripts in Discussions that do not comply with our policy.
Also, you must include a modified 'Code and Data Availability' section in a potentially reviewed manuscript, containing the information of the new repositories.
I must note that if you do not fix this problem, we cannot accept your manuscript for publication in our journal.
Juan A. Añel

Geosci. Model Dev. Executive Editor

Citation: https://doi.org/10.5194/egusphere-2025-3283-CEC1
- AC1: 'Reply on CEC1', Zijing Luo, 13 Oct 2025
  
  Dear Editor Añel,
  Thank you for your feedback regarding our manuscript's compliance with the Code and Data Policy.
  I would like to clarify that all input and output data necessary to reproduce our study have been deposited in Zenodo. The repository can be accessed at: https://doi.org/10.5281/zenodo.17098677
  
  The repository contains:
  
  1. Input_Data_Layers/ - All processed and clipped data layers used as model inputs (derived from the South Australian Resources Information Gateway)
  
  2. Output_Features_Generated/ - All model outputs and results
  
  3. Code/ - Complete code to reproduce the results
  While the original raw data were downloaded from the South Australian Resources Information Gateway (as cited in our manuscript), all processed input data specific to our study area (e.g., clipped and projected using ArcGIS), along with all model outputs, are available in the above repository. Users can directly run our code using the provided input data to reproduce our results without needing to access the original data sources.
  We acknowledge that our original "Code and Data Availability" section may not have clearly communicated this. We will revise this section in our manuscript to explicitly state the repository information and clarify the relationship between the original data sources and our processed datasets.
  Please let us know if any additional information or modifications are needed.
  Best regards,
  
  Zijing
  
  Citation: https://doi.org/10.5194/egusphere-2025-3283-AC1
RC2:
'Comment on egusphere-2025-3283', Anonymous Referee #2, 12 Oct 2025

This manuscript describes a generalisable method for using a variety of digital representations of geology to locate mineralisation. The application is to hydrothermal REE deposits in the Curnamona Province, Australia. The method uses deep learning for prediction to locate unknown mineralisation, and Shapley values to explain which input features contribute most to the predciction.
The submission is well-written and structured. The figures are a decent quality.
There are some issues that need rectifying before this work can be published.
Generalisability of the method. The authors can provide some discussion to how DEEP-SEAM can be used for other mineralisation types. This will help the manuscript fit within the scope of the journal.
Data reduction of geochemical assay. RPCA is used, and is a better choice than PCA. However, the results in Figure A1 tell me that RPCA is not that effective for data reduction. Six principal components are required to explain > 70% of the variance. I wonder whether a non-linear method such as UMAP may be a better option. UMAP can act like an auto-encoder.
Abundance of magnetic data derivatives versus other data. The number of magnetic layers used indicates the authors think that REE deposits are best imaged using magnetics, however this is not well-explained in the paper. What is the 2d correlation between these layers – i.e. are the same patterns presented in each, which can add bias?
Further, the justification of the various features with respect to REE mineralisation isn’t well established. Aside from the comment regarding magnetics, an example is using a DEM. More detail is needed on which “geological processes and environmental conditions influence mineralisation” are revealed by DEM. I’m assuming the DEM represents erosion and rock competency, but you may be thinking of others. This is important when justifying feature engineering (L258-259).
Each of the geophysical methods image different depths of the crust. This is partly due to the physics of the method itself, but mostly due to station spacing and line spacing. Wider line spacing means deeper minimum depths of imaging. Please explain this in context with the geophysical survey parameters.
The most interesting part of the study is finding areas on unknown mineralisation. However, there is limited analysis of the newly identified prospective areas. Examples are location ‘X’ (SW of E), the area extending SW of ‘E’ and a blob SE of ‘F’. Why would anyone go to these locations to acquire a tenement? Some geological analysis is needed.
The discussion needs to explore geological reasons how shap values align with geological understanding. Explore the mechanisms – what geological features contributing to mineralisation are magnetic? Why are others radioactive (e.g. lin469 to 472)? Are there any that are both magnetic and radioactive?
The results section opens with statements describing the interpretation of results for the results have been presented. E.g. “This study efficiently integrates multi-source exploration data and transforms them into a mineral prospectivity map utilising a semi-supervised DL model. Additionally, it provides interpretability for understanding the model’s prediction process”
and
“The DL-based mineral prospectivity model, DevNet, trained with optimised parameter configurations, effectively captures the complex mapping relationships between multidimensional features and mineralisation probability. The mineralisation probability is obtained by normalizing the anomaly scores output (the original value of the model output) by DevNet.”
The readers need to make up their minds by viewing the results first before the authors can make these statements (which are really discussion items).

I have minor comments listed below.
Intro
Not sure about anomaly detection being relevant to MPM. AD looks at a univariate signal, while MPM is a data fusion exercise. Anomalous prospectivity values - Just needs a clearer explanantion
Do occurrences, prospects and deposits have the same weight within your model?
Negative samples are drawn > 5km from known deposits. Establish that this is reasonable (all deposits are known? Then why do prospectivity analysis?)
L 439 Success rate high accuracy – against what benchmarks/other studies?
You use features which image the subsurface (magnetics and gravity) and those limited to the surface (DEM, ASTER and, at the scale of this study, radiometrics). It would be interesting to look at the SHAP values and see whether surface versus deeper signatures are more useful.
Figure 10. Std based features have border effects. Does this impact the model training and degrade results? Can these border effects be removed?

Citation: https://doi.org/10.5194/egusphere-2025-3283-RC2
- AC2: 'Reply on RC2', Zijing Luo, 16 Dec 2025
  
  Comment 2.1 — This manuscript describes a generalisable method for using a variety of digital representations of geology to locate mineralisation. The application is to hydrothermal REE deposits in the Curnamona Province, Australia. The method uses deep learning for prediction to locate unknown mineralisation, and Shapley values to explain which input features contribute most to the predciction. The submission is well-written and structured. The figures are a decent quality. There are some issues that need rectifying before this work can be published.
  Response: Thank you for your positive assessment of our manuscript. We appreciate your recognition of our novel approach and are grateful for your detailed comments. We have addressed each of them carefully in our revision to further improve the quality of our paper. Note: As advised not to upload the revised manuscript during the response process, please refer to the attached document for more detailed information regarding the specific revisions made in response to your comments.
  Comment 2.2 — Generalisability of the method. The authors can provide some discussion to how DEEP-SEAM can be used for other mineralisation types. This will help the manuscript fit within the scope of the journal.
  Response: We agree that discussing how DEEP-SEAM can be applied to other mineralisation systems would strengthen the manuscript. We have added a paragraph at the end of the Discussion and Conclusion section (Lines 651-670) that addresses the transferability of the DEEP-SEAM framework to different deposit types. Line649-668: The DEEP-SEAM framework is designed with potential generalisability and scalability, and its methodological principles are applicable to MPM across diverse mineralisation systems. The framework's ability to capture complex, non-linear relationships between multi-source geoscientific data and mineralisation probability establishes a theoretical foundation for its application to various deposit types. The potential transferability of DEEP-SEAM is built upon three key principles. First, the framework requires integration of deposit-specific evidence layers that reflect critical mineralisation processes, including geological proxies (e.g., structural features, lithological units, alteration zones), geochemical pathfinder elements, geophysical signatures, and remote sensing derivatives. The specific indicators employed vary with deposit type. For instance, orogenic Au systems commonly display Au-As-Sb associations; porphyry Cu deposits exhibit Cu-Mo-Au signatures; iron oxide-copper-gold (IOCG) systems are characterised by magnetic and gravity anomalies, whilst volcanic-associated massive sulphide (VMS) deposits display resistivity contrasts. The framework accommodates these variations through appropriate feature engineering and predictor variable selection. Second, the DL architecture within DEEP-SEAM can be adapted to different datasets through hyperparameter optimisation and structural adjustments, enabling the model to learn deposit-specific spatial patterns and feature interactions. This adaptive tuning process - involving adjustment of parameters such as network depth, learning rate, and dropout rate based on validation performance - represents standard practice in geoscientific machine learning applications where geological heterogeneity demands dataset-specific optimisation. Third, model interpretability through SHAP analysis provides a critical validation mechanism. By examining whether the model's predictive logic aligns with established metallogenic theory and the geological context of the target area, geoscientists can assess the reliability of prospectivity delineation. This interpretability component ensures that the framework operates not as a black box but as a geologically informed predictive tool applicable across mineralisation systems.
  Comment 2.3 — Data reduction of geochemical assay. RPCA is used, and is a better choice than PCA. However, the results in Figure A1 tell me that RPCA is not that effective for data reduction. Six principal components are required to explain > 70% of the variance. I wonder whether a non-linear method such as UMAP may be a better option. UMAP can act like an auto-encoder.
  Response: Thanks for suggesting non-linear methods. We conducted additional experiments by applying UMAP to the ILR-transformed geochemical data (Please see the attached Fig. 1). Indeed, UMAP is capable of capturing complex nonlinear patterns and has been successfully used for exploratory visualization in various fields. However, the main objective of this study is not only dimensionality reduction but also the development of an interpretable workflow for MPM. RPCA provides linear and robust principal components that can be directly linked to geochemical element loadings, enabling the identification of key controlling factors in geochemical variability and mineralization processes. In contrast, UMAP functions as a nonlinear autoencoder-like embedding, in which each dimension is not explicitly associated with the original geochemical variables, thereby limiting geological interpretability. Therefore, RPCA was retained in this study for its interpretability and robustness, even though it requires more components to capture a high cumulative variance.
  Comment 2.4 — Abundance of magnetic data derivatives versus other data. The number of magnetic layers used indicates the authors think that REE deposits are best imaged using magnetics, however this is not well-explained in the paper. What is the 2d correlation between these layers – i.e. are the same patterns presented in each, which can add bias?
  Response: To fully obtain data related to mineralization, we conducted a thorough search on the website for data that could provide such information, which included so much aeromagnetic data. Aeromagnetic surveying uses highly sensitive magnetometers carried by aircraft to measure the Earth's magnetic field on the surface, which can effectively detect magnetic minerals underground, including REE deposits rich in magnetic minerals. In our study area, the REE mineralization is genetically associated with igneous intrusions and hydrothermal alteration zones, both of which exhibit distinct magnetic signatures. Different magnetic derivatives (e.g., reduced-to-pole, vertical derivative, tilt angle) can highlight different aspects of these geological structures, such as boundary edges, depth information, and subtle anomalies, making them complementary for capturing mineralization-related features. We share the reviewer's concern about potential 2D correlation between magnetic data layers, which could introduce bias into the model. To address this issue, we implemented a systematic correlation analysis during the data preprocessing stage. The Spearman correlation method (Hauke and Kossowski, 2011) was used to quantify relationships between all feature layers. We tested multiple correlation thresholds and evaluated their impact on model performance, ultimately determining 0.65 as the optimal threshold (Lines 259-261). Features exceeding this correlation threshold were removed to minimize redundancy while preserving the diverse information content necessary for robust prediction. This approach ensures that the retained magnetic layers contribute complementary information rather than duplicating the same patterns, thereby reducing potential bias in the model.
  Comment 2.5 — justification of the various features with respect to REE mineralisation isn’t well established. Aside from the comment regarding magnetics, an example is using a DEM. More detail is needed on which “geological processes and environmental conditions influence mineralisation” are revealed by DEM. I’m assuming the DEM represents erosion and rock competency, but you may be thinking of others. This is important when justifying feature engineering (L258-259).
  Response: Table 2 is discussed with additional evidence demonstrating how our adopted datasets effectively support mineral prospectivity mapping. We added (1) Lines 201-205 Magnetic responses to REE mineralisation can be highly variable and depend on the nature of host rocks. White (2005) interprets the Yerila Granite (Table 1) in the study area not as a true granite (although parts of the protolith may have been deformed granite) but as a metasomatic allanite-rich rock, where the highest REE enrichment (Th, U, Zr, Y, and REEs) occurs in calcsilicate rocks that exhibit relatively distinct magnetic signatures compared to surrounding lithologies. (2) Lines 229-236：A compelling example of this relationship is demonstrated at REE Location D (Fig. 1), which represents a Cu-U-REE mineral system associated with hydrothermal fluid flow during the Ordovician (Zivak, 2024). The mineralisation is hosted within multiply deformed, pyrite-rich metasedimentary and metavolcanic rocks, where REEs and uranium are highly concentrated, predominantly in monazite and, to a lesser extent, in apatite (which locally contains up to 11% total REEs). Critically, both the radioactive elements and REEs were co-transported by hydrothermal fluids sourced from granitoid rocks along the eastern extremity of the Paralana Fault (Marshall, 1979). This genetic link between U-Th-K and REE enrichment provides a robust basis for using radiometric data as a proxy for REE prospectivity.. (3) Lines 247-257：Remote sensing is an important method for mineral prospecting, utilising hyperspectral and multispectral data to detect hydrothermally altered minerals with diagnostic spectral absorption characteristics (Pour and Hashim, 2011). Metasomatism plays a crucial geochemical role in the concentration of REEs, U, and Th, often producing diagnostic mineralogical assemblages detectable through remote sensing (Khoshnoodi et al., 2016). For example, Location A (Fig. 1), defined by the Yerila Granite, demonstrates intense metasomatic enrichment that can be captured by remote sensing data. The Yerila Granite (~1560 Ma, Moolawatana Suite) is geochemically unique, with extremely high REE contents and exceptional radioactivity, making it possibly the most radioactive granite in Australia (White, 2005; Sheard, 2009). This enrichment is attributed to metasomatic processes that produced diagnostic REE-bearing minerals including allanite, monazite, yttrotitanite, and zircon, accompanied by K-feldspar alteration and fluorite mineralisation (White, 2005). These metasomatic assemblages and associated alteration minerals exhibit distinct spectral signatures in the visible-near infrared (VNIR) and shortwave infrared (SWIR) regions. (4) Lines 265- 274： For example, the high immediately to the west of REE Location G (Fig. 1), which is centred on Radium Ridge, demonstrates a clear relationship between topography and mineralisation. Radium Ridge is a prominent topographic feature extending approximately 3 km in an east-west direction, characterised by steep slopes on its southern flank and gentler gradients on the northern side (Sullivan et al., 1945). The steepest sections of the ridge result from differential weathering resistance of a silicified zone, which hosts small hydrothermal Fe-rich U and REE prospects along the ridge crest (Sullivan et al., 1945). This exemplifies how DEM data can capture the topographic expressions that result from structurally-controlled mineralisation and lithological contrasts caused by differential weathering and silicification processes, thereby serving as an indirect indicator of potential mineralised zones.
  Comment 2.6 — Each of the geophysical methods image different depths of the crust. This is partly due to the physics of the method itself, but mostly due to station spacing and line spacing. Wider line spacing means deeper minimum depths of imaging. Please explain this in context with the geophysical survey parameters.
  Response: Thank you for this helpful comment. You're absolutely right that the depth each geophysical method images is influenced not only by the underlying physics of the method but also by the survey design. We have supplemented the manuscript with papers that provide more details of the data processing procedure. Taking the gravity data we used as an example to explain, this dataset is sourced from the South Australian Resources Information Gateway (SARIG), with detailed processing methods and original survey parameters documented in the official report from the Geological Survey of South Australia (Katona, 2017). Key points to note are: (1) Variability in spacing and effective resolution: The original station spacing is highly uneven, ranging from dense intervals of 50 metres to sparse ones of up to 50 kilometres (Katona, 2017, p. 30, Table 2). Whilst the final product is a regular 100-metre grid, the effective resolution is determined by line spacing rather than along-line station spacing (Katona, 2017, p. 39, Figure 2). For instance, even if stations along a line are spaced 100 metres apart, if the line spacing is 2 kilometres, the effective resolution in that area is limited to several hundred metres, capable only of reflecting larger-scale, deeper geological sources. (2) Quantification of survey parameters: Figure 33 in the report (Nominal Station Line Spacing map) quantifies and visualises the line spacing distribution across the entire state. Line spacing across our study area varies from 4,000 to 8,000 metres (please see the attached Fig. 2). The gravity data we employed consist of processed 100-metre grids (including Onshore Bouguer gravity anomaly, First vertical derivative, and Residual gravity), which complement information on deep large-scale and shallow small-scale anomalies. In response to your suggestion, we have added relevant references in the Methods section. Similar considerations apply to the magnetic and radiometric data used in this study.
  Comment 2.7 — The most interesting part of the study is finding areas on unknown mineralisation. However, there is limited analysis of the newly identified prospective areas. Examples are location ‘X’ (SW of E), the area extending SW of ‘E’ and a blob SE of ‘F’. Why would anyone go to these locations to acquire a tenement? Some geological analysis is needed.
  Response: We added Lines 488-507: Additionally, the high prospectivity areas identified by the proposed framework, excluding known mineral occurrences, are subjected to further analysis (Fig. 5). Based on comprehensive review of historical exploration company reports, location X comprises a metamorphosed volcano-sedimentary sequence with radioactive metasediments subdivided into (a) allanite-bearing calc-silicates and (b) allanite-biotite schists and granitic gneisses, likely derived from tuffaceous protoliths. Allanite serves as the principal REE- and U-bearing phase (Teale, 1981). Grab samples display anomalous concentrations of Ce (up to 4600 ppm), Th (3600 ppm), Nb (900 ppm), U (600 ppm), W (180 ppm), and Sn (170 ppm) (Brewer, 1980). The area extending southwest of location E, known as the Paralana Plateau, represents another prospective target successfully delineated by the proposed framework. Historical exploration by Exoil N.L. around 1970 identified multiple sub-vertical monazite-bearing schist lenses ranging from 7 to 17 m in thickness and 30 to 170 m in length (Exoil, 1970). The indicated resource exceeds 5 million tonnes of schist with monazite grades of 5-6%. As monazite is a principal REE-bearing mineral (with some containing 50-70% total rare earth oxides), this represents REE mineralisation potential. The anomaly southeast of location F is located on a silicified ridge of Mesoproterozoic rocks near the East Painter Uranium Workings, and this area is characterised by uranium mineralisation associated with anomalous REE and Cu concentrations (Teale, 1993; Robertson et al., 2006). The spatial and geological association suggests potential for REE mineralisation, warranting field investigation to validate the model prediction. Taken together, the aggregate performance metrics and the spatial analysis of the prospectivity maps suggest that the parameter-optimised DL model, DevNet, characterises the complex associations between multivariate features and mineralisation probability, thereby enabling the use of available geoscience data to estimate mineral potential in underexplored areas.
  Comment 2.8 — The discussion needs to explore geological reasons how shap values align with geological understanding. Explore the mechanisms – what geological features contributing to mineralisation are magnetic? Why are others radioactive (e.g. lin469 to 472)? Are there any that are both magnetic and radioactive?
  Response: We added Lines 550-559: The geological basis for this strong radiometric signature can be exemplified by Location C (Moolawatana), where a quartz-sericite schist exhibits locally high radioactivity as determined by scintillometer surveys. This schist contains anomalously high Ce (up to 1,400 ppm) and Th (with a Ce:Th ratio of approximately 1.7:1), indicating the presence of cerium-bearing allanite (McPhee et al., 1982). This mineral is likely the main contributor to the elevated prospectivity scores at this location. Although Ce was not among the elements used in lithogeochemistry (Table 2), the allanite and/or its host rock likely contain other listed elements (e.g., Th, U, Y), thereby providing anomalously high indicator signals for this location. This example demonstrates the direct geological linkage between radioactive minerals (allanite hosting Th and U) and REE enrichment, validating the use of radiometric data as a diagnostic tool for REE prospectivity.
  Comment 2.9 — The results section opens with statements describing the interpretation of results for the results have been presented. E.g. “This study efficiently integrates multi-source exploration data and transforms them into a mineral prospectivity map utilising a semi-supervised DL model. Additionally, it provides interpretability for understanding the model’s prediction process” And “The DL-based mineral prospectivity model, DevNet, trained with optimised parameter configurations, effectively captures the complex mapping relationships between multidimensional features and mineralisation probability. The mineralisation probability is obtained by normalizing the anomaly scores output (the original value of the model output) by DevNet.” The readers need to make up their minds by viewing the results first before the authors can make these statements (which are really discussion items).
  Response: Thank you for this valuable feedback. We have modified the description of the result section to distinguish it from the discussion section.
  Comment 2.10 — Not sure about anomaly detection being relevant to MPM. AD looks at a univariate signal, while MPM is a data fusion exercise. Anomalous prospectivity values - Just needs a clearer explanantion
  Response: We've moved this content from the Introduction to the Methods section and provided a clearer explanation: ”Semi-supervised learning represents an intermediate paradigm between supervised and unsupervised learning approaches, leveraging the combination of limited labelled data and abundant unlabelled samples to drive model training (Ruff et al., 2019). This methodology enables enhanced predictive accuracy while simultaneously reducing annotation costs. This paradigm is particularly well-suited for anomaly detection (AD) tasks: under conditions where anomalous samples are extremely scarce while normal samples are relatively abundant, semi-supervised anomaly detection aims to identify observations that significantly deviate from the joint distribution of normal data and construct decision boundaries capable of effectively distinguishing between anomalous and normal patterns (Ruff et al., 2019; Zhang et al., 2018). Studies have demonstrated that with as little as 1% labelled anomalies, appropriate semi-supervised methods can achieve effective performance (Pang et al., 2019). Mineral deposits can be conceptualised as rare “geological anomalies” formed through the coupling of multiple geological processes (Cheng and Zhao, 2011). Ore-forming processes are typically accompanied by a series of geological anomalous phenomena and products (e.g., structural, lithological, mineralogical, and fluid-related), which are manifested in multi-source datasets including geological, geophysical, geochemical, and remote sensing data (Zhao, 2002). MPM enables the extraction of mineralisation-related anomalous information from these multi-source exploration datasets, thereby providing target area guidance for subsequent exploration deployment. However, some early semi-supervised AD methods implicitly assume “anomaly homogeneity” (i.e., anomalous samples are mutually similar; Liu et al., 2002; 2003), which often fails to hold in complex geological settings characterised by diverse mineralisation types and varying ore-forming mechanisms. Pang et al. (2019) proposed a semi-supervised neural architecture, known as DevNet, which directly optimises anomaly scores through an end-to-end network. This method leverages a few labelled outliers as prior information to accommodate anomalies exhibiting different abnormal behaviours. The approach has demonstrated excellent performance in fields such as network security intrusion detection, gear pitting fault detection, and geochemical anomaly identification (Alper et al., 2023; Luo et al., 2024; Zheng et al., 2024).”
  Comment 2.11 — Do occurrences, prospects and deposits have the same weight within your model?
  Response: Yes, occurrences, prospects, and deposits are treated with equal weight in our model. All these mineralization points are assigned the same positive label (value = 1) during the training process. By allowing the model to learn from the patterns of all positive samples, we enable it to capture the full spectrum of geological signatures associated with REE mineralization systems, from small-scale occurrences to large deposits.
  Comment 2.12 — Negative samples are drawn > 5km from known deposits. Establish that this is reasonable (all deposits are known? Then why do prospectivity analysis?)
  Response: Our choice of >5km distance for negative sampling follows established practices in mineral prospectivity modeling papers (Carranza et al. 2008; Zuo and Carranza, 2011; Nykänen et al. 2015). The 5km buffer ensures that negative samples are collected from areas unlikely to be influenced by known mineralization processes, while still maintaining sufficient spatial coverage for model training. Regarding the apparent contradiction between "known deposits" and prospectivity analysis, in our modeling approach, only a subset of known deposits was used for training purposes, while the remaining deposits were reserved for independent validation to assess model performance. More importantly, the primary objective of our prospectivity analysis is not to rediscover known deposits, but to identify previously unrecognized prospective areas that may host undiscovered REE mineralization. The model learns from the geological signatures associated with known mineralization and extrapolates these patterns to identify areas with similar characteristics that warrant further exploration attention.
  Comment 2.13 — L439 Success rate high accuracy – against what benchmarks/other studies?
  Response: In our manuscript, the assessment of “high accuracy” is primarily based on a comparative analysis with previous deep learning–based mineral prospectivity studies that used similar evaluation metrics, in particular the proportion of known deposits captured within the top-ranked prospective areas (i.e., the success-rate curve). For example, Luo et al. (2023) reported that the success-rate curve of their metallogenic-factor VAE model showed that less than 60% of the known mineral deposits were located within the top 10% most anomalous areas predicted by the model. In a subsequent study, Luo et al. (2024) showed that the VAE-CAPSNET-GAN model achieved a success rate in which 10% of the study area contained 64% of the known mineral deposits. In contrast, our DEEP-SEAM framework achieves a considerably higher concentration of known deposits in the highest-ranked prospective areas: the success-rate curve indicates that the top 10% of the study area identified by DEEP-SEAM contains 86% of the known mineral deposits. Since these comparative results with other models are not explicitly listed in the manuscript, we have revised the relevant sentence in the text to read: Success-rate analysis indicates that the top 2% of the study area contains 86% of the known mineral deposits, and 30% of the area delineates all the REE deposits (Fig. 6), demonstrating that DevNet can effectively reduce the search space for high-potential metallogenic targets.
  Comment 2.14 — You use features which image the subsurface (magnetics and gravity) and those limited to the surface (DEM, ASTER and, at the scale of this study, radiometrics). It would be interesting to look at the SHAP values and see whether surface versus deeper signatures are more useful.
  Response: Thank you for this insightful suggestion. We have incorporated this analysis into the discussion section (Lines 599-609). The added text reads “ Furthermore, we employed SHAP values to comprehensively compare the relative importance of features from surface-indicative datasets (DEM, remote sensing, and radiometric data) and depth-indicative datasets (magnetic and gravity data) in producing high mineralisation probability scores (Fig. 9). The analysis reveals that the top four features contributing most significantly to the model's output anomaly scores are all derived from surface datasets, ranked as follows: Terrestrial_Radiation_Dose_Mean, Geochemical_PC1, Ferric_Oxide_Content_Dissimilarity, and ALOH_Group_Composition_Standard_Deviation. This result demonstrates that although subsurface geophysical data (magnetic and gravity) effectively delineate structural frameworks and identify favourable host rocks, the surface expression of REE enrichment exhibits more prominent signatures. Radiometric data directly detect radioactive elements (e.g., Th and U) associated with REE enrichment, providing direct indicators of REE concentration. Remote sensing derivative indices effectively identify hydrothermal alteration minerals indicative of REE mineralisation. Geochemical data directly reflect the spatial distribution and enrichment-depletion patterns of REEs.”
  
  Comment 2.15 — Figure 10.Std based features have border effects. Does this impact the model training and degrade results? Can these border effects be removed?
  Response: We have conducted a comprehensive examination of the spatial statistical feature extraction process within buffer zones. While most input layers demonstrate normal statistical behavior, we acknowledge that a small number of layers exhibit pronounced standard deviation anomalies at the study area boundaries, manifesting as edge effects. This phenomenon primarily stems from statistical bias introduced by buffer truncation. We discuss this issue from three perspectives: data characteristics, model mechanisms, and prediction outcomes. From a data characteristics perspective, evidence layers exhibiting high-frequency spatial variations experience significant statistical bias when truncated at buffer edges due to insufficient sample points, resulting in amplified standard deviations (for instance, the AlOH_Group_Composition_Standard_Deviation layer shown in Figure 10 in manuscript). Conversely, evidence layers with regional distribution characteristics maintain statistical consistency at boundaries, as their gradual variations are less susceptible to local truncation effects (such as AlOH_Group_Content_Standard_Deviation layer illustrated in attached Fig. 3b). This disparity represents a classical manifestation of "edge effects" in spatial statistics, constrained by study area boundary conditions and fixed buffer radius settings. However, this issue is confined to a limited number of high-variability layers rather than being a systematic problem. Regarding model mechanisms, our framework employs the DevNet model based on multi-source exploration data for mineralization anomaly extraction. The model's core strengths lie in its multi-layer feature fusion strategy and anomaly learning mechanism, which enhance resistance to local noise including edge effects. DevNet establishes a robust "normality baseline" by learning the distribution patterns of extensive unlabeled samples and their differences from limited known mineralization samples in latent space. During training, the model is guided by prior mineralization information to learn anomaly patterns associated with mineralization rather than focusing on local anomalies and noise at boundaries, thereby effectively diluting the influence of edge effects in input data. In terms of prediction outcomes, both the prediction results and model interpretability analysis demonstrate that despite the significant presence of edge effects in certain evidence layers (such as AlOH_Group_Composition_Standard_Deviation) that play important roles in the model, the final mineral prospectivity map shows no apparent boundary artifacts such as edge-concentrated high values or anomalous gradients. This indirectly confirms that edge effects are progressively attenuated through multi-source data fusion and model learning processes, without compromising overall prediction accuracy and reliability. In summary, while edge effects objectively exist in a minority of input layers, their impact on final mineralization anomaly extraction reliability remains minimal due to the selective influence of data characteristics, DevNet's noise resistance mechanisms, and the stability of prediction results. We will consider extending buffer zones or implementing boundary correction algorithms (such as mirror padding) for further optimization in future work.
  
  Citation: https://doi.org/10.5194/egusphere-2025-3283-AC2

Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, and R. Dietmar Müller

Data sets

An explainable semi-supervised deep learning framework for mineral prospectivity mapping: DEEP-SEAM v1.0 Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, R. Dietmar Müller https://doi.org/10.5281/zenodo.17098677

Model code and software

Interactive computing environment

Zijing Luo, Ehsan Farahbakhsh, Stephen Hore, and R. Dietmar Müller

Viewed

Total article views: 1,678 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
1,505	137	36	1,678	18	19

HTML: 1,505
PDF: 137
XML: 36
Total: 1,678
BibTeX: 18
EndNote: 19

Views and downloads (calculated since 15 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	904	21	4	929
Oct 2025	153	24	12	189
Nov 2025	97	20	6	123
Dec 2025	89	27	12	128
Jan 2026	157	27	0	184
Feb 2026	105	18	2	125

Cumulative views and downloads (calculated since 15 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	904	21	4	929
Oct 2025	153	24	12	189
Nov 2025	97	20	6	123
Dec 2025	89	27	12	128
Jan 2026	157	27	0	184
Feb 2026	105	18	2	125

Viewed (geographical distribution)

Total article views: 1,677 (including HTML, PDF, and XML) Thereof 1,677 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 28 Feb 2026

Short summary

By combining multi-source data with advanced processing techniques, our deep learning model effectively identifies mineralisation patterns despite extremely limited deposit samples, analyses data and validates the geological relevance of its decisions through explanatory analysis, providing a universally reliable solution for artificial intelligence-assisted mineral prospectivity mapping.


Total:	0
HTML:	0
PDF:	0
XML:	0