the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Recognizing geochemical spatial patterns using deformable convolutional networks guided with geological knowledge
Abstract. This study tackles the limited quantification of irregular spatial geochemical patterns and weak interpretability in deep learning models in geochemical anomaly recognition. We propose a hybrid approach that that integrates geological knowledge (GK) into deformable convolutional networks (DCN), creating a model termed GK_DCN, with the aim of enhancing both the performance and transparency of geochemical anomaly recognition. This model introduces learnable parameters that allow the convolutional kernels to adaptively adjust their sampling locations, enabling them to more accurately capture complex, irregular geochemical anomaly patterns caused by mineralization. To enhance geological consistency, ore-controlling fault are incorporated as geological knowledge constraints, guiding the network to prioritize spatial correlations between deposits and faults. Experimental results in southern Tianshan Au-Cu polymetallic ore district demonstrate that the GK_DCN significantly enhances the accuracy and reliability of geochemical anomaly recognition verified across multiple evaluation metrics, producing more distinct spatial anomalous patterns and higher consistency with known mineral deposits by adaptively adjusting the receptive field. Visualization of the kernel offsets revealed the model's superior adaptive spatial sampling mechanism. Furthermore, using Grad-CAM to generate feature significance heatmaps highlighted the key features the model focused on during geochemical anomaly recognition, significantly improving interpretability and proving effectiveness in capturing complex geochemical patterns. This work provides an effective intelligent method for geochemical pattern recognition and offers a reference for interpretable deep learning in geochemical exploration through multi-angle visualization.
- Preprint
(3601 KB) - Metadata XML
- BibTeX
- EndNote
Status: final response (author comments only)
-
RC1: 'Comment on egusphere-2025-4877', Anonymous Referee #1, 21 Nov 2025
- AC2: 'Reply on RC1', Yihui Xiong, 03 Feb 2026
-
CEC1: 'Comment on egusphere-2025-4877 - No compliance with the policy of the journal', Juan Antonio Añel, 07 Dec 2025
Dear authors,
Unfortunately, after checking your manuscript, it has come to our attention that it does not comply with our "Code and Data Policy".
https://www.geoscientific-model-development.net/policies/code_and_data_policy.htmlIn your the Zenodo repository that you provide for the code, you have failed to include the input data necessary to train the models, and the resulting output data. Moreover, for part of the data you cite another published paper, which is not acceptable. Given this your manuscript should have never been accepted for Discussions. Our policy clearly states that all the data necessary to replicate a manuscript must be published openly and freely to anyone before submission.
Therefore, we are granting you a short time to solve this situation. You have to reply to this comment in a prompt manner with the information for the repositories containing all the data that you use to produce and necessary to replicate your manuscript. The reply must include the link and permanent identifier (e.g. DOI). Also, any future version of your manuscript must include the modified section with the new information.
I must note that if you do not fix this problem, we cannot continue with the peer-review process or accept your manuscript for publication in our journal.
Juan A. Añel
Geosci. Model Dev. Executive EditorCitation: https://doi.org/10.5194/egusphere-2025-4877-CEC1 -
AC1: 'Reply on CEC1', Yihui Xiong, 03 Feb 2026
Dear Dr. Juan A. Añel,
Thank you for your guidance regarding our manuscript's compliance with the Code and Data Policy.
We have now updated the manuscript to permanently archive both the code and data in Zenodo, as reflected in the revised 'Code and Data Availability' sections:
Code and Data Availability: https://doi.org/10.5281/zenodo.18454129
We believe the manuscript now fully complies with the journal's policy.
Sincerely,
Xinyu Zhang, Yihui Xiong
Citation: https://doi.org/10.5194/egusphere-2025-4877-AC1
-
AC1: 'Reply on CEC1', Yihui Xiong, 03 Feb 2026
-
RC2: 'Comment on egusphere-2025-4877', Anonymous Referee #2, 26 Jan 2026
This contribution proposes a hybrid approach that integrates geological knowledge (specifically ore-controlling faults) into Deformable Convolutional Networks (GK_DCN) to recognize anisotropic geochemical anomalies. The authors carry out a case study in the Southern Tianshan Au-Cu polymetallic ore district to investigate the model's ability to capture complex anomaly patterns and enhance interpretability through visualization techniques. This study offers a solid workflow for coupling data-driven and knowledge-driven approaches, providing potential references for mineral prospectivity mapping in structurally complex terrains. The methodology and argumentation of this paper are comparatively reliable, however, the robustness of the experimental design requires further strengthening. I think it will be of general interest to readers of EGUsphere after a major revision.
The main issues are as follows:
- Considering the training set is notably small (only 134 samples), it is likely insufficient to support the generalization capability of a deep learning model with high-dimensional features (39 channels). Although the authors introduced geological constraints for regularization and performed multi-metric evaluations, the risk of overfitting has not been entirely mitigated. I strongly recommend implementing k-fold cross-validation to strengthen the robustness of the experimental results. Furthermore, regarding the negative sampling strategy, the authors state that samples were randomly selected from barren regions. However, the absence of discovered deposits does not equate to the absence of mineralization. Random selection risks inadvertently labeling undiscovered concealed deposits as negative, thereby introducing label noise. I suggest establishing a buffer zone around known deposits to exclude potential mineralization areas when selecting negative samples.
- The authors incorporate a “distance to faults” rule directly into the loss function to constrain model training. The significant performance improvement observed confirms that fault structures provide critical information. However, this raises an important question: the study currently only addresses the influence of known (mapped) faults. How do concealed or blind faults affect the model's judgment? Since the loss function relies on the distance to known faults, there is a risk that the model might suppress valid anomalies controlled by unmapped blind faults. In other words, if fault information is incomplete, what is the extent of the potential bias in the results? I believe that discussing these limitations and the potential influence of hidden structures would significantly enhance the generalizability and practical value of the proposed method.
Specific comments:
Line 8: Delete the duplicate “that”.
Line 13: replace “fault” by “faults”.
Line 27: replace “movement” by “migration”.
Line 29: replace “ore materials” by “economic minerals”.
Line 242: replace “points” by “deposits”.
Line 257: The specific value of λ in Equation 9 is not provided. Since this parameter balances the data loss and the geological constraint, it is critical for reproducibility. How do different values of λ affect the model's performance?
Lines 311-316: IDW is an isotropic smoothing interpolation method. This approach may artificially weaken the anisotropic features of the data before it is even input into the model. Given that the core of this study is to utilize DCN to capture irregular and anisotropic patterns, why was the isotropic IDW method chosen for preprocessing instead of Kriging or other methods capable of preserving directionality?
Lines 317-322: The positive samples are extracted as 9*9 grid patches centered on known deposits. If the distance between two deposits is less than the patch dimensions, their corresponding samples will spatially overlap. If one of these overlapping samples is assigned to the training set and the other to the validation set, the model may achieve artificially inflated accuracy by simply memorizing local background features rather than learning generalizable metallogenic rules. Did the authors implement any de-duplication steps or use a spatially blocked split (e.g., splitting by geographic region) to prevent this data leakage? Please clarify the splitting strategy.
Line 344: replace “perspective” by “prospectivity”.
Line 372: There appears to be a contradiction between the text and the figures regarding the PC2 scores. The text states that low PC2 values correspond to the Au-Cu mineralization assemblage. However, the legend in Figure 11 explicitly states that yellow (high values) represents high concentration, and Figure 9b clearly shows that most Au-Cu deposits are located in areas with high PC2 scores.
Line 469: replace “anomalous” by “anomalies”.
Citation: https://doi.org/10.5194/egusphere-2025-4877-RC2 - AC3: 'Reply on RC2', Yihui Xiong, 03 Feb 2026
Viewed
| HTML | XML | Total | BibTeX | EndNote | |
|---|---|---|---|---|---|
| 248 | 345 | 28 | 621 | 23 | 25 |
- HTML: 248
- PDF: 345
- XML: 28
- Total: 621
- BibTeX: 23
- EndNote: 25
Viewed (geographical distribution)
| Country | # | Views | % |
|---|
| Total: | 0 |
| HTML: | 0 |
| PDF: | 0 |
| XML: | 0 |
- 1
The manuscript provides a new valuable method for recognizing geochemical anomalies from high-dimensional geochemical survey datasets. It can be considered for publication after moderate revisions. All the suggestions have been marked on the attached pdf file. The suggestions include the following aspects:
1. Please further polish the English of the manuscript.
2. The training patches are only 134. It is too few for training the deep learning models used in geochemical anomaly recognition. This needs to furtehr explanations.
3. The limitations of the new method should be discussed in the Results and Discussion Section.