A novel explainable deep learning framework for reconstructing South Asian palaeomonsoons

Hunt, Kieran M. R.; Harrison, Sandy P.

doi:10.5194/egusphere-2024-2128

Preprints

https://doi.org/10.5194/egusphere-2024-2128

Preprints

09 Aug 2024

| 09 Aug 2024

A novel explainable deep learning framework for reconstructing South Asian palaeomonsoons

Kieran M. R. Hunt and Sandy P. Harrison

Abstract. We present novel explainable deep learning techniques for reconstructing South Asian palaeomonsoon rainfall over the last 500 years, leveraging long instrumental precipitation records and palaeoenvironmental datasets from South and East Asia to build two types of model: dense neural networks ('timeline models') and convolutional neural networks (CNNs). The timeline models are trained individually on seven regional rainfall datasets and while they capture decadal-scale variability and significant droughts, they underestimate interannual variability. The CNNs, designed to account for spatial relationships in both predictor and target, demonstrate higher skill in reconstructing rainfall patterns and produce robust spatiotemporal reconstructions. The 19th and 20th centuries were characterised by marked inter-annual variability in the monsoon, but earlier periods were characterised by more decadal- to centennial-scale oscillations. Multidecadal droughts occurred in the mid-seventeenth and nineteenth centuries, while much of the eighteenth century (particularly the early part of the century) was characterised by above-average monsoon precipitation. Extreme droughts tend to be concentrated in south and west India and often coincide with recorded famines. By applying explainability techniques, we show that the models make use of both local hydroclimate and synoptic-scale dynamical relationships. Our findings offer insights into the historical variability of the Indian summer monsoon and highlight the potential of deep learning techniques in palaeoclimate reconstruction.

Received: 09 Jul 2024 – Discussion started: 09 Aug 2024

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 10764 KB)

Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
Preprint (10764 KB)

Download & links

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Journal article(s) based on this preprint

07 Jan 2025

A novel explainable deep learning framework for reconstructing South Asian palaeomonsoons

Kieran M. R. Hunt and Sandy P. Harrison

Clim. Past, 21, 1–26, https://doi.org/10.5194/cp-21-1-2025,https://doi.org/10.5194/cp-21-1-2025, 2025

Short summary

Kieran M. R. Hunt and Sandy P. Harrison

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-2128', Anonymous Referee #1, 30 Aug 2024
The authors employed deep learning models to reconstruct paleomonsoon rainfall over India, using a range of paleoclimate records and two distinct approaches: one generating spatial map time series and the other producing regional time series. They implemented Convolutional Neural Networks for the spatial maps timeseries and multilayer perceptron models for the regional time series. The authors also incorporated an ensemble modelling strategy to enhance the robustness of their predictions. Although the predictive performance was not particularly strong, they effectively used the available dataset to perform data mining, and extracted as much information as possible, and I find that there methodological approach is pretty novel. Additionally, they applied explainable machine learning techniques to identify key predictors (paleoclimate records) across different locations, and supported their findings with physical knowledge.
The methods and results were well presented and thoroughly explained, with the authors fairly acknowledging the strengths and weaknesses of their findings and providing a well-rounded discussion. The datasets produced in the study have also been made available.
I recommend publishing this work, but I have a list of minor comments that could further improve it.
Specific comments
When using the term "timeline models", readers might expect an architecture designed to handle time dependencies, such as recurrent neural networks or transformers. However, your MLP model does not inherently handle sequential or time-dependent data. I suggest using "regional models" to refer to MLPs, as you are applying them to predict region-based time series, and "spatial models" for the CNN models that produce spatial maps.

Line 33: ENSO acronym not defined

Line 35: ITCZ acronym not defined, same for PDO in Line 59 and SST in Line 99

Line 84: ‘Machine learning approaches have been used in palaeoclimate research for automated palaeoenvironmental record generation, model post-processing’. Could you clarify what specific model is being referred to? Same applies to ‘Machine learning has not been as widely used for model post-processing’ in Line 93. Please specify which models you are discussing to enhance clarity.

Line 98: There’s a typo in byMalmgren

Line 99 and elsewhere: The term ‘simple’ MLP is not commonly used. Consider using a more standard term such as ‘basic MLP’ or just ‘MLP’

L100: There is an extra ‘to’

Line 101: The description ‘two-layer MLP’ includes unnecessary detail. Consider simplifying it to just ‘MLP’

L116: Instead of ‘to stabilize model training’ it would be better to say: ‘optimize the performance of the model’

Figure 1: Are there blue contours present, or do you mean blue shades?

Section 2.1.4: This technical detail might be more suitable for the supplementary materials. Additionally, in point 6, please clarify the difference between the two CSV files.

Line 162: In ‘After this filtering, there were 157 datasets that could be used to train the models. (Fig. 1),’ please specify that these are the predictors of the models for clarity.

Line 203: how many years/samples were left for training?

Line 204: These are not ‘Independent models’ since they share training data. It would be better to say ‘separate models’ or ‘distinct models’ instead.

Section 3.2 Regularisation: Instead of writing this as a separate section, consider combining it with the loss function section and include the equation of the loss function with the regularisation term.

In Table 2, for clarity, update the caption to specify that L1 and L2 represent the hidden layers, and indicate the number of neurons in the input layer

Line 244: Was the choice of alpha arbitrary, or was it determined through trial and error?

Lines 255-258: The ‘Data Preparation’ section is unnecessary. Instead, you could include details on how the predictors were standardised or normalised, as this information is currently missing.

Figure 2: Where is the monsoon core zone?

Lines 285-290: Instead of detailing how CNNs are typically used as encoders, focus on explaining your specific approach; starting with dense layers and then using CNN as a decoder, as mentioned in Line 290.

Line 325: The term ‘distribution mean’ may not be accurate. It would be clearer to say ‘the mean of all predictions.’

In the caption of Figure 3: ‘Observed values taken from the reconstructed time series in Sontakke et al. (2008) are given in green’. Do you mean the observed values are shown to the right of the green line?

Line 367: The term ‘pattern correlation’ is ambiguous as it could refer to either ‘temporal’ or ‘spatial’ correlation. Please specify that this refers to ‘spatial’ correlation for clarity.

Figure 5 caption: Please revise the sentence ‘the prediction from with the PCC is computed is taken’ for clarity. Additionally, ensure consistency between ‘CNN-ERA5’ in the caption and ‘ERA5-CNN’ mentioned in Line 385.

Figure 7 caption: There is no dotted black line as mentioned; instead, there is a single dotted green line. Please correct the caption to reflect that it is one dotted green line, not ‘green lines’.

Line 455: Are you referring to low ‘temporal’ resolution?

Figure 8: How were the Shapley values standardised? Also, did you take the absolute values of the Shapley values before averaging them? Please add these details for clarity.
Citation: https://doi.org/10.5194/egusphere-2024-2128-RC1
- AC1: 'Reply on RC1', Kieran Hunt, 23 Sep 2024
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2024/egusphere-2024-2128/egusphere-2024-2128-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2024-2128-AC1
RC2:
'Comment on egusphere-2024-2128', Nick Scroxton, 10 Sep 2024
General Comments
In their manuscript Hunt and Harrison provide a novel approach to reconstructing historical monsoon variability: using machine learning to assess the relationship between paleoclimate records as predictors and the IMD gridded rainfall dataset between 1901-present as the target, and then extending that record back through time using annually resolved paleoclimate records from tree-rings, speleothems, glaciers etc. The results are interesting and appear to provide a significant step forwards monsoon predictability, comparable or slightly better than previous techniques in developing time series at point locations or areal means, but a significant advance in reconstructing spatial heterogeneity of monsoonal variability. The section on assessing the importance of individual paleoclimate records in the model, and the ‘sphere of influence’ of individual sites is an exciting development and appears to be backed by our understanding of monsoon dynamics.
The manuscript is well written, and the language is largely appropriate but could be simplified in places to the intended audience of Climate of The Past who likely have a more limited understanding of machine learning techniques. This includes more high-level introduction to methods, and care explaining predictors, targets, training datasets, validation datasets, test datasets etc. There is also sometimes a disconnect between what is labelled on the figures, the figure captions and what the figures refer to. Consistency and clarity are needed in places.
I do not have any significant concerns over the science and resulting inferences presented in this manuscript, though I will happily defer to more qualified machine learning reviewers on the robustness of the methods. I believe that this paper is suitable for publication in Climate in the Past, after correction for some minor issues.
Thank-you
Nick Scroxton
Maynooth University
Specific Comments
Could you explain LiPDs in either section 2.1.1 or 2.1.2 before you get to line 148.

Section 3: A high-level couple of sentences at the start of the methods would help non-machine learning experts. For example, it doesn’t actually say in the manuscript what is your predictor data-set is and what is the target data-set.

What’s the difference between validation and test datasets in Figure 2 and section 3.5 1a. What do they do and why?

Figure 3: This figure does not show clearly the information that is attributed to it. There are significant mismatches between the figure and figure caption. I don’t see the ensemble median in black, the spread in red, or the Sontakke time series in green. I think this figure should be expanded to take-up more space on the page to really highlight the key results attributed to it.

Line 380-383: The dismissal of the PCA technique is too strong at this stage of the manuscript. At this point the reader has only been introduced to figure 4. The PCA outperforms the CNN model 40% of the time (2 out of 5 years) in figure 4 so cannot be dismissed, although an argument can be made that it lacks spatial heterogeneity. Once we have seen figure 5 and the larger dataset, we see that outperformance of the PCA method in figure 4 is likely just an artifact of small sample size, and therefore the dismissal is more reasonable. This section therefore might need to be reworded

I wonder if dry years also correspond to major volcanic eruptions (as we might expect). This might provide an additional test of reconstruction performance that is less reliant on additional human societal complexities.

I disagree with some of your speleothem inferences. In section 4.3 If the EPI speleothem record is d18O then we might expect a better correlation with WPI rainfall than EPI, and thus this example belongs in the following paragraph instead. I don’t see the relevance of the river basins argument here on speleothem proxy variability.

Technical Corrections
Line 196-198: Could you unreversed the first half of this sentence for clarity

Figure 2 caption: ‘one of the timeline model’

Line 354: New paragraph?

Line 364: 1988?

Line 380: New paragraph?

Line 528, 542: The cave name is spelt ‘Mawmluh’ or ‘Krem Mawmluh” if cave needs to be included.

Line 530: ‘documented’?

Line 531: ‘subcontinent’ to be consistent throughout the manuscript.

Line 561: clarify what you mean by wavelength or use less technical language.

Line 578: delete ‘have’
Citation: https://doi.org/10.5194/egusphere-2024-2128-RC2
- AC2: 'Reply on RC2', Kieran Hunt, 23 Sep 2024
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2024/egusphere-2024-2128/egusphere-2024-2128-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2024-2128-AC2

Interactive discussion

Status: closed

RC1:
'Comment on egusphere-2024-2128', Anonymous Referee #1, 30 Aug 2024
The authors employed deep learning models to reconstruct paleomonsoon rainfall over India, using a range of paleoclimate records and two distinct approaches: one generating spatial map time series and the other producing regional time series. They implemented Convolutional Neural Networks for the spatial maps timeseries and multilayer perceptron models for the regional time series. The authors also incorporated an ensemble modelling strategy to enhance the robustness of their predictions. Although the predictive performance was not particularly strong, they effectively used the available dataset to perform data mining, and extracted as much information as possible, and I find that there methodological approach is pretty novel. Additionally, they applied explainable machine learning techniques to identify key predictors (paleoclimate records) across different locations, and supported their findings with physical knowledge.
The methods and results were well presented and thoroughly explained, with the authors fairly acknowledging the strengths and weaknesses of their findings and providing a well-rounded discussion. The datasets produced in the study have also been made available.
I recommend publishing this work, but I have a list of minor comments that could further improve it.
Specific comments
When using the term "timeline models", readers might expect an architecture designed to handle time dependencies, such as recurrent neural networks or transformers. However, your MLP model does not inherently handle sequential or time-dependent data. I suggest using "regional models" to refer to MLPs, as you are applying them to predict region-based time series, and "spatial models" for the CNN models that produce spatial maps.

Line 33: ENSO acronym not defined

Line 35: ITCZ acronym not defined, same for PDO in Line 59 and SST in Line 99

Line 84: ‘Machine learning approaches have been used in palaeoclimate research for automated palaeoenvironmental record generation, model post-processing’. Could you clarify what specific model is being referred to? Same applies to ‘Machine learning has not been as widely used for model post-processing’ in Line 93. Please specify which models you are discussing to enhance clarity.

Line 98: There’s a typo in byMalmgren

Line 99 and elsewhere: The term ‘simple’ MLP is not commonly used. Consider using a more standard term such as ‘basic MLP’ or just ‘MLP’

L100: There is an extra ‘to’

Line 101: The description ‘two-layer MLP’ includes unnecessary detail. Consider simplifying it to just ‘MLP’

L116: Instead of ‘to stabilize model training’ it would be better to say: ‘optimize the performance of the model’

Figure 1: Are there blue contours present, or do you mean blue shades?

Section 2.1.4: This technical detail might be more suitable for the supplementary materials. Additionally, in point 6, please clarify the difference between the two CSV files.

Line 162: In ‘After this filtering, there were 157 datasets that could be used to train the models. (Fig. 1),’ please specify that these are the predictors of the models for clarity.

Line 203: how many years/samples were left for training?

Line 204: These are not ‘Independent models’ since they share training data. It would be better to say ‘separate models’ or ‘distinct models’ instead.

Section 3.2 Regularisation: Instead of writing this as a separate section, consider combining it with the loss function section and include the equation of the loss function with the regularisation term.

In Table 2, for clarity, update the caption to specify that L1 and L2 represent the hidden layers, and indicate the number of neurons in the input layer

Line 244: Was the choice of alpha arbitrary, or was it determined through trial and error?

Lines 255-258: The ‘Data Preparation’ section is unnecessary. Instead, you could include details on how the predictors were standardised or normalised, as this information is currently missing.

Figure 2: Where is the monsoon core zone?

Lines 285-290: Instead of detailing how CNNs are typically used as encoders, focus on explaining your specific approach; starting with dense layers and then using CNN as a decoder, as mentioned in Line 290.

Line 325: The term ‘distribution mean’ may not be accurate. It would be clearer to say ‘the mean of all predictions.’

In the caption of Figure 3: ‘Observed values taken from the reconstructed time series in Sontakke et al. (2008) are given in green’. Do you mean the observed values are shown to the right of the green line?

Line 367: The term ‘pattern correlation’ is ambiguous as it could refer to either ‘temporal’ or ‘spatial’ correlation. Please specify that this refers to ‘spatial’ correlation for clarity.

Figure 5 caption: Please revise the sentence ‘the prediction from with the PCC is computed is taken’ for clarity. Additionally, ensure consistency between ‘CNN-ERA5’ in the caption and ‘ERA5-CNN’ mentioned in Line 385.

Figure 7 caption: There is no dotted black line as mentioned; instead, there is a single dotted green line. Please correct the caption to reflect that it is one dotted green line, not ‘green lines’.

Line 455: Are you referring to low ‘temporal’ resolution?

Figure 8: How were the Shapley values standardised? Also, did you take the absolute values of the Shapley values before averaging them? Please add these details for clarity.
Citation: https://doi.org/10.5194/egusphere-2024-2128-RC1
- AC1: 'Reply on RC1', Kieran Hunt, 23 Sep 2024
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2024/egusphere-2024-2128/egusphere-2024-2128-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2024-2128-AC1
RC2:
'Comment on egusphere-2024-2128', Nick Scroxton, 10 Sep 2024
General Comments
In their manuscript Hunt and Harrison provide a novel approach to reconstructing historical monsoon variability: using machine learning to assess the relationship between paleoclimate records as predictors and the IMD gridded rainfall dataset between 1901-present as the target, and then extending that record back through time using annually resolved paleoclimate records from tree-rings, speleothems, glaciers etc. The results are interesting and appear to provide a significant step forwards monsoon predictability, comparable or slightly better than previous techniques in developing time series at point locations or areal means, but a significant advance in reconstructing spatial heterogeneity of monsoonal variability. The section on assessing the importance of individual paleoclimate records in the model, and the ‘sphere of influence’ of individual sites is an exciting development and appears to be backed by our understanding of monsoon dynamics.
The manuscript is well written, and the language is largely appropriate but could be simplified in places to the intended audience of Climate of The Past who likely have a more limited understanding of machine learning techniques. This includes more high-level introduction to methods, and care explaining predictors, targets, training datasets, validation datasets, test datasets etc. There is also sometimes a disconnect between what is labelled on the figures, the figure captions and what the figures refer to. Consistency and clarity are needed in places.
I do not have any significant concerns over the science and resulting inferences presented in this manuscript, though I will happily defer to more qualified machine learning reviewers on the robustness of the methods. I believe that this paper is suitable for publication in Climate in the Past, after correction for some minor issues.
Thank-you
Nick Scroxton
Maynooth University
Specific Comments
Could you explain LiPDs in either section 2.1.1 or 2.1.2 before you get to line 148.

Section 3: A high-level couple of sentences at the start of the methods would help non-machine learning experts. For example, it doesn’t actually say in the manuscript what is your predictor data-set is and what is the target data-set.

What’s the difference between validation and test datasets in Figure 2 and section 3.5 1a. What do they do and why?

Figure 3: This figure does not show clearly the information that is attributed to it. There are significant mismatches between the figure and figure caption. I don’t see the ensemble median in black, the spread in red, or the Sontakke time series in green. I think this figure should be expanded to take-up more space on the page to really highlight the key results attributed to it.

Line 380-383: The dismissal of the PCA technique is too strong at this stage of the manuscript. At this point the reader has only been introduced to figure 4. The PCA outperforms the CNN model 40% of the time (2 out of 5 years) in figure 4 so cannot be dismissed, although an argument can be made that it lacks spatial heterogeneity. Once we have seen figure 5 and the larger dataset, we see that outperformance of the PCA method in figure 4 is likely just an artifact of small sample size, and therefore the dismissal is more reasonable. This section therefore might need to be reworded

I wonder if dry years also correspond to major volcanic eruptions (as we might expect). This might provide an additional test of reconstruction performance that is less reliant on additional human societal complexities.

I disagree with some of your speleothem inferences. In section 4.3 If the EPI speleothem record is d18O then we might expect a better correlation with WPI rainfall than EPI, and thus this example belongs in the following paragraph instead. I don’t see the relevance of the river basins argument here on speleothem proxy variability.

Technical Corrections
Line 196-198: Could you unreversed the first half of this sentence for clarity

Figure 2 caption: ‘one of the timeline model’

Line 354: New paragraph?

Line 364: 1988?

Line 380: New paragraph?

Line 528, 542: The cave name is spelt ‘Mawmluh’ or ‘Krem Mawmluh” if cave needs to be included.

Line 530: ‘documented’?

Line 531: ‘subcontinent’ to be consistent throughout the manuscript.

Line 561: clarify what you mean by wavelength or use less technical language.

Line 578: delete ‘have’
Citation: https://doi.org/10.5194/egusphere-2024-2128-RC2
- AC2: 'Reply on RC2', Kieran Hunt, 23 Sep 2024
  
  The comment was uploaded in the form of a supplement: https://egusphere.copernicus.org/preprints/2024/egusphere-2024-2128/egusphere-2024-2128-AC2-supplement.pdf
  
  Citation: https://doi.org/10.5194/egusphere-2024-2128-AC2

Peer review completion

AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload

ED: Publish subject to minor revisions (review by editor) (23 Oct 2024) by Nerilie Abram

AR by Kieran Hunt on behalf of the Authors (23 Oct 2024) Author's response Author's tracked changes Manuscript

ED: Publish as is (31 Oct 2024) by Nerilie Abram

AR by Kieran Hunt on behalf of the Authors (07 Nov 2024) Manuscript

Journal article(s) based on this preprint

07 Jan 2025

A novel explainable deep learning framework for reconstructing South Asian palaeomonsoons

Kieran M. R. Hunt and Sandy P. Harrison

Clim. Past, 21, 1–26, https://doi.org/10.5194/cp-21-1-2025,https://doi.org/10.5194/cp-21-1-2025, 2025

Short summary

Kieran M. R. Hunt and Sandy P. Harrison

Data sets

Data-driven reconstructions of the Indian palaeomonsoon (1500–1995 CE) Kieran M. R. Hunt and Sandy P. Harrison https://zenodo.org/records/12688184

Kieran M. R. Hunt and Sandy P. Harrison

Viewed

Total article views: 1,884 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
841	965	78	1,884	48	73

HTML: 841
PDF: 965
XML: 78
Total: 1,884
BibTeX: 48
EndNote: 73

Views and downloads (calculated since 09 Aug 2024)

Month	HTML	PDF	XML	Total
Aug 2024	147	43	11	201
Sep 2024	87	27	26	140
Oct 2024	37	12	2	51
Nov 2024	39	9	2	50
Dec 2024	48	9	0	57
Jan 2025	26	19	6	51
Feb 2025	23	3	6	32
Mar 2025	17	12	0	29
Apr 2025	17	18	3	38
May 2025	19	13	1	33
Jun 2025	12	19	0	31
Jul 2025	12	22	1	35
Aug 2025	34	17	0	51
Sep 2025	117	37	1	155
Oct 2025	25	33	0	58
Nov 2025	13	64	0	77
Dec 2025	33	27	2	62
Jan 2026	33	67	3	103
Feb 2026	32	44	3	79
Mar 2026	36	237	7	280
Apr 2026	34	228	4	266
May 2026	5	0	5

Cumulative views and downloads (calculated since 09 Aug 2024)

Month	HTML	PDF	XML	Total
Aug 2024	147	43	11	201
Sep 2024	87	27	26	140
Oct 2024	37	12	2	51
Nov 2024	39	9	2	50
Dec 2024	48	9	0	57
Jan 2025	26	19	6	51
Feb 2025	23	3	6	32
Mar 2025	17	12	0	29
Apr 2025	17	18	3	38
May 2025	19	13	1	33
Jun 2025	12	19	0	31
Jul 2025	12	22	1	35
Aug 2025	34	17	0	51
Sep 2025	117	37	1	155
Oct 2025	25	33	0	58
Nov 2025	13	64	0	77
Dec 2025	33	27	2	62
Jan 2026	33	67	3	103
Feb 2026	32	44	3	79
Mar 2026	36	237	7	280
Apr 2026	34	228	4	266
May 2026	5	0	5

Viewed (geographical distribution)

Total article views: 1,874 (including HTML, PDF, and XML) Thereof 1,874 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 02 May 2026

Download

The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.

Preprint (10764 KB)
Metadata XML

Short summary

In this study, we train machine learning models on tree rings, speleothems, and instrumental rainfall to estimate seasonal monsoon rainfall over India over the last 500 years. Our models highlight multidecadal droughts in the mid-seventeenth and nineteenth centuries, and we link these to historical famines. Using techniques from explainable AI, we show our models use known relationships between local hydroclimate and the monsoon circulation.


Total:	0
HTML:	0
PDF:	0
XML:	0