the Creative Commons Attribution 4.0 License.
the Creative Commons Attribution 4.0 License.
Linking reported drought impacts with drought indices, water scarcity, and aridity: the case of Kenya
Abstract. The relation between drought severity, as expressed through widely used drought indices, and drought impacts is complex. In particular in water-limited regions where water scarcity is prevalent, the attribution of drought impacts is difficult. This study assesses the relation between reported drought impacts, drought indices, water scarcity, and aridity across several counties in Kenya. The monthly bulletins of the National Drought Management Authority in Kenya have been used to gather drought impact data. A Random Forest (RF) model was used to explore which set of drought indices best explains drought impacts on: pasture, livestock deaths, milk production, crop losses, food insecurity, trekking distance for water, and malnutrition. The findings of this study suggest a relation between drought severity and the frequency of drought impacts, whereby the latter also showed a relation with aridity, whilst water scarcity did not. The results of the RF model reveal that drought impacts can be explained by a range of drought indices across regions with different aridity. While the findings strongly depend on the availability of drought impact data and the socio-economic circumstances within a region, this study highlights the potential of linking drought indices with text-based impact reports. In doing so, however, spatial differences in aridity and water scarcity conditions have to be taken into account.
-
Notice on discussion status
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
-
Preprint
(2083 KB)
-
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(2083 KB) - Metadata XML
- BibTeX
- EndNote
- Final revised paper
Journal article(s) based on this preprint
Interactive discussion
Status: closed
-
CC1: 'Comment on egusphere-2022-458', Paolo Tasseron, 17 Aug 2022
This study discusses the possible linking of drought indices with text-based documents by a drought management authority in Kenya. An assessment is made how drought impacts relate with drought indices and water scarcity under various circumstances, by using a Random Forest Model. In doing so, the authors aim to contribute to ongoing debates about operational needs for drought monitoring. The importance of improving early warning systems to mitigate adverse consequences of drought is corroborated by the study and its results.
In general, I believe the authors present a very well-written manuscript, with clear and complete (sub)sections. The quantity and quality of references to relevant state-of-the-art studies is spot on. I can only suggest some minor and a few moderate revisions require attention prior to publication. These are summarised below. Well done!
Minor and moderate suggestions for revision.
Linenumber
Comment
25
Consider changing ‘society’ to ‘societies’
30-37
I think the terms meteorological and hydrological drought do not need such an elaborate explanation. If you think the readership does need this explanation, then consider adding a short definition of ‘soil moisture/agricultural drought’ as well.
83-85
Consider including the Köppen classification to describe climatological setting in Kenya, to highlight the diversity between the relatively wet southwest and dry/arid/desert north and east.
117
How was the difference in spatial resolution (0.25 degree) with the other datasets (0.1 degree) dealt with in by the authors?
145
Consider changing ‘for each person’ to ‘per capita’
148
Replace ‘have been’ by ‘was’
159-160
How are the grid cells spread over the different counties? Did you only compute the means of grid cells completely within a county border? Please specify this, also the resolution that was used for the grids.
173-174
So the indices were calculated for 1980-2020, but only used for 2014-2020. What is the added value of calculating 1980-2014, if it is not being used?
194
‘have’ = ‘has’
202-203
does this aggregation have any drawbacks?
Table 3
Could you indicate p-values in this table? i.e. p<0.05 = *, p<0.01 = **, p<0.001 = ***
243-244
It is a bit unclear what the authors mean by 2, 4, 2 months out of 10/4,0,4 months out of 12. Consider rewriting this sentence.
251
The AUC abbreviation was not previously introduced and should be written out fully.
Figure 4 & 5
Font size should be increased. In addition, the range of the X-axis is different for each figure. If relevant for comparison between the different categories (which I think is the case), consider using the same range for each figure. You might even want to plot the points of multiple categories in one figure with different colours, instead of having 14 individual subplots.
293
Please avoid using the term ‘reliable’ when talking about an objective appraisal, as this is a subjective judgement without qualitative or quantitative data to support the statement. It would improve the statement to shortly explain how the iterative processes and the focus on abnormal conditions took place, as I do not recall reading about this in the methods section. I believe this is important, because the bulletins form an integral part of your analyses.
298-300
Great to see the authors mention multiple outlets to complement NDMA bulletins!
306
‘Good’: please quantify this by for instance mentioning the average AUC value, or repeat the AUC value of the top 3 categories.
320
‘a kind of lag’ – please specify in more detail.
339
‘low population density does not imply low water stress’ – this calls for a reference.
350
‘should have suffered from water scarcity during periods of drought due to the high population density’ – this reads a bit dark. Consider rephrasing the sentence so it does not read as if you want these counties to suffer.
371-373
this was already mentioned earlier in the manuscript and does not need repetition here.
376-378
The authors mention comparisons with these studies are difficult due to different socio-economic and climatic circumstances. That’s a fair point, but how do they compare in terms of resolution? Is it practically possible to compare these quantitatively, or does this also not work?
400-401
Could you please include a reference to this existing database (or name it)?
432-433
Is there any indication of the spatial resolution required to capture the regional differences?
431-437
This last paragraph of the conclusion reads well and contains and sensible information, but I think it undermines the results of the study. It reads as if the work explored in this study is disqualified a bit, since focus is put on the need for finer resolutions to contribute to the development of early warning systems. I recommend the authors to ‘praise’ their own work a bit more in this last paragraph, instead of talking it down.
-
AC4: 'Reply on CC1', Marleen Lam, 05 Dec 2022
Thank you very much for reviewing our manuscript. We are very pleased that you recognize the scientific relevance of our research. The minor and moderate suggestions for revision will be considered and incorporated.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC4
-
AC4: 'Reply on CC1', Marleen Lam, 05 Dec 2022
-
RC1: 'Review of Lam et al. (2022): Linking reported drought impacts with drought indices, water scarcity, and aridity: the case of Kenya', Anonymous Referee #1, 25 Nov 2022
General comments:
In this study, Lam et al. explore statistical links between drought impacts, drought indices, and water scarcity in five counties in Kenya from 2014 to 2020. The authors use seven impact factors retrieved from the monthly bulletins of the National Drought Management Authority in Kenya and four drought indices at various time scales. A random forest model was used to measure statistical associations between the impacts and drought indices. Their result shows the drought indices that are best linked with each of the impact factors and indicate that these associations are also influenced by the aridity level of the regions.
This study on droughts is in the scope of the NHESS journal and is a good contribution to the field of drought impacts, mainly as it is one of the few studies focusing on the African continent. However, the method and the analysis in the manuscript are not sufficient to support some of the authors’ arguments, and the overall presentation of the manuscript (some figures and discussions) needs to be improved. I suggest some major revisions before considering a possible publication in the journal. Below find my comments.
Specific comments:
- Abstract
It would be nice to add two or three more sentences about the results (for example, which indices were included in the study, which one is the most associated with which impact factor, etc.) to give more information to the readers. As it is presented, the abstract sounds very general, and hard to see what the significant contribution of this study is. Also, if the authors’ study is the first one (or one of the few) that explores the link between drought impacts and drought indices in the African continent, that should be emphasized.
1. Introduction
Line 22) "Due to the projected increase in drought frequencies": In which regions and when will this increase become significant? Note that the increases in drought frequency and intensity are not globally equal. See Chapter 11 in the new IPCC The Physical Science Basis report (Seneviratne et al., 2021) and cite this new report instead of the 2014 version.
Line 26) What do the authors mean by "a hydro-meteorological event"? "Hydrometerological events" in general or "drought" or "extreme hydrometeorological events"?
Line 27) I am not convinced here how "Linking drought impacts to drought indices" could contribute to "the development and improvement of the M&EW .". Will it improve drought forecasting (in which way)? Help with better preparedness? Please elaborate on this sentence better.
Line 30-43) This paragraph should go after the first paragraph (from line 22), as it introduces drought indices before discussing "Linking drought impacts to drought indices.”
Line 31) In this study, as impacts on society are considered, I would also add a definition of socio-economic drought, which occurs when water as an economic good cannot meet demands due to a weather-related shortfall. For the definition, see, for example, Mishra and Sigh (2010).
Line 33-35) Add a citation for this argument.
Line 41) I suggest adding an explanation of some drought indices associated with each drought type after Yihdego et al. (2019) (at least the indices the authors use for this study). So the readers can grasp a better idea of which indices are usually used in drought studies.
Line 41) "Models using drought indices to forecast drought can detect climate signals": It is unclear what the authors mean by models (which models?) and climate signals. Please elaborate on this better.
Line 42) "Yet, the link between drought indices and socio-economic impacts has rarely been analyzed": But the authors have mentioned several papers on linking drought indices and impacts in the discussion and conclusion section (Blauhut et al., 2015; Bachmair et al., 2015, 2016, 2018, Stagge et al., 2015, Ma et al., 2020; Wang et al., 2020, among others.), right? Even in the next paragraph, the authors claimed that some works on the topic already exist and they are focused on Europe and the U.S. If these previous works had motivated the authors to perform this study, these works should be acknowledged and elaborated better in the introduction: which indices and methods were the previous studies used? What do their results say about the link between droughts and drought impacts? What is the difference between these studies and the authors' study? Also, was there any previous research analyzing the impacts of droughts over Kenya or the African continent?
Line 48) What does EDII stand for?
Line 65-67) I would remove “because of.” and split the sentence in two: “In this study, we focus on Kenya. The country is characterized by strong gradients in precipitation …”.
Line 74) I am not sure what the authors mean by “arid circumstances.”. Drought? Aridity index?
Line 75) "It is expected that drought indices will show a somewhat similar response across different climatic zones in Kenya because of the standardized nature of drought indices": For me, it is not very clear why the authors expect this. Standardizing indices standardize a water balance variable with respect to the climate of a region to facilitate the comparison across the different climates. But this does not make all regions have similar spatial and temporal variability of a drought index or respond similarly to drought drivers. Droughts can be local or regional events caused by different drivers. In addition, different surfaces, vegetation, and geographical conditions affect the spatiotemporal characteristics of droughts. Thus, even neighboring regions can present different drought conditions. For example, see the plots in Spinoni et al. (2015) which show different drought frequencies and extent (using SPI, SPEI, and RDI) in Europe.
2. Data and Methods
2.1. Study Area
Line 98) "This study will specifically focus on a region in Nyeri, namely Kieni...". This sentence is confusing. So, if I understood it correctly, inside of Nyeri, the authors will focus only on Kieni. Then, I would rephrase a sentence like, "This study specifically focuses on one region in Nyeri, namely Kieni. From now on, with Nyeri, we refer to only the Kieni region.".
If this is the case, why is only Kieni selected in the Nyeri region?
2.2. Data
Line 108) Add a citation for the MSWEP version 1.1.
Line 111) The spatial resolution of MSWEP is 0.25 degree based on Beck et al., 2017.
Line 117) Add a citation for MSWEP v2.8.
Line 117) Add a citation for Priestley and Taylor equation.
Line 118) Add a citation for ERA5.
Line 125) Add a citation for GLOFAS.
Line 127) What do HTESSEL and LISFLOOD stand for? Also, add a citation for each of the data and models, including ERA5, if the ERA5 dataset used in this part is a different version from the previously mentioned one.
Line 128) The last sentence about LISFLOOD can be removed if this is only the input model used to generate the GLOFAS data.
Line 130) I would suggest changing the subtitle to "Impact data from the National Drought Management Authority" to make it more consistent with the other subtitles.
Line 138) Add a citation for the VCI Index and a brief description of this index (how is this index estimated?)
Line 138) Add a citation after the Food Consumption Index and briefly describe this (how is this index estimated?)
Line 139) Add a citation for the malnutrition index.
Line 140) I suggest splitting the sentence into two. For example, "This study has utilized water scarcity (WS) data from McNally et al. (2019). The data is a monthly water scarcity dataset with a spatial resolution of 0.1 degree for Africa from March 2018 to the present."
Line 143) FLDAS is driven by the satellite-based CHIRPS precipitation. I can see that the authors mention this in the Appendix, but also add a sentence about it in this section to make it consistent with the description of other datasets.
CHIRPS is a different precipitation dataset and is not included in generating the MSWEP data. How would this WS based on a different set of precipitation affect your final result? This could be briefly discussed in the Discussion section.
Line 144) Add a citation for the Falkenmark index.
Caption for Table 1) Mention here that the Falkemark index is for the water scarcity level. Something like: "Falkenmark index for the water scarcity level".
2.3. Methods
2.3.1. Drought Indices
Line 158) The values are not fit through a standard normal distribution from the beginning. The cumulative precipitation is transformed to a normalized distribution by first "fitting a parametric statistical distribution" (Stagge et al., 2015a), which in general, they are a Gamma or Pearson III distribution for SPI (2015a) and a log-logistic distribution for SPEI (Vicente-Serrano et al., 2010; Beguería et al., 2014). Read Stagge et al. (2015a) and Stagge et al (2015b) to see different types of distributions for precipitation. Which type of distribution did the authors use?
Line 159) Are the values area-weighted averaged? If yes, mention it.
Lines 160) Is the same procedure for SPI and SPEI (cumulating monthly values, fitting a specific type of parametric distribution, then normalizing the distribution) also applied to SMI and SSMI? If yes, which parametric distributions were used for these two variables?
Line 162) "incorporates temperature by including evaporation anomalies": It is the other way around: SPEI incorporates the effects of potential evapotranspiration, which depends strongly on the temperature. And for the calculation, “potential evapotranspiration” is included, not “evaporation anomalies”. See Beguería et al. (2014).
Line 166) Is GLOFAS or GloFAS? Previously, the authors used GLOFAS (Line 125). Be consistent with the acronyms.
Line 179) The authors never define what they consider “drought”. What is the threshold the authors set to define drought periods?
General comments about Section 2.3.1) As the authors already mentioned in the introduction, each drought index represents different types of droughts: SPI is usually for meteorological drought that only considers the effect of precipitation, while SPEI and SMI are agricultural or soil moisture droughts, and SSI is the hydrological drought (See Yihdego et al., 2019). This difference between the indices should be clearly explained here.
2.3.2. Drought Impact data
Line 176) A citation or a link to this website should be added if this is an official website. Also, is this impact data categorized by the authors available online?
Lines 181-183) Could the authors explain more about how this impact data was transformed into a binary value? With binary values, it means 1 if there was an impact and 0 with no impact? What do the authors mean by normal values compared to the previous years? How are these normal values defined (based on the years with no drought?), and what is the threshold to distinguish impact and no impact? (For example, when there is 30% or 50% -or whatever percentage – of crop failures compared to the normal no-drought year?)
Line 181) How do the authors define drought impacts? Bachmair et al. (2016) define a drought impact as “a negative environmental, economic, or social effect experienced under drought conditions (based on EDII).”.
Did the authors initially categorize all the impacts from NMDA, then select only drought impacts (impacts under drought conditions) for the analysis?
Line 187) What is exactly food insecurity? How is this impact factor measured, and how independent is this factor from crop losses and milk production?
Line 191) Add a brief explanation of Jaccard similarity. How does this method work?
2.3.3. Random Forest model
General comment about the section) The overall explanation of RF should be improved, and many parts of the appendix should be included in this section. For instance, partitioning the data in training and testing sets is essential information for readers to know how the RF model was built. Have a look at Bachmair et al. (2016) to see how they introduce the RF method for their work. And more technical parts related to sampling (lines 196-201) could be moved to the appendix.
About the RF model) Usually, machine learning techniques aim to deal with large datasets. How sensitive is the output of an RF model to the number of total values or the length of each variable? Is there any metric that shows how statistically significant is the fitting of your data?
I’m asking this as I can see the Kenya dataset is only for 7 years, which is 84 months in total. Based on the Appendix, 75% of the dataset is used to feed and train the model (63 months) and the rest of the months to validate the output (11 months).
How can the authors ensure that the RF is a good tool for this dataset?
Could the authors repeat their analysis by using a very simple approach, such as a sort of correlation analysis (Spearman rank or Pearson correlation analysis, similar to Bachmair et al., 2016, or a simple linear regression model.) between the impact data and drought indices? This analysis takes into account the number of datasets in the significance test (by considering the degree of freedom). This additional analysis can be compared to the output of the RF, helping to see whether the result from the RF is robust enough and whether the sample size affects the final output.
Line 201) Mention here more clearly that the regions are divided by their aridity indices or levels to assess the relationship between aridity levels and drought indices.
Also, what do the authors mean by aggregated? The three regions are averaged to be one input data in the RF?
Line 204) Add a citation for the ROC method and explain briefly how this is calculated.
3. Result
3.1. Drought Impacts
Line 225 - 234) The authors mainly discuss the impacts and droughts in Masarabit and Nyeri. But what about other regions? Why are the other three regions not included in the result? The plots for impacts and drought indices of other regions should also be included, at least in the appendix or supplement.
Line 210) "Drought impacts": The same as the comment for line 181. Are these the total impacts during 2014-2020, or only the impacts during droughts? How do the authors define a drought impact? As I can see from Fig 1., some impacts occur during the non-drought period in SPEI (for example, September 2020).
Table 2) The same comment as above. Are these "drought impacts" or the total impacts?
And I would suggest replacing "Count" with "Number of impacts".
Line 219) Replace "A timeline" with a "time series".
Figure 2 (and also for other figures)):
- Enlarge the texts and labels, please. They are challenging to read. Also, enumerate each subplot (with a. b. c. d.) to indicate the part the authors want to discuss clearly.
- The plots for impacts (the upper two panels) are complicated to read and distinguish the years. I wonder if there is a better way to visualize these plots: for example, by transforming them to line or bar plots, or somehow finding a way to make the font bigger and add a vertical line to divide the years.
- In the caption: "time series" instead of "timeline".
Line 226) The same comment as line 210: Is this the number of total impacts or drought impacts?
Lines 235-138) What is the authors' criterion to claim "a bit of relation" and "no relation"? Why is 0.63 slightly similar but 0.4 not? Also, add a significance level of the Jaccard similarity.
Table 3) How statistically significant are these correlations? One can add asterisks next to the values when they are statistically significant.
Line 230) What do the authors mean by "a direct relation"? Could you clarify it?
Line 231) What and when is the onset of drought? This question is also a continuation of the comment in line 179: To mention the onset of drought, what the authors define as drought periods should be clarified first.
Overall comment about the section) I'm quite skeptical about including the indices with 2 years time scale (24 months) in this study. A higher time scale index is appropriate to represent relatively long dry and wet conditions. Even though these indices have a monthly time scale, one monthly value of the index represents two-year hydroclimate condition (Hence, also showing a strong persistence.).
The Kenya dataset is too short (7 years) to include these indices for the analysis. If one drought occurs in one month in this index, this indicates a dry period of almost 30% (2 years) of the study period. The authors can also see this in the time series of SPEI-24 for Masarabit: more than 50-60% of the time series shows a drought condition covering a large portion of the impacts. The plot basically indicates that there is only one long drought event for the analysis, and I don't think one can derive a significant statistical association from this time series.
Another study, for instance, Bahmair et al. (2016) can include this long-time scale index, as their dataset starts in 1970.
3.2. Drought impacts and water scarcity.
Line 240) Start the paragraph mentioning that the association between WS and droughts is done from 2018 due to the length of the WS dataset.
Line 241) Remove “(start of timeframe WS dataset)”.
Figure 3) The same for all figures: Enlarge the text and labels in the plots. And I suggest putting the enumeration and the subtitles of the plots at the top and not at the bottom of the figures, as they are mixed with the captions.
Fig.3a) Suggest replacing “amount of months” with "number of months in a year"
General comment about the section) How is water scarcity associated with drought conditions? Have the authors made some comparisons between water scarcity and drought indices? Whether water scarcity always occurs during droughts or not should be analyzed and included in the section.
I am asking this as in 2020, there is no drought in Masarabit and Nyeri (Fig. 2), and many regions are under no water stress during 2018-2020 (Fig 3.b). Can it be that this no-water stress situation occurs mainly in 2020 (during no-drought condition)?
And do all the impacts during 2018-2020 occur under droughts, or are these the total impacts during 2018-2020 (including non-drought period)?
Line 246) The same comment about the "drought impact” in line 181.
3.3. Drought Indices and Random Forest
General comment about the section) For this section, the question is again, why do the authors show and discuss the results for only some regions? In addition, in Fig.4 only some specific impacts are discussed for a few regions and not all. Why is that? If some regions do not show significant connections with impacts, they should still be mentioned and included in the plot. If including everything enlarge the plots, the authors can think about putting dots in different colors for different impact categories.
Line 250) I suggest changing the title to something similar to “association between drought impacts and drought indices” or “Random Forest to link drought impacts and drought indices” instead of the name of the method the authors use.
Line 251) Mention here again that the regions are aggregated by their aridity levels or indices to make clear that the analysis in this part also considers the aridity levels of different counties.
Table 4) How is this performance measured? Clarify it in the caption.
Line 262) What do the authors mean by "best linked"? How is the threshold for the best and worst defined? Please clarify.
Line 265) The authors never mentioned what the MDA and Ginni Index are. Add a brief description of these scores and how they are estimated in the Method section. Some explanation of these scores is in the appendix, but this information should also be included in the Method.
Fig 3 and 4) Enumerate each of the plots to distinguish the Ginni and MDA plots.
Line 270) SPEI is clearly not a meteorological index. The authors also mentioned in the introduction that meteorological drought considers only a precipitation deficit.
4. Discussion
4.1. Data sources and methods
Line 290) How many monthly gaps does this report have (how many percentages of the total years)? This should be mentioned in the data section.
Line 292) What do the authors mean by iterative processes? Which kind of iterative process is applied to deal with the missing months? Please include this in the Method section.
After all, how would these missing gaps affect the result of this analysis?
Line 295) What does DIR stand for? Also, add a citation for EDII and DIR.
Line 299) What does EM-DAT stand for? Add a citation for EM-DAT and DesInventar.
Line 305- 306) I am not sure whether the AUC supports this statement. The authors are only using one method here. How can one confirm the output of the model if this is not compared to other methods? Also, how sensitive is an RF model to the number of samples? See my comment for Section 2.3.3.
4.2. Relations with aridity
Line 310) I'm not sure if the result partially supports the first sentence. All the activities included here are mainly pastoral and not other activities, right? Then how do the authors know the impacts on other industrial or economic sectors?
Line 320) What do the authors mean by the onset? The authors never defined when an actual drought starts and when is the onset of these droughts. See my comments for line 179.
Line 322) After evapotranspiration, add some citations referring to this argument (for example, Seneviratne et al., 2021).
Line 322) "However, this study cannot link drought occurrence to aridity.". This sentence is not very clear to me what the authors want to say. Aridity is a constant arid condition. Do you mean the aridity level of the regions, constant desertification due to climate warming, or just the impacts of drought frequencies related to global warming? Elaborate on the sentence, please.
Line 322) "short time period (10 years)". Why 10 years, if based on what I understood, the authors are taking the study period from 2014 to 2020? (7 years, including 2020).
Line 323) "interannual trend": Do the authors want to say "long-term" trend? The interannual trend does not make sense for a longer time scale analysis that the authors claim to be necessary.
Line 323-325) Overall, these two sentences are pretty difficult to understand what the authors want to say. How would a longer time series help to understand drought events and drying climate? Do you mean a long-term drying trend and a change in the climate? How is this connected to aridity and droughts? Please clarify this part.
4.3. Water scarcity and drought impacts
Line 329-331) “Increased distance from water sources was reported in arid and semi-arid regions during most of the months when meteorological and hydrological drought conditions occurred (Figure 3b)”: I don't see this result in Fig. 3b.
Line 346-347) I cannot see any statistical relationship between WS and droughts as the authors haven’t compared the two datasets in the Result section (see the comment for Section 3.2). In addition, for me, 2020 looks like an almost drought-free period in Masarabit and Nyeri (Fig 2.)
4.4. Drought indices and the Random Forest model
Line 358) See my comment about the drought indices with a 24-month time scale in Section 3.1. SPEI-24 is not appropriate for this analysis with a 7-year-long time series to derive a robust statistical association.
Line 361) “These differences between the best match between drought impacts and drought indices implies therefore a link with human activities”: The argument here is not very convincing for me. How are the authors’ results on the best and worst indices related to human activities and the propagation of droughts? Please elaborate on this argument better.
Line 363) Again, SPEI is not a meteorological drought. It is an agricultural or soil moisture drought, and when the time scale becomes longer, it also shows memories that are comparable with the soil moisture anomalies.
Line 367) “Noticeable is that SSI gives a possible link with water-dependent activities while SSMI shows a possible link with agricultural practices.”: Isn’t this expected as SSI represents hydrological drought and SSMI the soil moisture drought? Clarify the division between different drought indices in the Method section (See my comment for Section 2.3.1).
Line 369) Again reiterating. SPEI is clearly not a meteorological drought.
Line 371-372) Note that Dai et al. 2020 use only SPI which is a meteorological drought index. They do not include SPEI in their analysis.
Line 375) Again, for this short dataset of 7 years, I would not include an index with a time scale of 2 years for statistical analysis.
Line 378-381) Could this difference in the result between the authors’ work and Bachmair et al. (2016) in Europe be related to each country’s economic and resilience capacity to droughts? It would be worth discussing how the economic condition of each country affects drought impacts and the result of this study.
Line 386) The same comments about the indices with a 24-month time scale (line 375).
Line 387) If the authors claim that long SPI and SPEI are the hydrological droughts, the hydrological index (SSI) at various time scales must also show a significant association to those impacts associated with long SPI and SPEI. The association of SSI should be discussed in more detail here and in the Result section.
Line 396) I can imagine that the adaptation measures will increase the resilience of smallholders. However, the sentence does not clearly explain how these measures will affect the result of this study.
Line 396-398) Difficult to understand. I suggest splitting the sentence into two.
Overall comments) Also, the effects and drawbacks of short time series and the RF method should be discussed briefly in this section.
5. Conclusion
Line 406) The same as the comment for line 22. Where and when?
Line 412) Again, why is the discussion mostly about Masarabit and Nyeri?
Line 418) What do the authors mean by “unreliable water condition”? Please clarify.
Line 419) I can see that in 2020, not many drought months occurred. Again, are all the impacts counted during the drought conditions or for the entire period?
Line 426) Same comments for the drought indices with a 24-month time scale.
Line 432-433) Do the authors mean over the entire globe or in Africa? Clarify it.Line 433-434) What about the temporal resolution of the dataset?
Line 434) What do the authors mean by “other research areas”?
Technical corrections:
Line 6) include “that” between “indices” and “best”.
Line 7) Remove “:”.
Line 10) “aridity” to “aridity level of the regions”.
Line 15) "impactful" instead of "impact-full".
Line 52) Replace "setting" with "regions".
Line 98) Remove "However".
Line 103) "reanalysis" instead of "re-analysis".
Line 104) Remove "extracting".
Line 109) the complementary nature of "the" highest.
Line 115) Remove (v3.5a).
Line 122) Add e.g., before the papers.
Line 140) "has" instead of "have".
Line 141) "from March 2018 to the present." instead of "between".
Line 146) Replace "However" with "For this" or something similar. “However” here does not make sense.
Table 1) Add a horizontal line between the first row and the second row.
Line 176) Instead of "between ... and ... " use "from ... to ...".
Line 236 and 247) Replace "a bit" with "slightly".
Line 317) Add "Also" after the first sentence. "Also, the interference...".
Line 320) Replace "a kind of" with "some time" lag.
Line 322) "cannot " instead of "can not".
Line 338) Remove “;” after Kenya.
Line 340) in "the" ASAL counties.
Line 362) "imply" instead of "implies".
Line 397) Add "Our" in front of the Result.
Line 399) "the first step".
Line 400) I'd use "exploring the link" instead of "analysing".
Line 409) Remove "In addition".
Plots in general) Enlarge the fonts and dots.
References:
Bachmair, S., Svensson, C., Hannaford, J., Barker, L.J. and Stahl, K., 2016. A quantitative analysis to objectively appraise drought indicators and model drought impacts. Hydrology and Earth System Sciences, 20(7), pp.2589-2609.
Beguería, S., VicenteâSerrano, S.M., Reig, F. and Latorre, B., 2014. Standardized precipitation evapotranspiration index (SPEI) revisited: parameter fitting, evapotranspiration models, tools, datasets and drought monitoring. International journal of climatology, 34(10), pp.3001-3023.
Mishra, A.K. and Singh, V.P., 2010: A review of drought concepts. Journal of hydrology, 391(1-2), pp.202-216.
Seneviratne, S.I., X. Zhang, M. Adnan, W. Badi, C. Dereczynski, A. Di Luca, S. Ghosh, I. Iskandar, J. Kossin, S. Lewis, F. Otto, I. Pinto, M. Satoh, S.M. Vicente-Serrano, M. Wehner, and B. Zhou, 2021: Weather and Climate Extreme Events in a Changing Climate. In Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change [Masson-Delmotte, V., P. Zhai, A. Pirani, S.L. Connors, C. Péan, S. Berger, N. Caud, Y. Chen, L. Goldfarb, M.I. Gomis, M. Huang, K. Leitzell, E. Lonnoy, J.B.R. Matthews, T.K. Maycock, T. Waterfield, O. Yelekçi, R. Yu, and B. Zhou (eds.)]. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, pp. 1513–1766, doi:10.1017/9781009157896.013.
Spinoni, J., Naumann, G., Vogt, J. and Barbosa, P., 2015: European drought climatologies and trends based on a multi-indicator approach. Global and Planetary Change, 127, pp.50-57.
Stagge, J.H., Kohn, I., Tallaksen, L.M. and Stahl, K., 2015: Modeling drought impact occurrence based on meteorological drought indices in Europe. Journal of Hydrology, 530, pp.37-50.
Stagge, J.H., Tallaksen, L.M., Gudmundsson, L., Van Loon, A.F. and Stahl, K., 2015b: Candidate distributions for climatological drought indices (SPI and SPEI). International Journal of Climatology, 35(13), pp.4027-4040.
Citation: https://doi.org/10.5194/egusphere-2022-458-RC1 -
AC1: 'Reply on RC1', Marleen Lam, 05 Dec 2022
We would like to thank the reviewer for providing a detailed and in-depth review of our manuscript. We are pleased to see the reviewer confirming that our study is one of the few studies focusing on the African continent in relation to linking drought and drought impacts, and that in principle a revised version of the manuscript could be publishable. The comments, which in general we are happy to account for, will help us to further improve the manuscript and significantly raise the quality. We will provide a detailed point-by-point response to the comments in a rebuttal.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC1 -
AC5: 'Reply on RC1', Marleen Lam, 05 Dec 2022
Thank you again for your comments and suggestions. We would like to address some of the major comments already. The reviewer suggests to exclude the drought indicators with an accumulation period of 24 months because of the length of the dataset (7 years). This remark will definitely be considered and the analysis can easily be carried out without the accumulation period of 24 months. In addition, there is suggested to compare the random forest analysis with a sort of correlation analysis between the drought impact data and the drought indices. This could be a good way to validate the model and will be considered and carried out. The reviewer addressed as well that some choices (such as highlighting some counties over others) should be explained in more detail which will be solved by adding some extra sentences. Overall, we are happy to make these adjustments and to take these remarks into consideration which will result in an improved quality of the manuscript.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC5
-
RC2: 'Comment on egusphere-2022-458', Anonymous Referee #2, 28 Nov 2022
Manuscript Number: egusphere-2022-458
Title: Linking reported drought impacts with drought indices, water scarcity, and aridity: the case of Kenya
Authors: Marleen R. Lam, Alessia Matanó, Anne F. Van Loon, Rhoda Odongo, Aklilu D. Teklesadik, Charles N. Wamucii, Marc J. C. van den Homberg, Shamton Waruru, and Adriaan J. Teuling
The work analyzes the applicability of drought indices in explaining drought impacts using Kenya as a case study. A Random Forest (RF) model was used to identify which drought indices best explains drought
impacts on pasture, livestock deaths, milk production, crop losses, food insecurity, trekking distance for water, and malnutrition.
Major Comments:
- The results presented in lines 230-235 state that the drought impacts better overlaps with the accumulation time of 12 months in comparison with the others tested (1, 3, 6 and 24). However, isn’t expected that the effects of meteorological droughts on pasture, milk production and etc. be lagged either than happening simultaneously? How is this lag effect considered? Also, the SPEI indices of all accumulation times displayed in Figure 2 show good overlap with the impacts listed.
- The study does not account for drought resilience actions that could have been applied in the region (e.g., reservoirs, irrigation, etc.) and can affect the correlation of meteorological droughts and its impacts.
- How do the current results compare with the one cited in line 301-305, using other correlation metrics?
Minor Comments:
- It should be mentioned in the abstract which drought indices were utilized in the analysis and which ones better performed in predicting the drought impacts.
- In Figure 2, what do the circles in the upper panel represent? Is it only if the impact occurred or does it reflect the number of occurrences?
Citation: https://doi.org/10.5194/egusphere-2022-458-RC2 -
AC2: 'Reply on RC2', Marleen Lam, 05 Dec 2022
Thank you very much for reviewing our manuscript, and for the assessment that a revised version should be of interest for NHESS. The remarks and suggestions are very useful, and they will help us to further improve the manuscript. In general we are happy to account for all the comments made in a revised version. We will provide a detailed point-by-point response to every comment in a rebuttal.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC2 -
AC3: 'Reply on RC2', Marleen Lam, 05 Dec 2022
Publisher’s note: this comment is a copy of AC2 and its content was therefore removed.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC3 -
AC6: 'Reply on RC2', Marleen Lam, 05 Dec 2022
Thank you again for your comments and suggestions. We would like to address some of the major comments already. The reviewer notifies that effects of meteorological drought on pasture, milk production and etc. are lagged either than happening simultaneously. We definitely agree with this comment. The lag is considered in the accumulation periods considered in this study: shorter timescales (3 or 6) are often used to monitor agricultural drought while longer accumulation periods (12 or 24) are used to monitor hydrological drought (Wang et al., 2020). Therefore, combining the drought impacts with these different timescales already includes a memory or the so-called ‘lag’ of drought impacts. The revised manuscript will elaborate a bit more on this topic to improve the clarity. As the other reviewer suggested as well, the results of the random forest model can be compared with other correlation metrices. This is a good suggestion and will be carried out. Overall, we are happy to take these remarks into consideration and make adjustments accordingly.
Wang, Y., Lv, J., Hannaford, J., Wang, Y., Sun, H., Barker, L. J., ... & Eastman, M. (2020). Linking drought indices to impacts to support drought risk assessment in Liaoning province, China. Natural Hazards and Earth System Sciences, 20(3), 889-906.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC6
Interactive discussion
Status: closed
-
CC1: 'Comment on egusphere-2022-458', Paolo Tasseron, 17 Aug 2022
This study discusses the possible linking of drought indices with text-based documents by a drought management authority in Kenya. An assessment is made how drought impacts relate with drought indices and water scarcity under various circumstances, by using a Random Forest Model. In doing so, the authors aim to contribute to ongoing debates about operational needs for drought monitoring. The importance of improving early warning systems to mitigate adverse consequences of drought is corroborated by the study and its results.
In general, I believe the authors present a very well-written manuscript, with clear and complete (sub)sections. The quantity and quality of references to relevant state-of-the-art studies is spot on. I can only suggest some minor and a few moderate revisions require attention prior to publication. These are summarised below. Well done!
Minor and moderate suggestions for revision.
Linenumber
Comment
25
Consider changing ‘society’ to ‘societies’
30-37
I think the terms meteorological and hydrological drought do not need such an elaborate explanation. If you think the readership does need this explanation, then consider adding a short definition of ‘soil moisture/agricultural drought’ as well.
83-85
Consider including the Köppen classification to describe climatological setting in Kenya, to highlight the diversity between the relatively wet southwest and dry/arid/desert north and east.
117
How was the difference in spatial resolution (0.25 degree) with the other datasets (0.1 degree) dealt with in by the authors?
145
Consider changing ‘for each person’ to ‘per capita’
148
Replace ‘have been’ by ‘was’
159-160
How are the grid cells spread over the different counties? Did you only compute the means of grid cells completely within a county border? Please specify this, also the resolution that was used for the grids.
173-174
So the indices were calculated for 1980-2020, but only used for 2014-2020. What is the added value of calculating 1980-2014, if it is not being used?
194
‘have’ = ‘has’
202-203
does this aggregation have any drawbacks?
Table 3
Could you indicate p-values in this table? i.e. p<0.05 = *, p<0.01 = **, p<0.001 = ***
243-244
It is a bit unclear what the authors mean by 2, 4, 2 months out of 10/4,0,4 months out of 12. Consider rewriting this sentence.
251
The AUC abbreviation was not previously introduced and should be written out fully.
Figure 4 & 5
Font size should be increased. In addition, the range of the X-axis is different for each figure. If relevant for comparison between the different categories (which I think is the case), consider using the same range for each figure. You might even want to plot the points of multiple categories in one figure with different colours, instead of having 14 individual subplots.
293
Please avoid using the term ‘reliable’ when talking about an objective appraisal, as this is a subjective judgement without qualitative or quantitative data to support the statement. It would improve the statement to shortly explain how the iterative processes and the focus on abnormal conditions took place, as I do not recall reading about this in the methods section. I believe this is important, because the bulletins form an integral part of your analyses.
298-300
Great to see the authors mention multiple outlets to complement NDMA bulletins!
306
‘Good’: please quantify this by for instance mentioning the average AUC value, or repeat the AUC value of the top 3 categories.
320
‘a kind of lag’ – please specify in more detail.
339
‘low population density does not imply low water stress’ – this calls for a reference.
350
‘should have suffered from water scarcity during periods of drought due to the high population density’ – this reads a bit dark. Consider rephrasing the sentence so it does not read as if you want these counties to suffer.
371-373
this was already mentioned earlier in the manuscript and does not need repetition here.
376-378
The authors mention comparisons with these studies are difficult due to different socio-economic and climatic circumstances. That’s a fair point, but how do they compare in terms of resolution? Is it practically possible to compare these quantitatively, or does this also not work?
400-401
Could you please include a reference to this existing database (or name it)?
432-433
Is there any indication of the spatial resolution required to capture the regional differences?
431-437
This last paragraph of the conclusion reads well and contains and sensible information, but I think it undermines the results of the study. It reads as if the work explored in this study is disqualified a bit, since focus is put on the need for finer resolutions to contribute to the development of early warning systems. I recommend the authors to ‘praise’ their own work a bit more in this last paragraph, instead of talking it down.
-
AC4: 'Reply on CC1', Marleen Lam, 05 Dec 2022
Thank you very much for reviewing our manuscript. We are very pleased that you recognize the scientific relevance of our research. The minor and moderate suggestions for revision will be considered and incorporated.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC4
-
AC4: 'Reply on CC1', Marleen Lam, 05 Dec 2022
-
RC1: 'Review of Lam et al. (2022): Linking reported drought impacts with drought indices, water scarcity, and aridity: the case of Kenya', Anonymous Referee #1, 25 Nov 2022
General comments:
In this study, Lam et al. explore statistical links between drought impacts, drought indices, and water scarcity in five counties in Kenya from 2014 to 2020. The authors use seven impact factors retrieved from the monthly bulletins of the National Drought Management Authority in Kenya and four drought indices at various time scales. A random forest model was used to measure statistical associations between the impacts and drought indices. Their result shows the drought indices that are best linked with each of the impact factors and indicate that these associations are also influenced by the aridity level of the regions.
This study on droughts is in the scope of the NHESS journal and is a good contribution to the field of drought impacts, mainly as it is one of the few studies focusing on the African continent. However, the method and the analysis in the manuscript are not sufficient to support some of the authors’ arguments, and the overall presentation of the manuscript (some figures and discussions) needs to be improved. I suggest some major revisions before considering a possible publication in the journal. Below find my comments.
Specific comments:
- Abstract
It would be nice to add two or three more sentences about the results (for example, which indices were included in the study, which one is the most associated with which impact factor, etc.) to give more information to the readers. As it is presented, the abstract sounds very general, and hard to see what the significant contribution of this study is. Also, if the authors’ study is the first one (or one of the few) that explores the link between drought impacts and drought indices in the African continent, that should be emphasized.
1. Introduction
Line 22) "Due to the projected increase in drought frequencies": In which regions and when will this increase become significant? Note that the increases in drought frequency and intensity are not globally equal. See Chapter 11 in the new IPCC The Physical Science Basis report (Seneviratne et al., 2021) and cite this new report instead of the 2014 version.
Line 26) What do the authors mean by "a hydro-meteorological event"? "Hydrometerological events" in general or "drought" or "extreme hydrometeorological events"?
Line 27) I am not convinced here how "Linking drought impacts to drought indices" could contribute to "the development and improvement of the M&EW .". Will it improve drought forecasting (in which way)? Help with better preparedness? Please elaborate on this sentence better.
Line 30-43) This paragraph should go after the first paragraph (from line 22), as it introduces drought indices before discussing "Linking drought impacts to drought indices.”
Line 31) In this study, as impacts on society are considered, I would also add a definition of socio-economic drought, which occurs when water as an economic good cannot meet demands due to a weather-related shortfall. For the definition, see, for example, Mishra and Sigh (2010).
Line 33-35) Add a citation for this argument.
Line 41) I suggest adding an explanation of some drought indices associated with each drought type after Yihdego et al. (2019) (at least the indices the authors use for this study). So the readers can grasp a better idea of which indices are usually used in drought studies.
Line 41) "Models using drought indices to forecast drought can detect climate signals": It is unclear what the authors mean by models (which models?) and climate signals. Please elaborate on this better.
Line 42) "Yet, the link between drought indices and socio-economic impacts has rarely been analyzed": But the authors have mentioned several papers on linking drought indices and impacts in the discussion and conclusion section (Blauhut et al., 2015; Bachmair et al., 2015, 2016, 2018, Stagge et al., 2015, Ma et al., 2020; Wang et al., 2020, among others.), right? Even in the next paragraph, the authors claimed that some works on the topic already exist and they are focused on Europe and the U.S. If these previous works had motivated the authors to perform this study, these works should be acknowledged and elaborated better in the introduction: which indices and methods were the previous studies used? What do their results say about the link between droughts and drought impacts? What is the difference between these studies and the authors' study? Also, was there any previous research analyzing the impacts of droughts over Kenya or the African continent?
Line 48) What does EDII stand for?
Line 65-67) I would remove “because of.” and split the sentence in two: “In this study, we focus on Kenya. The country is characterized by strong gradients in precipitation …”.
Line 74) I am not sure what the authors mean by “arid circumstances.”. Drought? Aridity index?
Line 75) "It is expected that drought indices will show a somewhat similar response across different climatic zones in Kenya because of the standardized nature of drought indices": For me, it is not very clear why the authors expect this. Standardizing indices standardize a water balance variable with respect to the climate of a region to facilitate the comparison across the different climates. But this does not make all regions have similar spatial and temporal variability of a drought index or respond similarly to drought drivers. Droughts can be local or regional events caused by different drivers. In addition, different surfaces, vegetation, and geographical conditions affect the spatiotemporal characteristics of droughts. Thus, even neighboring regions can present different drought conditions. For example, see the plots in Spinoni et al. (2015) which show different drought frequencies and extent (using SPI, SPEI, and RDI) in Europe.
2. Data and Methods
2.1. Study Area
Line 98) "This study will specifically focus on a region in Nyeri, namely Kieni...". This sentence is confusing. So, if I understood it correctly, inside of Nyeri, the authors will focus only on Kieni. Then, I would rephrase a sentence like, "This study specifically focuses on one region in Nyeri, namely Kieni. From now on, with Nyeri, we refer to only the Kieni region.".
If this is the case, why is only Kieni selected in the Nyeri region?
2.2. Data
Line 108) Add a citation for the MSWEP version 1.1.
Line 111) The spatial resolution of MSWEP is 0.25 degree based on Beck et al., 2017.
Line 117) Add a citation for MSWEP v2.8.
Line 117) Add a citation for Priestley and Taylor equation.
Line 118) Add a citation for ERA5.
Line 125) Add a citation for GLOFAS.
Line 127) What do HTESSEL and LISFLOOD stand for? Also, add a citation for each of the data and models, including ERA5, if the ERA5 dataset used in this part is a different version from the previously mentioned one.
Line 128) The last sentence about LISFLOOD can be removed if this is only the input model used to generate the GLOFAS data.
Line 130) I would suggest changing the subtitle to "Impact data from the National Drought Management Authority" to make it more consistent with the other subtitles.
Line 138) Add a citation for the VCI Index and a brief description of this index (how is this index estimated?)
Line 138) Add a citation after the Food Consumption Index and briefly describe this (how is this index estimated?)
Line 139) Add a citation for the malnutrition index.
Line 140) I suggest splitting the sentence into two. For example, "This study has utilized water scarcity (WS) data from McNally et al. (2019). The data is a monthly water scarcity dataset with a spatial resolution of 0.1 degree for Africa from March 2018 to the present."
Line 143) FLDAS is driven by the satellite-based CHIRPS precipitation. I can see that the authors mention this in the Appendix, but also add a sentence about it in this section to make it consistent with the description of other datasets.
CHIRPS is a different precipitation dataset and is not included in generating the MSWEP data. How would this WS based on a different set of precipitation affect your final result? This could be briefly discussed in the Discussion section.
Line 144) Add a citation for the Falkenmark index.
Caption for Table 1) Mention here that the Falkemark index is for the water scarcity level. Something like: "Falkenmark index for the water scarcity level".
2.3. Methods
2.3.1. Drought Indices
Line 158) The values are not fit through a standard normal distribution from the beginning. The cumulative precipitation is transformed to a normalized distribution by first "fitting a parametric statistical distribution" (Stagge et al., 2015a), which in general, they are a Gamma or Pearson III distribution for SPI (2015a) and a log-logistic distribution for SPEI (Vicente-Serrano et al., 2010; Beguería et al., 2014). Read Stagge et al. (2015a) and Stagge et al (2015b) to see different types of distributions for precipitation. Which type of distribution did the authors use?
Line 159) Are the values area-weighted averaged? If yes, mention it.
Lines 160) Is the same procedure for SPI and SPEI (cumulating monthly values, fitting a specific type of parametric distribution, then normalizing the distribution) also applied to SMI and SSMI? If yes, which parametric distributions were used for these two variables?
Line 162) "incorporates temperature by including evaporation anomalies": It is the other way around: SPEI incorporates the effects of potential evapotranspiration, which depends strongly on the temperature. And for the calculation, “potential evapotranspiration” is included, not “evaporation anomalies”. See Beguería et al. (2014).
Line 166) Is GLOFAS or GloFAS? Previously, the authors used GLOFAS (Line 125). Be consistent with the acronyms.
Line 179) The authors never define what they consider “drought”. What is the threshold the authors set to define drought periods?
General comments about Section 2.3.1) As the authors already mentioned in the introduction, each drought index represents different types of droughts: SPI is usually for meteorological drought that only considers the effect of precipitation, while SPEI and SMI are agricultural or soil moisture droughts, and SSI is the hydrological drought (See Yihdego et al., 2019). This difference between the indices should be clearly explained here.
2.3.2. Drought Impact data
Line 176) A citation or a link to this website should be added if this is an official website. Also, is this impact data categorized by the authors available online?
Lines 181-183) Could the authors explain more about how this impact data was transformed into a binary value? With binary values, it means 1 if there was an impact and 0 with no impact? What do the authors mean by normal values compared to the previous years? How are these normal values defined (based on the years with no drought?), and what is the threshold to distinguish impact and no impact? (For example, when there is 30% or 50% -or whatever percentage – of crop failures compared to the normal no-drought year?)
Line 181) How do the authors define drought impacts? Bachmair et al. (2016) define a drought impact as “a negative environmental, economic, or social effect experienced under drought conditions (based on EDII).”.
Did the authors initially categorize all the impacts from NMDA, then select only drought impacts (impacts under drought conditions) for the analysis?
Line 187) What is exactly food insecurity? How is this impact factor measured, and how independent is this factor from crop losses and milk production?
Line 191) Add a brief explanation of Jaccard similarity. How does this method work?
2.3.3. Random Forest model
General comment about the section) The overall explanation of RF should be improved, and many parts of the appendix should be included in this section. For instance, partitioning the data in training and testing sets is essential information for readers to know how the RF model was built. Have a look at Bachmair et al. (2016) to see how they introduce the RF method for their work. And more technical parts related to sampling (lines 196-201) could be moved to the appendix.
About the RF model) Usually, machine learning techniques aim to deal with large datasets. How sensitive is the output of an RF model to the number of total values or the length of each variable? Is there any metric that shows how statistically significant is the fitting of your data?
I’m asking this as I can see the Kenya dataset is only for 7 years, which is 84 months in total. Based on the Appendix, 75% of the dataset is used to feed and train the model (63 months) and the rest of the months to validate the output (11 months).
How can the authors ensure that the RF is a good tool for this dataset?
Could the authors repeat their analysis by using a very simple approach, such as a sort of correlation analysis (Spearman rank or Pearson correlation analysis, similar to Bachmair et al., 2016, or a simple linear regression model.) between the impact data and drought indices? This analysis takes into account the number of datasets in the significance test (by considering the degree of freedom). This additional analysis can be compared to the output of the RF, helping to see whether the result from the RF is robust enough and whether the sample size affects the final output.
Line 201) Mention here more clearly that the regions are divided by their aridity indices or levels to assess the relationship between aridity levels and drought indices.
Also, what do the authors mean by aggregated? The three regions are averaged to be one input data in the RF?
Line 204) Add a citation for the ROC method and explain briefly how this is calculated.
3. Result
3.1. Drought Impacts
Line 225 - 234) The authors mainly discuss the impacts and droughts in Masarabit and Nyeri. But what about other regions? Why are the other three regions not included in the result? The plots for impacts and drought indices of other regions should also be included, at least in the appendix or supplement.
Line 210) "Drought impacts": The same as the comment for line 181. Are these the total impacts during 2014-2020, or only the impacts during droughts? How do the authors define a drought impact? As I can see from Fig 1., some impacts occur during the non-drought period in SPEI (for example, September 2020).
Table 2) The same comment as above. Are these "drought impacts" or the total impacts?
And I would suggest replacing "Count" with "Number of impacts".
Line 219) Replace "A timeline" with a "time series".
Figure 2 (and also for other figures)):
- Enlarge the texts and labels, please. They are challenging to read. Also, enumerate each subplot (with a. b. c. d.) to indicate the part the authors want to discuss clearly.
- The plots for impacts (the upper two panels) are complicated to read and distinguish the years. I wonder if there is a better way to visualize these plots: for example, by transforming them to line or bar plots, or somehow finding a way to make the font bigger and add a vertical line to divide the years.
- In the caption: "time series" instead of "timeline".
Line 226) The same comment as line 210: Is this the number of total impacts or drought impacts?
Lines 235-138) What is the authors' criterion to claim "a bit of relation" and "no relation"? Why is 0.63 slightly similar but 0.4 not? Also, add a significance level of the Jaccard similarity.
Table 3) How statistically significant are these correlations? One can add asterisks next to the values when they are statistically significant.
Line 230) What do the authors mean by "a direct relation"? Could you clarify it?
Line 231) What and when is the onset of drought? This question is also a continuation of the comment in line 179: To mention the onset of drought, what the authors define as drought periods should be clarified first.
Overall comment about the section) I'm quite skeptical about including the indices with 2 years time scale (24 months) in this study. A higher time scale index is appropriate to represent relatively long dry and wet conditions. Even though these indices have a monthly time scale, one monthly value of the index represents two-year hydroclimate condition (Hence, also showing a strong persistence.).
The Kenya dataset is too short (7 years) to include these indices for the analysis. If one drought occurs in one month in this index, this indicates a dry period of almost 30% (2 years) of the study period. The authors can also see this in the time series of SPEI-24 for Masarabit: more than 50-60% of the time series shows a drought condition covering a large portion of the impacts. The plot basically indicates that there is only one long drought event for the analysis, and I don't think one can derive a significant statistical association from this time series.
Another study, for instance, Bahmair et al. (2016) can include this long-time scale index, as their dataset starts in 1970.
3.2. Drought impacts and water scarcity.
Line 240) Start the paragraph mentioning that the association between WS and droughts is done from 2018 due to the length of the WS dataset.
Line 241) Remove “(start of timeframe WS dataset)”.
Figure 3) The same for all figures: Enlarge the text and labels in the plots. And I suggest putting the enumeration and the subtitles of the plots at the top and not at the bottom of the figures, as they are mixed with the captions.
Fig.3a) Suggest replacing “amount of months” with "number of months in a year"
General comment about the section) How is water scarcity associated with drought conditions? Have the authors made some comparisons between water scarcity and drought indices? Whether water scarcity always occurs during droughts or not should be analyzed and included in the section.
I am asking this as in 2020, there is no drought in Masarabit and Nyeri (Fig. 2), and many regions are under no water stress during 2018-2020 (Fig 3.b). Can it be that this no-water stress situation occurs mainly in 2020 (during no-drought condition)?
And do all the impacts during 2018-2020 occur under droughts, or are these the total impacts during 2018-2020 (including non-drought period)?
Line 246) The same comment about the "drought impact” in line 181.
3.3. Drought Indices and Random Forest
General comment about the section) For this section, the question is again, why do the authors show and discuss the results for only some regions? In addition, in Fig.4 only some specific impacts are discussed for a few regions and not all. Why is that? If some regions do not show significant connections with impacts, they should still be mentioned and included in the plot. If including everything enlarge the plots, the authors can think about putting dots in different colors for different impact categories.
Line 250) I suggest changing the title to something similar to “association between drought impacts and drought indices” or “Random Forest to link drought impacts and drought indices” instead of the name of the method the authors use.
Line 251) Mention here again that the regions are aggregated by their aridity levels or indices to make clear that the analysis in this part also considers the aridity levels of different counties.
Table 4) How is this performance measured? Clarify it in the caption.
Line 262) What do the authors mean by "best linked"? How is the threshold for the best and worst defined? Please clarify.
Line 265) The authors never mentioned what the MDA and Ginni Index are. Add a brief description of these scores and how they are estimated in the Method section. Some explanation of these scores is in the appendix, but this information should also be included in the Method.
Fig 3 and 4) Enumerate each of the plots to distinguish the Ginni and MDA plots.
Line 270) SPEI is clearly not a meteorological index. The authors also mentioned in the introduction that meteorological drought considers only a precipitation deficit.
4. Discussion
4.1. Data sources and methods
Line 290) How many monthly gaps does this report have (how many percentages of the total years)? This should be mentioned in the data section.
Line 292) What do the authors mean by iterative processes? Which kind of iterative process is applied to deal with the missing months? Please include this in the Method section.
After all, how would these missing gaps affect the result of this analysis?
Line 295) What does DIR stand for? Also, add a citation for EDII and DIR.
Line 299) What does EM-DAT stand for? Add a citation for EM-DAT and DesInventar.
Line 305- 306) I am not sure whether the AUC supports this statement. The authors are only using one method here. How can one confirm the output of the model if this is not compared to other methods? Also, how sensitive is an RF model to the number of samples? See my comment for Section 2.3.3.
4.2. Relations with aridity
Line 310) I'm not sure if the result partially supports the first sentence. All the activities included here are mainly pastoral and not other activities, right? Then how do the authors know the impacts on other industrial or economic sectors?
Line 320) What do the authors mean by the onset? The authors never defined when an actual drought starts and when is the onset of these droughts. See my comments for line 179.
Line 322) After evapotranspiration, add some citations referring to this argument (for example, Seneviratne et al., 2021).
Line 322) "However, this study cannot link drought occurrence to aridity.". This sentence is not very clear to me what the authors want to say. Aridity is a constant arid condition. Do you mean the aridity level of the regions, constant desertification due to climate warming, or just the impacts of drought frequencies related to global warming? Elaborate on the sentence, please.
Line 322) "short time period (10 years)". Why 10 years, if based on what I understood, the authors are taking the study period from 2014 to 2020? (7 years, including 2020).
Line 323) "interannual trend": Do the authors want to say "long-term" trend? The interannual trend does not make sense for a longer time scale analysis that the authors claim to be necessary.
Line 323-325) Overall, these two sentences are pretty difficult to understand what the authors want to say. How would a longer time series help to understand drought events and drying climate? Do you mean a long-term drying trend and a change in the climate? How is this connected to aridity and droughts? Please clarify this part.
4.3. Water scarcity and drought impacts
Line 329-331) “Increased distance from water sources was reported in arid and semi-arid regions during most of the months when meteorological and hydrological drought conditions occurred (Figure 3b)”: I don't see this result in Fig. 3b.
Line 346-347) I cannot see any statistical relationship between WS and droughts as the authors haven’t compared the two datasets in the Result section (see the comment for Section 3.2). In addition, for me, 2020 looks like an almost drought-free period in Masarabit and Nyeri (Fig 2.)
4.4. Drought indices and the Random Forest model
Line 358) See my comment about the drought indices with a 24-month time scale in Section 3.1. SPEI-24 is not appropriate for this analysis with a 7-year-long time series to derive a robust statistical association.
Line 361) “These differences between the best match between drought impacts and drought indices implies therefore a link with human activities”: The argument here is not very convincing for me. How are the authors’ results on the best and worst indices related to human activities and the propagation of droughts? Please elaborate on this argument better.
Line 363) Again, SPEI is not a meteorological drought. It is an agricultural or soil moisture drought, and when the time scale becomes longer, it also shows memories that are comparable with the soil moisture anomalies.
Line 367) “Noticeable is that SSI gives a possible link with water-dependent activities while SSMI shows a possible link with agricultural practices.”: Isn’t this expected as SSI represents hydrological drought and SSMI the soil moisture drought? Clarify the division between different drought indices in the Method section (See my comment for Section 2.3.1).
Line 369) Again reiterating. SPEI is clearly not a meteorological drought.
Line 371-372) Note that Dai et al. 2020 use only SPI which is a meteorological drought index. They do not include SPEI in their analysis.
Line 375) Again, for this short dataset of 7 years, I would not include an index with a time scale of 2 years for statistical analysis.
Line 378-381) Could this difference in the result between the authors’ work and Bachmair et al. (2016) in Europe be related to each country’s economic and resilience capacity to droughts? It would be worth discussing how the economic condition of each country affects drought impacts and the result of this study.
Line 386) The same comments about the indices with a 24-month time scale (line 375).
Line 387) If the authors claim that long SPI and SPEI are the hydrological droughts, the hydrological index (SSI) at various time scales must also show a significant association to those impacts associated with long SPI and SPEI. The association of SSI should be discussed in more detail here and in the Result section.
Line 396) I can imagine that the adaptation measures will increase the resilience of smallholders. However, the sentence does not clearly explain how these measures will affect the result of this study.
Line 396-398) Difficult to understand. I suggest splitting the sentence into two.
Overall comments) Also, the effects and drawbacks of short time series and the RF method should be discussed briefly in this section.
5. Conclusion
Line 406) The same as the comment for line 22. Where and when?
Line 412) Again, why is the discussion mostly about Masarabit and Nyeri?
Line 418) What do the authors mean by “unreliable water condition”? Please clarify.
Line 419) I can see that in 2020, not many drought months occurred. Again, are all the impacts counted during the drought conditions or for the entire period?
Line 426) Same comments for the drought indices with a 24-month time scale.
Line 432-433) Do the authors mean over the entire globe or in Africa? Clarify it.Line 433-434) What about the temporal resolution of the dataset?
Line 434) What do the authors mean by “other research areas”?
Technical corrections:
Line 6) include “that” between “indices” and “best”.
Line 7) Remove “:”.
Line 10) “aridity” to “aridity level of the regions”.
Line 15) "impactful" instead of "impact-full".
Line 52) Replace "setting" with "regions".
Line 98) Remove "However".
Line 103) "reanalysis" instead of "re-analysis".
Line 104) Remove "extracting".
Line 109) the complementary nature of "the" highest.
Line 115) Remove (v3.5a).
Line 122) Add e.g., before the papers.
Line 140) "has" instead of "have".
Line 141) "from March 2018 to the present." instead of "between".
Line 146) Replace "However" with "For this" or something similar. “However” here does not make sense.
Table 1) Add a horizontal line between the first row and the second row.
Line 176) Instead of "between ... and ... " use "from ... to ...".
Line 236 and 247) Replace "a bit" with "slightly".
Line 317) Add "Also" after the first sentence. "Also, the interference...".
Line 320) Replace "a kind of" with "some time" lag.
Line 322) "cannot " instead of "can not".
Line 338) Remove “;” after Kenya.
Line 340) in "the" ASAL counties.
Line 362) "imply" instead of "implies".
Line 397) Add "Our" in front of the Result.
Line 399) "the first step".
Line 400) I'd use "exploring the link" instead of "analysing".
Line 409) Remove "In addition".
Plots in general) Enlarge the fonts and dots.
References:
Bachmair, S., Svensson, C., Hannaford, J., Barker, L.J. and Stahl, K., 2016. A quantitative analysis to objectively appraise drought indicators and model drought impacts. Hydrology and Earth System Sciences, 20(7), pp.2589-2609.
Beguería, S., VicenteâSerrano, S.M., Reig, F. and Latorre, B., 2014. Standardized precipitation evapotranspiration index (SPEI) revisited: parameter fitting, evapotranspiration models, tools, datasets and drought monitoring. International journal of climatology, 34(10), pp.3001-3023.
Mishra, A.K. and Singh, V.P., 2010: A review of drought concepts. Journal of hydrology, 391(1-2), pp.202-216.
Seneviratne, S.I., X. Zhang, M. Adnan, W. Badi, C. Dereczynski, A. Di Luca, S. Ghosh, I. Iskandar, J. Kossin, S. Lewis, F. Otto, I. Pinto, M. Satoh, S.M. Vicente-Serrano, M. Wehner, and B. Zhou, 2021: Weather and Climate Extreme Events in a Changing Climate. In Climate Change 2021: The Physical Science Basis. Contribution of Working Group I to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change [Masson-Delmotte, V., P. Zhai, A. Pirani, S.L. Connors, C. Péan, S. Berger, N. Caud, Y. Chen, L. Goldfarb, M.I. Gomis, M. Huang, K. Leitzell, E. Lonnoy, J.B.R. Matthews, T.K. Maycock, T. Waterfield, O. Yelekçi, R. Yu, and B. Zhou (eds.)]. Cambridge University Press, Cambridge, United Kingdom and New York, NY, USA, pp. 1513–1766, doi:10.1017/9781009157896.013.
Spinoni, J., Naumann, G., Vogt, J. and Barbosa, P., 2015: European drought climatologies and trends based on a multi-indicator approach. Global and Planetary Change, 127, pp.50-57.
Stagge, J.H., Kohn, I., Tallaksen, L.M. and Stahl, K., 2015: Modeling drought impact occurrence based on meteorological drought indices in Europe. Journal of Hydrology, 530, pp.37-50.
Stagge, J.H., Tallaksen, L.M., Gudmundsson, L., Van Loon, A.F. and Stahl, K., 2015b: Candidate distributions for climatological drought indices (SPI and SPEI). International Journal of Climatology, 35(13), pp.4027-4040.
Citation: https://doi.org/10.5194/egusphere-2022-458-RC1 -
AC1: 'Reply on RC1', Marleen Lam, 05 Dec 2022
We would like to thank the reviewer for providing a detailed and in-depth review of our manuscript. We are pleased to see the reviewer confirming that our study is one of the few studies focusing on the African continent in relation to linking drought and drought impacts, and that in principle a revised version of the manuscript could be publishable. The comments, which in general we are happy to account for, will help us to further improve the manuscript and significantly raise the quality. We will provide a detailed point-by-point response to the comments in a rebuttal.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC1 -
AC5: 'Reply on RC1', Marleen Lam, 05 Dec 2022
Thank you again for your comments and suggestions. We would like to address some of the major comments already. The reviewer suggests to exclude the drought indicators with an accumulation period of 24 months because of the length of the dataset (7 years). This remark will definitely be considered and the analysis can easily be carried out without the accumulation period of 24 months. In addition, there is suggested to compare the random forest analysis with a sort of correlation analysis between the drought impact data and the drought indices. This could be a good way to validate the model and will be considered and carried out. The reviewer addressed as well that some choices (such as highlighting some counties over others) should be explained in more detail which will be solved by adding some extra sentences. Overall, we are happy to make these adjustments and to take these remarks into consideration which will result in an improved quality of the manuscript.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC5
-
RC2: 'Comment on egusphere-2022-458', Anonymous Referee #2, 28 Nov 2022
Manuscript Number: egusphere-2022-458
Title: Linking reported drought impacts with drought indices, water scarcity, and aridity: the case of Kenya
Authors: Marleen R. Lam, Alessia Matanó, Anne F. Van Loon, Rhoda Odongo, Aklilu D. Teklesadik, Charles N. Wamucii, Marc J. C. van den Homberg, Shamton Waruru, and Adriaan J. Teuling
The work analyzes the applicability of drought indices in explaining drought impacts using Kenya as a case study. A Random Forest (RF) model was used to identify which drought indices best explains drought
impacts on pasture, livestock deaths, milk production, crop losses, food insecurity, trekking distance for water, and malnutrition.
Major Comments:
- The results presented in lines 230-235 state that the drought impacts better overlaps with the accumulation time of 12 months in comparison with the others tested (1, 3, 6 and 24). However, isn’t expected that the effects of meteorological droughts on pasture, milk production and etc. be lagged either than happening simultaneously? How is this lag effect considered? Also, the SPEI indices of all accumulation times displayed in Figure 2 show good overlap with the impacts listed.
- The study does not account for drought resilience actions that could have been applied in the region (e.g., reservoirs, irrigation, etc.) and can affect the correlation of meteorological droughts and its impacts.
- How do the current results compare with the one cited in line 301-305, using other correlation metrics?
Minor Comments:
- It should be mentioned in the abstract which drought indices were utilized in the analysis and which ones better performed in predicting the drought impacts.
- In Figure 2, what do the circles in the upper panel represent? Is it only if the impact occurred or does it reflect the number of occurrences?
Citation: https://doi.org/10.5194/egusphere-2022-458-RC2 -
AC2: 'Reply on RC2', Marleen Lam, 05 Dec 2022
Thank you very much for reviewing our manuscript, and for the assessment that a revised version should be of interest for NHESS. The remarks and suggestions are very useful, and they will help us to further improve the manuscript. In general we are happy to account for all the comments made in a revised version. We will provide a detailed point-by-point response to every comment in a rebuttal.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC2 -
AC3: 'Reply on RC2', Marleen Lam, 05 Dec 2022
Publisher’s note: this comment is a copy of AC2 and its content was therefore removed.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC3 -
AC6: 'Reply on RC2', Marleen Lam, 05 Dec 2022
Thank you again for your comments and suggestions. We would like to address some of the major comments already. The reviewer notifies that effects of meteorological drought on pasture, milk production and etc. are lagged either than happening simultaneously. We definitely agree with this comment. The lag is considered in the accumulation periods considered in this study: shorter timescales (3 or 6) are often used to monitor agricultural drought while longer accumulation periods (12 or 24) are used to monitor hydrological drought (Wang et al., 2020). Therefore, combining the drought impacts with these different timescales already includes a memory or the so-called ‘lag’ of drought impacts. The revised manuscript will elaborate a bit more on this topic to improve the clarity. As the other reviewer suggested as well, the results of the random forest model can be compared with other correlation metrices. This is a good suggestion and will be carried out. Overall, we are happy to take these remarks into consideration and make adjustments accordingly.
Wang, Y., Lv, J., Hannaford, J., Wang, Y., Sun, H., Barker, L. J., ... & Eastman, M. (2020). Linking drought indices to impacts to support drought risk assessment in Liaoning province, China. Natural Hazards and Earth System Sciences, 20(3), 889-906.
Citation: https://doi.org/10.5194/egusphere-2022-458-AC6
Peer review completion
Journal article(s) based on this preprint
Viewed
HTML | XML | Total | BibTeX | EndNote | |
---|---|---|---|---|---|
1,039 | 330 | 44 | 1,413 | 20 | 20 |
- HTML: 1,039
- PDF: 330
- XML: 44
- Total: 1,413
- BibTeX: 20
- EndNote: 20
Viewed (geographical distribution)
Country | # | Views | % |
---|
Total: | 0 |
HTML: | 0 |
PDF: | 0 |
XML: | 0 |
- 1
Cited
3 citations as recorded by crossref.
- Advances and gaps in the science and practice of impact‐based forecasting of droughts A. Shyrokaya et al. 10.1002/wat2.1698
- Linking reported drought impacts with drought indices, water scarcity and aridity: the case of Kenya M. Lam et al. 10.5194/nhess-23-2915-2023
- Significant relationships between drought indicators and impacts for the 2018–2019 drought in Germany A. Shyrokaya et al. 10.1088/1748-9326/ad10d9
Alessia Matanó
Anne F. Van Loon
Rhoda Odongo
Aklilu D. Teklesadik
Charles N. Wamucii
Marc J. C. van den Homberg
Shamton Waruru
The requested preprint has a corresponding peer-reviewed final revised paper. You are encouraged to refer to the final revised version.
- Preprint
(2083 KB) - Metadata XML