Addressing Class Imbalance in Soil Movement Predictions

Kumar, Praveen; Priyanka, Priyanka; Uday, Kala Venkata; Dutt, Varun

doi:https://doi.org/10.5194/egusphere-2023-1417

Praveen Kumar, Priyanka Priyanka, Kala Venkata Uday, and Varun Dutt

Abstract. Landslides threaten human life and infrastructure, resulting in fatalities and economic losses. Monitoring stations provide valuable data for predicting soil movement, which is crucial in mitigating this threat. Accurately predicting soil movement from monitoring data is challenging due to its complexity and inherent class imbalance. This study proposes developing machine learning (ML) models with oversampling techniques to address the class imbalance issue and develop a robust soil movement prediction system. The dataset, comprising two years (2019–2021) of monitoring data from a landslide in Uttarakhand, was split into a 70:30 ratio for training and testing. To tackle the class imbalance problem, various oversampling techniques, including Synthetic Minority Oversampling Technique (SMOTE), K-Means SMOTE, Borderline SMOTE, Support Vector Machine SMOTE, and Adaptive SMOTE (ADASYN), were applied to the dataset. Several ML models, namely Random Forest (RF), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (Light GBM), Adaptive Boosting (AdaBoost), Category Boosting (CatBoost), Long Short-Term Memory (LSTM), Multilayer Perceptron (MLP), and dynamic ensemble models, were trained and compared for soil movement prediction. Among these models, the dynamic ensemble model with K-Means SMOTE performed the best in testing, with an accuracy, precision, and recall rate of 99.68 % each and an F1-score of 0.9968. The RF model with K-Means SMOTE stood out as the second-best performer, achieving an impressive accuracy, precision, and recall rate of 99.64 % each and an F1-score of 0.9964. These results show that ML models with class imbalance techniques have the potential to significantly improve soil movement predictions in landslide-prone areas.

Received: 27 Jun 2023 – Discussion started: 10 Aug 2023

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this preprint. The responsibility to include appropriate place names lies with the authors.

Download & links

Preprint (PDF, 627 KB)

Download & links

Journal article(s) based on this preprint

06 Jun 2024

Addressing class imbalance in soil movement predictions

Praveen Kumar, Priyanka Priyanka, Kala Venkata Uday, and Varun Dutt

Nat. Hazards Earth Syst. Sci., 24, 1913–1928, https://doi.org/10.5194/nhess-24-1913-2024,https://doi.org/10.5194/nhess-24-1913-2024, 2024

Short summary

Country	#	Views	%
United States of America	1	144	34
India	2	61	14
Germany	3	44	10
China	4	25	5
Netherlands	5	20	4


Total:	0
HTML:	0
PDF:	0
XML:	0

Addressing Class Imbalance in Soil Movement Predictions

Journal article(s) based on this preprint

Interactive discussion

Interactive discussion

Peer review completion

Journal article(s) based on this preprint

Viewed

Viewed (geographical distribution)