作成者 |
|
|
|
|
|
|
本文言語 |
|
出版者 |
|
発行日 |
|
収録物名 |
|
巻 |
|
開始ページ |
|
終了ページ |
|
会議情報 |
|
出版タイプ |
|
アクセス権 |
|
Crossref DOI |
|
権利関係 |
|
権利関係 |
|
概要 |
Imbalanced data is common and presents significant challenge towards classification of data. In this research, we present a combination of two techniques used for handling class imbalance in datasets,... SMOTE (Synthetic Minority Over-sampling Technique) and Tomek Links. Each strategy handles the class imbalance problem in a unique way, and their combination attempts to create a more balanced and cleaner dataset for training machine learning models to handle binary classification by addressing problematic or difficult-to-classify data. Machine learning classifiers used in this study are K-Nearest Neighbour (KNN), Support Vector Machine (SVM), Logistic Regression, Decision Tree (DT), Random Forest (RF), Gradient Boosting, Extreme Gradient Boosting (XGBoost), Light Gradient Boosting (LGBM), AdaBoost and Catboost. It has been discovered that the mean F1 score for resampled datasets provides more trustworthy results for forecasting floods.続きを見る
|