Boosting the Oversampling Methods Based on Differential Evolution Strategies for Imbalanced Learning

No Thumbnail Available

Date

2021

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Elsevier

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

The class imbalance problem is a challenging problem in the data mining area. To overcome the low classification performance related to imbalanced datasets, sampling strategies are used for balancing the datasets. Oversampling is a technique that increases the minority class samples in various proportions. In this work, these 16 different DE strategies are used for oversampling the imbalanced datasets for better classification. The main aim of this work is to determine the best strategy in terms of Area Under the receiver operating characteristic (ROC) Curve (AUC) and Geometric Mean (G-Mean) metrics. 44 imbalanced datasets are used in experiments. Support Vector Machines (SVM), k-Nearest Neighbor (kNN), and Decision Tree (DT) are used as a classifier in the experiments. The best results are produced by 6th Debohid Strategy (DSt6), 1th Debohid Strategy (DSt1), and 3th Debohid Strategy (DSt3) by using kNN, DT, and SVM classifiers, respectively. The obtained results outperform the 9 state-of-the-art oversampling methods in terms of AUC and G-Mean metrics (C) 2021 Elsevier B.V. All rights reserved.

Description

Keywords

Imbalanced Datasets, Differential Evolution, Oversampling, Imbalanced Learning, Class Imbalance, Differential Evolution Strategies, Preprocessing Method, Global Optimization, Software Tool, Smote, Classification, Algorithms, Keel

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
18

Source

Applied Soft Computing

Volume

112

Issue

Start Page

107787

End Page

PlumX Metrics
Citations

CrossRef : 23

Scopus : 25

Captures

Mendeley Readers : 25

SCOPUS™ Citations

24

checked on Feb 03, 2026

Web of Science™ Citations

21

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
2.39876078

Sustainable Development Goals

SDG data is not available