Debohid: a Differential Evolution Based Oversampling Approach for Highly Imbalanced Datasets

No Thumbnail Available

Date

2021

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

PERGAMON-ELSEVIER SCIENCE LTD

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Class distribution of the samples in the dataset is one of the critical factors affecting the classification success. Classifiers trained with imbalanced datasets classify majority class samples more successfully than minority class samples. Oversampling, which is based on increasing the minority class samples, is a frequently used method to overcome the class imbalance. More than two decades, many oversampling methods are presented for the class imbalance problem. Differential Evolution is a metaheuristic algorithm that achieves successful results in a lot of domains. One of the main reasons for this success is that DE has an effective candidate individual generation mechanism. In this work, we propose a novel oversampling method based on a differential evolution algorithm for highly imbalanced datasets, and it is named as DEBOHID (A differential evolution based oversampling approach for highly imbalanced datasets). In order to show the success of DEBOHID, 44 highly imbalanced ratio datasets are used in experiments. The obtained results are compared with nine different state-of-art oversampling methods. In order to show the independence of the experimental results to classifier, Support Vector Machines (SVM), k-Nearest Neighbor (kNN), and Decision Tree (DT) are used as a classifier in the experiments. AUC and G Mean metrics are used for the performance measurements. The experimental results and statistical analyses have shown the triumph of the DEBOHID.

Description

Keywords

Imbalanced data learning, Differential evolution, Oversampling, Class imbalance

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
29

Source

EXPERT SYSTEMS WITH APPLICATIONS

Volume

169

Issue

Start Page

114482

End Page

PlumX Metrics
Citations

CrossRef : 37

Scopus : 42

Captures

Mendeley Readers : 39

SCOPUS™ Citations

41

checked on Feb 03, 2026

Web of Science™ Citations

36

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
3.67148871

Sustainable Development Goals

SDG data is not available