The Effect of Balancing Process on Classifying Unbalancing Data Set

dc.contributor.author Jassim, Samara
dc.contributor.author Kaya, Ersin
dc.date.accessioned 2024-10-03T13:11:21Z
dc.date.available 2024-10-03T13:11:21Z
dc.date.issued 2018
dc.description.abstract Unbalanced data indicates a situation where the number of monitoring is not the same for all categories in the label data set. In some fields, unbalanced data problems are very common. Some of machine learning classifiers failed to deal with unbalanced training data sets because they are sensitive to the proportions of different classes. As a result, these algorithms tend to favor the class with the largest proportion of observations known as the majority class, which may lead to misleading accuracy. Most of data sets are unbalanced because most of the data collected over the diseases are usually not disease. These data when used in the classification algorithm it gave un-well results, the data sets used in the training process must be balanced to increase this success. In this article, (SMOTE) synthetic minority over-sampling technique is used on data sets. K-Nearest Neighbors (K-NN), and Naïve Bayes (NB) classification algorithms are applied to classify the balanced datasets and according to the obtained classification results the balanced data sets achieved a better classification success. en_US
dc.identifier.isbn 978-605-68537-3-9 en_US
dc.identifier.uri https://hdl.handle.net/20.500.13091/6317
dc.language.iso en en_US
dc.relation International Conference on Engineering Technologies en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Unbalanced Data en_US
dc.subject SMOTE en_US
dc.subject K-NN en_US
dc.subject NB en_US
dc.subject Classification en_US
dc.title The Effect of Balancing Process on Classifying Unbalancing Data Set en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.id 0000-0001-5668-5078
gdc.author.institutional Kaya, Ersin
gdc.coar.access open access
gdc.coar.type text::conference output
gdc.contributor.affiliation #PLACEHOLDER_PARENT_METADATA_VALUE# en_US
gdc.contributor.affiliation #PLACEHOLDER_PARENT_METADATA_VALUE# en_US
gdc.description.department Fakülteler, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü en_US
gdc.description.endpage 124 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 121 en_US
gdc.description.wosquality N/A
gdc.virtual.author Kaya, Ersin
relation.isAuthorOfPublication 6b459b99-eed9-45fb-b42f-50fbb4ee7090
relation.isAuthorOfPublication.latestForDiscovery 6b459b99-eed9-45fb-b42f-50fbb4ee7090

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
ICENTE18_proceedingsbookv42.pdf
Size:
2.63 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
3.2 KB
Format:
Item-specific license agreed upon to submission
Description: