Enhancing Classification Accuracy Through Feature Extraction: a Comparative Study of Discretization and Clustering Approaches on Sensor-Based Datasets

dc.contributor.author Esme, Engin
dc.date.accessioned 2023-10-02T11:16:11Z
dc.date.available 2023-10-02T11:16:11Z
dc.date.issued 2023
dc.description Article; Early Access en_US
dc.description.abstract Accuracy in a classification problem is directly related to the ability of features to adequately represent the differences between classes. In sensor-based datasets, measurements taken from the sensor form feature vectors. Measuring a given physical signal with different sensors enables it to be expressed with various feature vectors. For this reason, using sensor fusion is preferred in data acquisition. However, each new sensor added to the system brings problems such as complex sensory and supply circuit structures, extra energy consumption, signal sampling complexity, and time-consumption. On the other hand, in cases where sensor fusion cannot be applied, the ability of data from one sensor to represent classes may be insufficient. To avoid these problems, discretization and clustering approaches are suitable to derive more features from fewer sensors. The aim is to improve the accuracy of classifiers by deriving new feature vectors that can represent sensor data. This research reveals the contributions of clustering and discretization approaches as feature extraction methods to improve classification accuracy. In this study, three widely used machine learning techniques are investigated on Perfume, Wine, Seeds, and Gas datasets from the UCI repository. This comprehensive empirical study indicates that the accuracy of classifiers improves by up to 20% on datasets obtained from some sensors by using both discretization and clustering as feature-extracting methods. en_US
dc.identifier.doi 10.1007/s10115-023-01960-0
dc.identifier.issn 0219-1377
dc.identifier.issn 0219-3116
dc.identifier.scopus 2-s2.0-85169118901
dc.identifier.uri https://doi.org/10.1007/s10115-023-01960-0
dc.identifier.uri https://hdl.handle.net/20.500.13091/4610
dc.language.iso en en_US
dc.publisher Springer London Ltd en_US
dc.relation.ispartof Knowledge and Information Systems en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Hybrid classifier en_US
dc.subject Machine learning en_US
dc.subject Discretization en_US
dc.subject Clustering en_US
dc.title Enhancing Classification Accuracy Through Feature Extraction: a Comparative Study of Discretization and Clustering Approaches on Sensor-Based Datasets en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.institutional
gdc.author.scopusid 57189468408
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.description.department KTÜN en_US
gdc.description.departmenttemp [Esme, Engin] Konya Tech Univ, Dept Software Engn, Konya, Turkiye en_US
gdc.description.endpage 356
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.startpage 339
gdc.description.volume 66
gdc.description.wosquality Q2
gdc.identifier.openalex W4386203228
gdc.identifier.wos WOS:001060370300001
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 1.0
gdc.oaire.influence 2.5208853E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 2.8183462E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 0.15887805
gdc.openalex.normalizedpercentile 0.43
gdc.opencitations.count 1
gdc.plumx.mendeley 3
gdc.plumx.scopuscites 1
gdc.scopus.citedcount 1
gdc.virtual.author Eşme, Engin
gdc.wos.citedcount 1
relation.isAuthorOfPublication c5d1c3d6-4f52-4ee3-bf9a-581fe29780e6
relation.isAuthorOfPublication.latestForDiscovery c5d1c3d6-4f52-4ee3-bf9a-581fe29780e6

Files