Voice Analysis in Dogs With Deep Learning: Development of a Fully Automatic Voice Analysis System for Bioacoustics Studies

dc.contributor.author Karaaslan, Mahmut
dc.contributor.author Turkoglu, Bahaeddin
dc.contributor.author Kaya, Ersin
dc.contributor.author Asuroglu, Tunc
dc.date.accessioned 2025-01-10T20:54:44Z
dc.date.available 2025-01-10T20:54:44Z
dc.date.issued 2024
dc.description , Mahmut KARAASLAN/0009-0002-9386-5806; KAYA, Ersin/0000-0001-5668-5078; Asuroglu, Tunc/0000-0003-4153-0764; Turkoglu, Bahaeddin/0000-0003-0255-8422 en_US
dc.description.abstract Extracting behavioral information from animal sounds has long been a focus of research in bioacoustics, as sound-derived data are crucial for understanding animal behavior and environmental interactions. Traditional methods, which involve manual review of extensive recordings, pose significant challenges. This study proposes an automated system for detecting and classifying animal vocalizations, enhancing efficiency in behavior analysis. The system uses a preprocessing step to segment relevant sound regions from audio recordings, followed by feature extraction using Short-Time Fourier Transform (STFT), Mel-frequency cepstral coefficients (MFCCs), and linear-frequency cepstral coefficients (LFCCs). These features are input into convolutional neural network (CNN) classifiers to evaluate performance. Experimental results demonstrate the effectiveness of different CNN models and feature extraction methods, with AlexNet, DenseNet, EfficientNet, ResNet50, and ResNet152 being evaluated. The system achieves high accuracy in classifying vocal behaviors, such as barking and howling in dogs, providing a robust tool for behavioral analysis. The study highlights the importance of automated systems in bioacoustics research and suggests future improvements using deep learning-based methods for enhanced classification performance. en_US
dc.identifier.doi 10.3390/s24247978
dc.identifier.issn 1424-8220
dc.identifier.scopus 2-s2.0-85213281431
dc.identifier.uri https://doi.org/10.3390/s24247978
dc.language.iso en en_US
dc.publisher Mdpi en_US
dc.relation.ispartof Sensors
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject automatic behavior analysis en_US
dc.subject bioacoustics en_US
dc.subject CNN en_US
dc.subject audio processing en_US
dc.title Voice Analysis in Dogs With Deep Learning: Development of a Fully Automatic Voice Analysis System for Bioacoustics Studies en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id , Mahmut KARAASLAN/0009-0002-9386-5806
gdc.author.id KAYA, Ersin/0000-0001-5668-5078
gdc.author.id Asuroglu, Tunc/0000-0003-4153-0764
gdc.author.id Turkoglu, Bahaeddin/0000-0003-0255-8422
gdc.author.wosid turkoglu, bahaeddin/AFM-7521-2022
gdc.author.wosid Asuroglu, Tunc/ITV-2441-2023
gdc.author.wosid KAYA, Ersin/V-7558-2019
gdc.bip.impulseclass C4
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.description.department KTÜN en_US
gdc.description.departmenttemp [Karaaslan, Mahmut; Kaya, Ersin] Konya Tech Univ, Dept Comp Engn, TR-42250 Konya, Turkiye; [Turkoglu, Bahaeddin] Ankara Univ, Dept Artificial Intelligence & Data Engn, TR-06830 Ankara, Turkiye; [Asuroglu, Tunc] Tampere Univ, Fac Med & Hlth Technol, Tampere 33720, Finland; [Asuroglu, Tunc] VTT Tech Res Ctr Finland, Tampere 33101, Finland en_US
gdc.description.issue 24 en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.startpage 7978
gdc.description.volume 24 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q2
gdc.identifier.openalex W4405367021
gdc.identifier.pmid 39771714
gdc.identifier.wos WOS:001386615600001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.impulse 7.0
gdc.oaire.influence 3.0057026E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Fourier Analysis
gdc.oaire.keywords Chemical technology
gdc.oaire.keywords 610
gdc.oaire.keywords 600
gdc.oaire.keywords TP1-1185
gdc.oaire.keywords Acoustics
gdc.oaire.keywords audio processing
gdc.oaire.keywords 3111
gdc.oaire.keywords Article
gdc.oaire.keywords bioacoustics
gdc.oaire.keywords Deep Learning
gdc.oaire.keywords Dogs
gdc.oaire.keywords Voice
gdc.oaire.keywords Animals
gdc.oaire.keywords Neural Networks, Computer
gdc.oaire.keywords Vocalization, Animal
gdc.oaire.keywords automatic behavior analysis
gdc.oaire.keywords CNN
gdc.oaire.popularity 7.751352E-9
gdc.oaire.publicfunded false
gdc.oaire.sciencefields 02 engineering and technology
gdc.oaire.sciencefields 01 natural sciences
gdc.oaire.sciencefields 0103 physical sciences
gdc.oaire.sciencefields 0202 electrical engineering, electronic engineering, information engineering
gdc.openalex.collaboration International
gdc.openalex.fwci 5.57130396
gdc.openalex.normalizedpercentile 0.92
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 0
gdc.plumx.mendeley 8
gdc.plumx.newscount 1
gdc.plumx.pubmedcites 2
gdc.plumx.scopuscites 6
gdc.scopus.citedcount 5
gdc.virtual.author Karaaslan, Mahmut
gdc.virtual.author Kaya, Ersin
gdc.wos.citedcount 5
relation.isAuthorOfPublication db184ac3-adb0-4f98-b2c5-11342d4b0ec0
relation.isAuthorOfPublication 6b459b99-eed9-45fb-b42f-50fbb4ee7090
relation.isAuthorOfPublication.latestForDiscovery db184ac3-adb0-4f98-b2c5-11342d4b0ec0

Files