Voice Analysis in Dogs With Deep Learning: Development of a Fully Automatic Voice Analysis System for Bioacoustics Studies

No Thumbnail Available

Date

2024

Authors

Karaaslan, Mahmut
Kaya, Ersin

Journal Title

Journal ISSN

Volume Title

Publisher

Mdpi

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Average
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

Extracting behavioral information from animal sounds has long been a focus of research in bioacoustics, as sound-derived data are crucial for understanding animal behavior and environmental interactions. Traditional methods, which involve manual review of extensive recordings, pose significant challenges. This study proposes an automated system for detecting and classifying animal vocalizations, enhancing efficiency in behavior analysis. The system uses a preprocessing step to segment relevant sound regions from audio recordings, followed by feature extraction using Short-Time Fourier Transform (STFT), Mel-frequency cepstral coefficients (MFCCs), and linear-frequency cepstral coefficients (LFCCs). These features are input into convolutional neural network (CNN) classifiers to evaluate performance. Experimental results demonstrate the effectiveness of different CNN models and feature extraction methods, with AlexNet, DenseNet, EfficientNet, ResNet50, and ResNet152 being evaluated. The system achieves high accuracy in classifying vocal behaviors, such as barking and howling in dogs, providing a robust tool for behavioral analysis. The study highlights the importance of automated systems in bioacoustics research and suggests future improvements using deep learning-based methods for enhanced classification performance.

Description

, Mahmut KARAASLAN/0009-0002-9386-5806; KAYA, Ersin/0000-0001-5668-5078; Asuroglu, Tunc/0000-0003-4153-0764; Turkoglu, Bahaeddin/0000-0003-0255-8422

Keywords

automatic behavior analysis, bioacoustics, CNN, audio processing, Fourier Analysis, Chemical technology, 610, 600, TP1-1185, Acoustics, audio processing, 3111, Article, bioacoustics, Deep Learning, Dogs, Voice, Animals, Neural Networks, Computer, Vocalization, Animal, automatic behavior analysis, CNN

Turkish CoHE Thesis Center URL

Fields of Science

02 engineering and technology, 01 natural sciences, 0103 physical sciences, 0202 electrical engineering, electronic engineering, information engineering

Citation

WoS Q

Q2

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
N/A

Source

Sensors

Volume

24

Issue

24

Start Page

7978

End Page

PlumX Metrics
Citations

Scopus : 6

PubMed : 2

Captures

Mendeley Readers : 8

SCOPUS™ Citations

5

checked on Feb 03, 2026

Web of Science™ Citations

5

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
5.57130396

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo

11

SUSTAINABLE CITIES AND COMMUNITIES
SUSTAINABLE CITIES AND COMMUNITIES Logo

12

RESPONSIBLE CONSUMPTION AND PRODUCTION
RESPONSIBLE CONSUMPTION AND PRODUCTION Logo

13

CLIMATE ACTION
CLIMATE ACTION Logo

14

LIFE BELOW WATER
LIFE BELOW WATER Logo