Comparison of the Effects of Mel Coefficients and Spectrogram Images Via Deep Learning in Emotion Classification

No Thumbnail Available

Date

2020

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

INT INFORMATION & ENGINEERING TECHNOLOGY ASSOC

Open Access Color

BRONZE

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Top 10%
Influence
Top 10%
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

In the present paper, an approach was developed for emotion recognition from speech data using deep learning algorithms, a problem that has gained importance in recent years. Feature extraction manually and feature selection steps were more important in traditional methods for speech emotion recognition. In spite of this, deep learning algorithms were applied to data without any data reduction. The study implemented the triple emotion groups of EmoDB emotion data: Boredom, Neutral, and Sadness-BNS; and Anger, Happiness, and Fear-AHF. Firstly, the spectrogram images resulting from the signal data after preprocessing were classified using AlexNET. Secondly, the results formed from the MelFrequency Cepstrum Coefficients (MFCC) extracted by feature extraction methods to Deep Neural Networks (DNN) were compared. The importance and necessity of using manual feature extraction in deep learning was investigated, which remains a very important part of emotion recognition. The experimental results show that emotion recognition through the implementation of the AlexNet architecture to the spectrogram images was more discriminative than that through the implementation of DNN to manually extracted features.

Description

Keywords

Speech Emotion Recognition, Deep Neural Network (Dnn), Convolutional Neural Network (Cnn), Deep Learning Algorithm, Mel-Frequency Cepstrum Coefficients (Mfcc), Neural-Network, Speech, Recognition, Architectures

Turkish CoHE Thesis Center URL

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

Q4

Scopus Q

N/A
OpenCitations Logo
OpenCitations Citation Count
12

Source

TRAITEMENT DU SIGNAL

Volume

37

Issue

1

Start Page

51

End Page

57
PlumX Metrics
Citations

Scopus : 13

Captures

Mendeley Readers : 27

SCOPUS™ Citations

13

checked on Feb 03, 2026

Web of Science™ Citations

9

checked on Feb 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
1.25960781

Sustainable Development Goals

SDG data is not available