Advanced sequence learning approaches for emotion recognition using speech signals

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

The most significant and successful method of human communication is speech, which can also serve as a channel for human-computer interaction (HCI). The use of sensors to identify auditory emotions is an emerging field of HCI research. In the ongoing scenario, emotion recognition is a persistent issue that is crucial to real-time applications. Recognizing human emotions is difficult when analysing and predicting a person's behaviour from a collection of audio clips. This chapter is going to cover the concept of sequence learning for emotion recognition using LSTM, GRUs, and their modifications, such as multi-layer or deep LSTM and bi-directional LSTM networks. The successes and weaknesses of current LSTM/GRUs-based emotion recognition systems will be assessed and discussed. We'll also discuss the drawbacks of traditional RNNs and why LSTM is better than RNNs.

Original languageEnglish
Title of host publicationIntelligent Multimedia Signal Processing for Smart Ecosystems
PublisherSpringer International Publishing
Pages307-325
Number of pages19
ISBN (Electronic)9783031348730
ISBN (Print)9783031348723
DOIs
Publication statusPublished - Sept 30 2023
Externally publishedYes

Keywords

  • Audio signal
  • Emotional recognition
  • LSTM
  • Sensors

ASJC Scopus subject areas

  • General Computer Science
  • General Mathematics

Fingerprint

Dive into the research topics of 'Advanced sequence learning approaches for emotion recognition using speech signals'. Together they form a unique fingerprint.

Cite this