TY - GEN
T1 - Analysis of emotion recognition from cross-lingual speech
T2 - 2nd International Conference on Computation, Automation and Knowledge Management, ICCAKM 2021
AU - Farhad, Moomal
AU - Ismail, Heba
AU - Harous, Saad
AU - Masud, Mohammad Mehedy
AU - Beg, Azam
N1 - Publisher Copyright:
© 2021 IEEE.
PY - 2021/1/19
Y1 - 2021/1/19
N2 - In a system which involves interaction be- tween machines and humans, the recognition of emotion from audio has always been a focus of research. Emotion recognition can play an essential role in many fields, such as medicine, law, psychology, and customer services. In this paper, we present an empirical comparative analysis of several machine learning classifiers for emotion recognition in audio data. Evaluations are performed for a set of predefined emotions such as happy, sad, and angry from Arabic, English, and Urdu languages. Pitch and cepstral features are extracted from audio files and principal component analysis is applied for dimensionality reduction. Experiments show that random forest outperformed other classifiers on Urdu dataset with an accuracy of 78.75%. However, the performance of Meta iterative classifier on Arabic dataset was better than random forest and neural network with the accuracy of 70%. Classification of emotions on the English dataset, which do not differ much in terms of pitch and MFCC features, generated the lowest accuracies at or below 31%.
AB - In a system which involves interaction be- tween machines and humans, the recognition of emotion from audio has always been a focus of research. Emotion recognition can play an essential role in many fields, such as medicine, law, psychology, and customer services. In this paper, we present an empirical comparative analysis of several machine learning classifiers for emotion recognition in audio data. Evaluations are performed for a set of predefined emotions such as happy, sad, and angry from Arabic, English, and Urdu languages. Pitch and cepstral features are extracted from audio files and principal component analysis is applied for dimensionality reduction. Experiments show that random forest outperformed other classifiers on Urdu dataset with an accuracy of 78.75%. However, the performance of Meta iterative classifier on Arabic dataset was better than random forest and neural network with the accuracy of 70%. Classification of emotions on the English dataset, which do not differ much in terms of pitch and MFCC features, generated the lowest accuracies at or below 31%.
KW - Arabic
KW - Emotion recognition
KW - English
KW - Machine learning
KW - Urdu
UR - http://www.scopus.com/inward/record.url?scp=85102469726&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85102469726&partnerID=8YFLogxK
U2 - 10.1109/ICCAKM50778.2021.9357726
DO - 10.1109/ICCAKM50778.2021.9357726
M3 - Conference contribution
AN - SCOPUS:85102469726
T3 - Proceedings of 2nd International Conference on Computation, Automation and Knowledge Management, ICCAKM 2021
SP - 42
EP - 47
BT - Proceedings of 2nd International Conference on Computation, Automation and Knowledge Management, ICCAKM 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 19 January 2021 through 21 January 2021
ER -