Speech emotion recognition using deep neural networks

With the ever-increasing interest of the research community in studying human-computer/human-human interactions, systems that deduce and identify the emotional aspects of a speech signal have emerged as a hot research topic. Speech Emotion Recognition (SER) has brought the development of automated and intel...

Bibliographic Details
Main Author:	Qadri, Syed Asif Ahmad (Author)
Format:	Thesis
Language:	English
Published:	Kuala Lumpur : Kulliyyah of Engineering, International Islamic University Malaysia, 2020
Subjects:	Speech processing systems; Signal processing > Digital techniques; Neural networks (Computer science); Theses, IIUM local
Online Access:	http://studentrepo.iium.edu.my/handle/123456789/10140


LEADER	044850000a22004450004500
008	201002s2020 my a f m 000 0 eng d
040			\|a UIAM \|b eng \|e rda
041			\|a eng
043			\|a a-my---
050	0	0	\|a TK7882.S65
100	1		\|a Qadri, Syed Asif Ahmad, \|e author
245	1	0	\|a Speech emotion recognition using deep neural networks / \|c by Syed Asif Ahmad Qadri
264		1	\|a Kuala Lumpur : \|b Kulliyyah of Engineering, International Islamic University Malaysia, \|c 2020
300			\|a xiv, 112 leaves : \|b colour illustrations ; \|c 30 cm.
336			\|2 rdacontent \|a text
337			\|2 rdamedia \|a unmediated
337			\|2 rdamedia \|a computer
338			\|2 rdacarrier \|a volume
338			\|2 rdacarrier \|a computer disc
338			\|2 rdacarrier \|a online resource
347			\|2 rdaft \|a text file \|b PDF
500			\|a Abstracts in English and Arabic.
500			\|a "A dissertation submitted in fulfilment of the requirement for the degree of Master of Science (Computer and Information Engineering)."--On title page.
502			\|a Thesis (MSCIE)--International Islamic University Malaysia, 2020.
504			\|a Includes bibliographical references (leaves 99-106).
520			\|a With the ever-increasing interest of the research community in studying human-computer/human-human interactions, systems that deduce and identify the emotional aspects of a speech signal have emerged as a hot research topic. Speech Emotion Recognition (SER) has brought the development of automated and intelligent analysis of human utterances to reality. Typically, an SER system extracts features from speech signals, such as pitch frequency, formant features, and energy-related and spectral features, and follows this with a classification step to identify the underlying emotion. However, considerable uncertainty remains, arising from factors such as determining the influential features, developing hybrid algorithms, and the type and number of emotions and languages under consideration. The key issue for a successful SER system is the proper selection of emotional feature extraction techniques. In this research, Mel-frequency Cepstral Coefficients (MFCC) and the Teager Energy Operator (TEO), along with a novel fusion of MFCC and TEO referred to as Teager-MFCC (TMFCC), are examined over a multilingual database consisting of English, German and Hindi. These datasets were retrieved from authentic and widely adopted sources: the German corpus is the well-known Berlin Emo-DB, the Hindi corpus is the Indian Institute of Technology Kharagpur Simulated Emotion Hindi Speech Corpus (IITKGP-SEHSC), and the English corpus is the Toronto emotional speech set (TESS). Deep neural networks were used to classify the emotions considered, viz. happy, sad, angry, and neutral. Evaluation results show that MFCC, with a recognition rate of 87.8%, outperforms TEO and TMFCC, whose recognition rates were 77.4% and 82.1% respectively. However, when considering energy-based emotions, contrasting results were obtained: TEO, with a recognition rate of 90.5%, outperforms MFCC and TMFCC, whose recognition rates were 83.7% and 86.7% respectively. The outcome of this research would assist in the formation of a pragmatic emotional speech recognition implementation driven by a wiser selection of the underlying feature extraction techniques.
596			\|a 1
650		0	\|a Speech processing systems
650		0	\|a Signal processing \|x Digital techniques
650		0	\|a Neural networks (Computer science)
655	7		\|a Theses, IIUM local
690			\|a Dissertations, Academic \|x Department of Electrical and Computer Engineering \|z IIUM
700	1		\|a Gunawan, Teddy Surya, \|e degree supervisor
700	0		\|a Hasmah Mansor, \|e degree supervisor
710	2		\|a International Islamic University Malaysia. \|b Department of Electrical and Computer Engineering
856	4		\|u http://studentrepo.iium.edu.my/handle/123456789/10140
900			\|a sz-nzj-sz-nzj-asbh
999			\|c 439328 \|d 470857
952			\|0 0 \|1 0 \|2 lcc \|4 0 \|6 T T K7882 S65 Q00001S 02020 \|7 3 \|8 IIUMTHESIS \|9 761845 \|a IIUM \|b IIUM \|c MULTIMEDIA \|d 2022-07-07 \|g 0.00 \|o t TK 7882 S65 Q1S 2020 \|p 11100418323 \|r 1900-01-02 \|t 1 \|v 0.00 \|y THESIS
952			\|0 0 \|1 0 \|2 lcc \|4 0 \|6 TS C D F TK 07882 S00065 Q00001S 02020 \|7 3 \|8 IIUMTHESIS \|9 859309 \|a IIUM \|b IIUM \|c MULTIMEDIA \|d 2022-07-07 \|g 0.00 \|o ts cdf TK 7882 S65 Q1S 2020 \|p 11100418324 \|r 1900-01-02 \|t 1 \|v 0.00 \|y THESISDIG


Similar Items