سامانه پژوهشی دانشگاه کردستان | Speech Emotion Recognition Using a Hierarchical Adaptive Weighted MultiLayer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features

عنوان	Speech Emotion Recognition Using a Hierarchical Adaptive Weighted MultiLayer Sparse Auto-Encoder Extreme Learning Machine with New Weighting and Spectral/SpectroTemporal Gabor Filter Bank Features
نوع پژوهش	مقاله ارائه شده کنفرانسی
کلیدواژه‌ها	Speech Emotion Recognition, ELM
چکیده	The importance of doing research into affective computing has multiplied with the growing popularity of intelligent and human-machine interface systems. In this paper, a system for speech emotion recognition (SER) is proposed using new techniques in different parts. The given system extracts speech features from both speech and glottal-waveform signals in feature extraction section including spectro-temporal ones obtained from Gabor filter bank (GBFB) and separate Gabor filter bank (SGBFB) which have not been so far utilized for SER. At the classification step, a hierarchical adaptive weighted multilayer extreme learning machine (H-AWELM) is employed. This hybrid classifier consists of two parts: the first part for sparse unsupervised feature learning using a multi-layer neural network (NN) with sparse extreme learning machine auto-encoder (ELMAE) layers, and the second part for feature classification in the last layer using Tikhonov’s regularized least squares (LS) technique. One of the most important issues in multi-class ELM training process is how to deal with data imbalance problem. This paper presents a new adaptive weighting method to solve this problem that can be more accurate than current weighting methods. Finally, the proposed system is evaluated on a well-known emotional speech database. Experimental results demonstrate that the proposed system outperforms the state-of-the-art ones
پژوهشگران	سیدجهانشاه کبودیان (نفر دوم)، فاطمه دانشفر (نفر اول)

مشخصات پژوهش