Search for: [Keywords = "Gaussian mixture model"]

Search results

Number of results: 4

items per page: 25 50 75

Sort by:

of 1

Speech Emotion Recognition under White Noise

Chengwei Huang Guoming Chen Hua Yu Yongqiang Bao Li Zhao

Archives of Acoustics | 2013 | vol. 38 | No 4 | 457-463 | DOI: 10.2478/aoa-2013-0054

Keywords speech emotion recognition speech enhancement emotion model Gaussian mixture model

Download PDF Download RIS Download Bibtex

Abstract

Speaker‘s emotional states are recognized from speech signal with Additive white Gaussian noise (AWGN). The influence of white noise on a typical emotion recogniztion system is studied. The emotion classifier is implemented with Gaussian mixture model (GMM). A Chinese speech emotion database is used for training and testing, which includes nine emotion classes (e.g. happiness, sadness, anger, surprise, fear, anxiety, hesitation, confidence and neutral state). Two speech enhancement algorithms are introduced for improved emotion classification. In the experiments, the Gaussian mixture model is trained on the clean speech data, while tested under AWGN with various signal to noise ratios (SNRs). The emotion class model and the dimension space model are both adopted for the evaluation of the emotion recognition system. Regarding the emotion class model, the nine emotion classes are classified. Considering the dimension space model, the arousal dimension and the valence dimension are classified into positive regions or negative regions. The experimental results show that the speech enhancement algorithms constantly improve the performance of our emotion recognition system under various SNRs, and the positive emotions are more likely to be miss-classified as negative emotions under white noise environment.

Go to article

Authors and Affiliations

Chengwei Huang

Guoming Chen

Hua Yu

Yongqiang Bao

Li Zhao

Voice Conversion Based on Hybrid SVR and GMM

Peng Song Yun Jin Li Zhao Cairong Zou

Archives of Acoustics | 2012 | vol. 37 | No 2 | 143-149 | DOI: 10.2478/v10168-012-0020-9

Keywords voice conversion support vector regression Gaussian mixture model F0 prediction speaker-specific information

Download PDF Download RIS Download Bibtex

Abstract

A novel VC (voice conversion) method based on hybrid SVR (support vector regression) and GMM (Gaussian mixture model) is presented in the paper, the mapping abilities of SVR and GMM are exploited to map the spectral features of the source speaker to those of target ones. A new strategy of F0 transformation is also presented, the F0s are modeled with spectral features in a joint GMM and predicted from the converted spectral features using the SVR method. Subjective and objective tests are carried out to evaluate the VC performance; experimental results show that the converted speech using the proposed method can obtain a better quality than that using the state-of-the-art GMM method. Meanwhile, a VC method based on non-parallel data is also proposed, the speaker-specific information is investigated using the SVR method and preliminary subjective experiments demonstrate that the proposed method is feasible when a parallel corpus is not available.

Go to article

Authors and Affiliations

Peng Song

Yun Jin

Li Zhao

Cairong Zou

Preprocessing Large Datasets Using Gaussian Mixture Modelling to Improve Prediction Accuracy of Truck Productivity at Mine Sites

Chengkai Fan Na Zhang Bei Jiang Wei Victor Liu

Archives of Mining Sciences | 2022 | vol. 67 | No 4 | 661-680 | DOI: 10.24425/ams.2022.143680

Keywords Oil sands mining Mine truck productivity Gaussian mixture model Latent variable Prediction accuracy Relative importance

Download PDF Download RIS Download Bibtex

Abstract

The historical datasets at operating mine sites are usually large. Directly applying large datasets to build prediction models may lead to inaccurate results. To overcome the real-world challenges, this study aimed to handle these large datasets using Gaussian mixture modelling (GMM) for developing a novel and accurate prediction model of truck productivity. A large dataset of truck haulage collected at operating mine sites was clustered by GMM into three latent classes before the prediction model was built. The labels of these latent classes generated a latent variable. Two multiple linear regression (MLR) models were then constructed, including the ordinary-MLR (O-MLR) and the hybrid GMM-MLR models. The GMM-MLR model incorporated the observed input variables and a latent variable in the form of interaction terms. The O-MLR model was the baseline model and did not involve the latent variable. The GMM-MLR model performed considerably better than the O-MLR model in predicting truck productivity. The interaction terms quantitatively measured the differences in how the observed input variables affected truck productivity in three classes (high, medium, and low truck productivity). The haul distance was the most crucial input variable in the GMM-MLR model. This study provides new insights into handling massive amounts of data in truck haulage datasets and a more accurate prediction model for truck productivity.

Go to article

Authors and Affiliations

Chengkai Fan

e-mail:

ORCID:

Na Zhang

e-mail:

ORCID:

Bei Jiang

e-mail:

ORCID:

Wei Victor Liu

e-mail:

ORCID:

University of Alberta , Edmonton, Department of Civil and Environmental Engineering, Alberta T6G 2E3, Canada
University of Alberta , Department of Mathematical and Statistical Sciences, Edmonton, Alberta T6G 2G1, Canada

Enhancement in Bearing Fault Classification Parameters Using Gaussian Mixture Models and Mel Frequency Cepstral Coefficients Features

Youcef Atmani Said Rechak Ammar Mesloub Larbi Hemmouch

Archives of Acoustics | 2020 | vol. 45 | No 2 | 283-295 | DOI: 10.24425/aoa.2020.133149

Keywords bearing faults Gaussian mixture models Mel frequency cepstral coefficients feature extraction diagnosis

Download PDF Download RIS Download Bibtex

Abstract

Last decades, rolling bearing faults assessment and their evolution with time have been receiving much interest due to their crucial role as part of the Conditional Based Maintenance (CBM) of rotating machinery. This paper investigates bearing faults diagnosis based on classification approach using Gaussian Mixture Model (GMM) and the Mel Frequency Cepstral Coefficients (MFCC) features. Throughout, only one criterion is defined for the evaluation of the performance during all the cycle of the classification process. This is the Average Classification Rate (ACR) obtained from the confusion matrix. In every test performed, the generated features vectors are considered along to discriminate between four fault conditions as normal bearings, bearings with inner and outer race faults and ball faults. Many configurations were tested in order to determinate the optimal values of input parameters, as the frame analysis length, the order of model, and others. The experimental application of the proposed method was based on vibration signals taken from the bearing datacenter website of Case Western Reserve University (CWRU). Results show that proposed method can reliably classify different fault conditions and have a highest classification performance under some conditions.

Go to article

Authors and Affiliations

Youcef Atmani

Said Rechak

Ammar Mesloub

Larbi Hemmouch

Search results

Filters

Search results

Speech Emotion Recognition under White Noise

Abstract

Authors and Affiliations

Voice Conversion Based on Hybrid SVR and GMM

Abstract

Authors and Affiliations

Preprocessing Large Datasets Using Gaussian Mixture Modelling to Improve Prediction Accuracy of Truck Productivity at Mine Sites

Abstract

Authors and Affiliations

Enhancement in Bearing Fault Classification Parameters Using Gaussian Mixture Models and Mel Frequency Cepstral Coefficients Features

Abstract

Authors and Affiliations