Search results

Filters

  • Journals
  • Authors
  • Keywords
  • Date
  • Type

Search results

Number of results: 4
items per page: 25 50 75
Sort by:
Download PDF Download RIS Download Bibtex

Abstract

Speaker‘s emotional states are recognized from speech signal with Additive white Gaussian noise (AWGN). The influence of white noise on a typical emotion recogniztion system is studied. The emotion classifier is implemented with Gaussian mixture model (GMM). A Chinese speech emotion database is used for training and testing, which includes nine emotion classes (e.g. happiness, sadness, anger, surprise, fear, anxiety, hesitation, confidence and neutral state). Two speech enhancement algorithms are introduced for improved emotion classification. In the experiments, the Gaussian mixture model is trained on the clean speech data, while tested under AWGN with various signal to noise ratios (SNRs). The emotion class model and the dimension space model are both adopted for the evaluation of the emotion recognition system. Regarding the emotion class model, the nine emotion classes are classified. Considering the dimension space model, the arousal dimension and the valence dimension are classified into positive regions or negative regions. The experimental results show that the speech enhancement algorithms constantly improve the performance of our emotion recognition system under various SNRs, and the positive emotions are more likely to be miss-classified as negative emotions under white noise environment.
Go to article

Authors and Affiliations

Chengwei Huang
Guoming Chen
Hua Yu
Yongqiang Bao
Li Zhao
Download PDF Download RIS Download Bibtex

Abstract

A novel VC (voice conversion) method based on hybrid SVR (support vector regression) and GMM (Gaussian mixture model) is presented in the paper, the mapping abilities of SVR and GMM are exploited to map the spectral features of the source speaker to those of target ones. A new strategy of F0 transformation is also presented, the F0s are modeled with spectral features in a joint GMM and predicted from the converted spectral features using the SVR method. Subjective and objective tests are carried out to evaluate the VC performance; experimental results show that the converted speech using the proposed method can obtain a better quality than that using the state-of-the-art GMM method. Meanwhile, a VC method based on non-parallel data is also proposed, the speaker-specific information is investigated using the SVR method and preliminary subjective experiments demonstrate that the proposed method is feasible when a parallel corpus is not available.

Go to article

Authors and Affiliations

Peng Song
Yun Jin
Li Zhao
Cairong Zou
Download PDF Download RIS Download Bibtex

Abstract

The historical datasets at operating mine sites are usually large. Directly applying large datasets to build prediction models may lead to inaccurate results. To overcome the real-world challenges, this study aimed to handle these large datasets using Gaussian mixture modelling (GMM) for developing a novel and accurate prediction model of truck productivity. A large dataset of truck haulage collected at operating mine sites was clustered by GMM into three latent classes before the prediction model was built. The labels of these latent classes generated a latent variable. Two multiple linear regression (MLR) models were then constructed, including the ordinary-MLR (O-MLR) and the hybrid GMM-MLR models. The GMM-MLR model incorporated the observed input variables and a latent variable in the form of interaction terms. The O-MLR model was the baseline model and did not involve the latent variable. The GMM-MLR model performed considerably better than the O-MLR model in predicting truck productivity. The interaction terms quantitatively measured the differences in how the observed input variables affected truck productivity in three classes (high, medium, and low truck productivity). The haul distance was the most crucial input variable in the GMM-MLR model. This study provides new insights into handling massive amounts of data in truck haulage datasets and a more accurate prediction model for truck productivity.
Go to article

Authors and Affiliations

Chengkai Fan
1
ORCID: ORCID
Na Zhang
2
ORCID: ORCID
Bei Jiang
2
ORCID: ORCID
Wei Victor Liu
2
ORCID: ORCID

  1. University of Alberta , Edmonton, Department of Civil and Environmental Engineering, Alberta T6G 2E3, Canada
  2. University of Alberta , Department of Mathematical and Statistical Sciences, Edmonton, Alberta T6G 2G1, Canada
Download PDF Download RIS Download Bibtex

Abstract

Last decades, rolling bearing faults assessment and their evolution with time have been receiving much interest due to their crucial role as part of the Conditional Based Maintenance (CBM) of rotating machinery. This paper investigates bearing faults diagnosis based on classification approach using Gaussian Mixture Model (GMM) and the Mel Frequency Cepstral Coefficients (MFCC) features. Throughout, only one criterion is defined for the evaluation of the performance during all the cycle of the classification process. This is the Average Classification Rate (ACR) obtained from the confusion matrix. In every test performed, the generated features vectors are considered along to discriminate between four fault conditions as normal bearings, bearings with inner and outer race faults and ball faults. Many configurations were tested in order to determinate the optimal values of input parameters, as the frame analysis length, the order of model, and others. The experimental application of the proposed method was based on vibration signals taken from the bearing datacenter website of Case Western Reserve University (CWRU). Results show that proposed method can reliably classify different fault conditions and have a highest classification performance under some conditions.

Go to article

Authors and Affiliations

Youcef Atmani
Said Rechak
Ammar Mesloub
Larbi Hemmouch

This page uses 'cookies'. Learn more