Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification

Cited 39 times in Web of Science; cited 0 times in Scopus
DC Field | Value | Language
dc.contributor.author | Yun, Sung-Rack | ko
dc.contributor.author | Yoo, Chang-Dong | ko
dc.date.accessioned | 2013-03-11T21:34:10Z | -
dc.date.available | 2013-03-11T21:34:10Z | -
dc.date.created | 2012-02-06 | -
dc.date.issued | 2012-02 | -
dc.identifier.citation | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, v.20, no.2, pp.585 - 598 | -
dc.identifier.issn | 1558-7916 | -
dc.identifier.uri | http://hdl.handle.net/10203/100367 | -
dc.description.abstract | This paper considers a learning framework for speech emotion classification using a discriminant function based on Gaussian mixture models (GMMs). The GMM parameter set is estimated by margin scaling with a loss function to reduce the risk of predicting emotions with high loss. Here, the loss function is defined as a function of a distance metric using Watson and Tellegen's emotion model. Margin scaling is known to have good generalization ability and can be considered appropriate for emotion modeling, where the parameter set is likely to be over-fitted to a training data set whose characteristics may differ from those of the testing data set. Our learning framework is formulated as a constrained optimization problem which is solved using semi-definite programming. Three tasks were evaluated: acted emotion classification, natural emotion classification, and cross-database emotion classification. In each task, four loss functions were evaluated. In all experiments, results consistently show that margin scaling improves the classification accuracy over other learning frameworks based on maximum likelihood, maximum mutual information, and the max-margin framework without margin scaling. Experimental results also show that margin scaling substantially reduces the overall loss compared to the max-margin framework without margin scaling. | -
dc.language | English | -
dc.publisher | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | -
dc.subject | RECOGNITION | -
dc.title | Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification | -
dc.type | Article | -
dc.identifier.wosid | 000299525800021 | -
dc.identifier.scopusid | 2-s2.0-83655164697 | -
dc.type.rims | ART | -
dc.citation.volume | 20 | -
dc.citation.issue | 2 | -
dc.citation.beginningpage | 585 | -
dc.citation.endingpage | 598 | -
dc.citation.publicationname | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | -
dc.identifier.doi | 10.1109/TASL.2011.2162405 | -
dc.embargo.liftdate | 9999-12-31 | -
dc.embargo.terms | 9999-12-31 | -
dc.contributor.localauthor | Yoo, Chang-Dong | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Gaussian mixture models (GMMs) | -
dc.subject.keywordAuthor | margin scaling | -
dc.subject.keywordAuthor | speech emotion classification | -
dc.subject.keywordAuthor | Watson and Tellegen's model | -
dc.subject.keywordPlus | RECOGNITION | -
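The abstract above describes margin scaling with a loss defined as a distance in an emotion model. As a rough illustration only, the following Python sketch shows the generic margin-scaled hinge penalty familiar from large-margin learning; it is not the paper's GMM discriminant or its semi-definite programming formulation, neither of which is given in this record. The 2-D emotion coordinates, the function names `emotion_loss` and `margin_scaled_hinge`, and the example scores are all hypothetical.

```python
import math

# Hypothetical 2-D emotion coordinates, for illustration only.
# This record does not give the placement of emotions used in the paper.
EMOTION_COORDS = {
    "happy": (0.8, 0.5),
    "angry": (-0.6, 0.7),
    "sad": (-0.7, -0.5),
    "neutral": (0.0, 0.0),
}


def emotion_loss(true_label, predicted_label):
    """Loss as the Euclidean distance between two emotions in the assumed space."""
    return math.dist(EMOTION_COORDS[true_label], EMOTION_COORDS[predicted_label])


def margin_scaled_hinge(scores, true_label):
    """Largest violation of: score[true] - score[y] >= loss(true, y), floored at 0.

    `scores` maps each label to a discriminant value (plain numbers here,
    standing in for whatever discriminant function a model provides).
    """
    worst = 0.0
    for label, score in scores.items():
        if label == true_label:
            continue
        required_margin = emotion_loss(true_label, label)
        actual_margin = scores[true_label] - score
        worst = max(worst, required_margin - actual_margin)
    return worst


# Example: "angry" scores close to the true label "happy" although the two
# are far apart in the assumed space, so it dominates the penalty.
example_scores = {"happy": 2.0, "angry": 1.5, "sad": 0.2, "neutral": 1.0}
penalty = margin_scaled_hinge(example_scores, "happy")
```

Under these assumptions, a competing emotion that lies far from the true one must be beaten by a larger score gap, which is the sense in which the margin is scaled by the loss.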
Appears in Collection
EE-Journal Papers(저널논문)