An MLP/HMM hybrid model using nonlinear predictors

Cited 4 time in webofscience Cited 0 time in scopus
  • Hit : 303
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorChung, YJko
dc.contributor.authorUn, Chong-Kwanko
dc.date.accessioned2013-02-27T21:29:47Z-
dc.date.available2013-02-27T21:29:47Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued1996-10-
dc.identifier.citationSPEECH COMMUNICATION, v.19, no.4, pp.307 - 316-
dc.identifier.issn0167-6393-
dc.identifier.urihttp://hdl.handle.net/10203/70954-
dc.description.abstractIn this paper, we propose an MLP/HMM hybrid model in which the input feature vectors are transformed by nonlinear predictors using multilayer perceptrons (MLPs) assigned to each state of a Hidden Markov Model (HMM). The prediction error vectors in the states are modeled by Gaussian mixture densities. The use of a hybrid model is motivated from the need to model the prediction errors in the conventional neural prediction model (NPM) where the prediction errors are variable due to the effect of varying contexts and speaker identity. The MLP/HMM hybrid model is advantageous because frame-correlation in the input speech signal is exploited by employing the MLP predictors, and the variabilities in the prediction error signals are explicitly modeled. We present the training algorithms based on the maximum likelihood (ML) criterion and discriminative criterion for minimum error classification. Experiments were done on speaker-independent continuous speech recognition. By ML training of the hybrid model, we obtained a much better performance than a conventional NPM which does not explicitly model the prediction error signals. By training with the discriminative criterion, confusion among different models was significantly reduced and word error rate was reduced by 56% compared with the ML training.-
dc.languageEnglish-
dc.publisherELSEVIER SCIENCE BV-
dc.subjectWORD RECOGNITION-
dc.subjectNETWORK-
dc.titleAn MLP/HMM hybrid model using nonlinear predictors-
dc.typeArticle-
dc.identifier.wosidA1996VV40100004-
dc.type.rimsART-
dc.citation.volume19-
dc.citation.issue4-
dc.citation.beginningpage307-
dc.citation.endingpage316-
dc.citation.publicationnameSPEECH COMMUNICATION-
dc.contributor.localauthorUn, Chong-Kwan-
dc.contributor.nonIdAuthorChung, YJ-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorspeech recognition-
dc.subject.keywordAuthormultilayer perceptron-
dc.subject.keywordAuthorHMM-
dc.subject.keywordAuthornonlinear prediction-
dc.subject.keywordPlusWORD RECOGNITION-
dc.subject.keywordPlusNETWORK-
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 4 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0