Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction

DC Field | Value | Language
dc.contributor.author | Choi, Dong-Jin | ko
dc.contributor.author | Park, Jeong-Sik | ko
dc.contributor.author | Oh, Yung-Hwan | ko
dc.date.accessioned | 2016-04-12T06:29:40Z | -
dc.date.available | 2016-04-12T06:29:40Z | -
dc.date.created | 2015-05-06 | -
dc.date.issued | 2015-04 | -
dc.identifier.citation | ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, v.40, pp.95 - 102 | -
dc.identifier.issn | 0952-1976 | -
dc.identifier.uri | http://hdl.handle.net/10203/203037 | -
dc.description.abstract | Speaker adaptation transforms the standard speaker-independent acoustic models into an adapted model relevant to the user (called the target speaker) in order to provide reliable speech recognition performance. Although several conventional adaptation techniques, such as Maximum Likelihood Linear Regression (MLLR) and Maximum A Posteriori (MAP), have been successfully applied to speech recognition tasks, they demonstrate great dependency on the amount of adaptation data. However, the eigenvoice-based adaptation technique is known to provide reliable performance regardless of the amount of data, even for a very small amount. In this study, we propose an efficient eigenvoice adaptation approach to construct more reliable adapted models. The proposed approach merges eigenvoice sets for possible eigenvoice combinations, and then selects optimal eigenvoice sets that are most relevant to the target speaker. For this task, we propose an efficient unsupervised eigenvoice selection method as well as a rapid merging technique. In speech recognition experiments using the Defense Advanced Research Projects Agency's Resource Management corpus, the proposed approach exhibited superior performance, compared to conventional methods, in both recognition accuracy and time complexity. | -
dc.language | English | -
dc.publisher | PERGAMON-ELSEVIER SCIENCE LTD | -
dc.subject | SPEECH RECOGNITION | -
dc.subject | ROBOTS | -
dc.title | Unsupervised rapid speaker adaptation based on selective eigenvoice merging for user-specific voice interaction | -
dc.type | Article | -
dc.identifier.wosid | 000352045600010 | -
dc.identifier.scopusid | 2-s2.0-84923364929 | -
dc.type.rims | ART | -
dc.citation.volume | 40 | -
dc.citation.beginningpage | 95 | -
dc.citation.endingpage | 102 | -
dc.citation.publicationname | ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | -
dc.identifier.doi | 10.1016/j.engappai.2015.01.010 | -
dc.contributor.localauthor | Oh, Yung-Hwan | -
dc.contributor.nonIdAuthor | Choi, Dong-Jin | -
dc.contributor.nonIdAuthor | Park, Jeong-Sik | -
dc.type.journalArticle | Article | -
dc.subject.keywordAuthor | Speaker adaptation | -
dc.subject.keywordAuthor | Eigenvoice | -
dc.subject.keywordAuthor | Maximum Likelihood Linear Regression | -
dc.subject.keywordAuthor | Maximum A Posteriori | -
dc.subject.keywordAuthor | Selective eigenvoice merging | -
dc.subject.keywordAuthor | Speech recognition | -
dc.subject.keywordPlus | SPEECH RECOGNITION | -
dc.subject.keywordPlus | ROBOTS | -
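The abstract notes that eigenvoice-based adaptation stays reliable even with very little adaptation data. The sketch below illustrates only that generic idea, not the selective merging method of the paper, whose details this record does not give. It assumes a speaker model can be written as a mean supervector plus a weighted sum of a few eigenvoices, and it estimates the weights by ordinary least squares over the few observed dimensions. Practical systems typically use a maximum-likelihood estimate over HMM statistics instead. All names and numbers (`MEAN`, `EIGENVOICES`, `estimate_weights`, `adapt`) are hypothetical and chosen for illustration.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back-substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def estimate_weights(mean, eigenvoices, observed):
    """Least-squares eigenvoice weights from a partial observation.

    observed maps a supervector dimension index to its observed value;
    dimensions that the adaptation data did not cover are simply absent.
    This is a simplification (assumption), not a maximum-likelihood estimate.
    """
    dims = sorted(observed)
    k = len(eigenvoices)
    # Normal equations (E^T E) w = E^T (x - mean), restricted to observed dims.
    gram = [[sum(eigenvoices[i][d] * eigenvoices[j][d] for d in dims)
             for j in range(k)] for i in range(k)]
    rhs = [sum(eigenvoices[i][d] * (observed[d] - mean[d]) for d in dims)
           for i in range(k)]
    return solve_linear(gram, rhs)


def adapt(mean, eigenvoices, weights):
    """Build the full adapted supervector: mean + sum_k w_k * e_k."""
    return [m + sum(w * e[d] for w, e in zip(weights, eigenvoices))
            for d, m in enumerate(mean)]


# Hypothetical toy data: a 4-dimensional supervector and two eigenvoices.
MEAN = [1.0, 2.0, 3.0, 4.0]
EIGENVOICES = [[1.0, 0.0, 1.0, 0.0],
               [0.0, 1.0, 0.0, 1.0]]

# Only dimensions 0 and 1 were observed for the target speaker.
weights = estimate_weights(MEAN, EIGENVOICES, {0: 1.5, 1: 0.0})
adapted = adapt(MEAN, EIGENVOICES, weights)
print(weights, adapted)
```

Because only two weights are estimated rather than every model parameter, two observed dimensions are enough to fill in the unobserved ones in this toy case, which is the property that makes the approach attractive when adaptation data is scarce.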
Appears in Collection
CS-Journal Papers(저널논문)