Noise-robust speaker recognition using subband likelihoods and reliable-feature selection

Cited 10 time in webofscience Cited 0 time in scopus
  • Hit : 379
  • Download : 560
DC FieldValueLanguage
dc.contributor.authorKim, Sko
dc.contributor.authorJi, Mko
dc.contributor.authorKim, HoiRinko
dc.date.accessioned2011-05-18T07:12:59Z-
dc.date.available2011-05-18T07:12:59Z-
dc.date.created2012-02-06-
dc.date.created2012-02-06-
dc.date.issued2008-02-
dc.identifier.citationETRI JOURNAL, v.30, no.1, pp.89 - 100-
dc.identifier.issn1225-6463-
dc.identifier.urihttp://hdl.handle.net/10203/23730-
dc.description.abstractWe consider the feature recombination technique in a multiband approach to speaker identification and verification. To overcome the ineffectiveness of conventional feature recombination in broadband noisy environments, we propose a new subband feature recombination which uses subband likelihoods and a subband reliable-feature selection technique with an adaptive noise model. In the decision step of speaker recognition, a few very low unreliable feature likelihood scores can cause a speaker recognition system to make an incorrect decision. To overcome this problem, reliable-feature selection adjusts the likelihood scores of an unreliable feature by comparison with those of an adaptive noise model, which is estimated by the maximum a posteriori adaptation technique using noise features directly obtained from noisy test speech. To evaluate the effectiveness of the proposed methods in noisy environments, we use the TIMIT database and the NTIMT database, which is the corresponding telephone version of TIMIT database. The proposed subband feature recombination with subband reliable-feature selection achieves better performance than the conventional feature recombination system with reliable-feature selection.-
dc.languageEnglish-
dc.language.isoen_USen
dc.publisherELECTRONICS TELECOMMUNICATIONS RESEARCH INST-
dc.subjectSCORE NORMALIZATION-
dc.subjectMODELS-
dc.titleNoise-robust speaker recognition using subband likelihoods and reliable-feature selection-
dc.typeArticle-
dc.identifier.wosid000252969400009-
dc.identifier.scopusid2-s2.0-39449083216-
dc.type.rimsART-
dc.citation.volume30-
dc.citation.issue1-
dc.citation.beginningpage89-
dc.citation.endingpage100-
dc.citation.publicationnameETRI JOURNAL-
dc.contributor.localauthorKim, HoiRin-
dc.contributor.nonIdAuthorKim, S-
dc.contributor.nonIdAuthorJi, M-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorspeaker recognition-
dc.subject.keywordAuthorGaussian mixture model-
dc.subject.keywordAuthoruniversal background model-
dc.subject.keywordAuthorfeature recombination-
dc.subject.keywordAuthormel-frequency cepstral coefficient-
dc.subject.keywordAuthorsubband likelihood-
dc.subject.keywordAuthorreliable feature selection-
dc.subject.keywordAuthoradaptive noise model-
dc.subject.keywordPlusSCORE NORMALIZATION-
dc.subject.keywordPlusMODELS-
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 10 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0