Deep Least Squares Regression for Speaker Adaptation

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 177
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKim, Younggwanko
dc.contributor.authorLim, Hyungjunko
dc.contributor.authorGoo, Jahyunko
dc.contributor.authorKim, Hoi-Rinko
dc.date.accessioned2017-12-05T01:33:55Z-
dc.date.available2017-12-05T01:33:55Z-
dc.date.created2017-11-27-
dc.date.created2017-11-27-
dc.date.created2017-11-27-
dc.date.issued2017-08-21-
dc.identifier.citation18th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2017), pp.729 - 733-
dc.identifier.issn2308-457X-
dc.identifier.urihttp://hdl.handle.net/10203/227311-
dc.description.abstractRecently, speaker adaptation methods in deep neural networks (DNNs) have been widely studied for automatic speech recognition. However, almost all adaptation methods for DNNs have to consider various heuristic conditions such as mini-batch sizes, learning rate scheduling, stopping criteria, and initialization conditions because of the inherent property of a stochastic gradient descent (SGD)-based training process. Unfortunately, those heuristic conditions are hard to be properly tuned. To alleviate those difficulties, in this paper, we propose a least squares regression -based speaker adaptation method in a DNN framework utilizing posterior mean of each class. Also, we show how the proposed method can provide a unique solution which is quite easy and fast to calculate without SGD. The proposed method was evaluated in the TED-LIUM corpus. Experimental results showed that the proposed method achieved up to a 4.6% relative improvement against a speaker independent DNN. In addition, we report further performance improvement of the proposed method with speaker-adapted features.-
dc.languageEnglish-
dc.publisherInternational Speech Communication Association (ISCA)-
dc.titleDeep Least Squares Regression for Speaker Adaptation-
dc.typeConference-
dc.identifier.wosid000457505000148-
dc.identifier.scopusid2-s2.0-85039163440-
dc.type.rimsCONF-
dc.citation.beginningpage729-
dc.citation.endingpage733-
dc.citation.publicationname18th Annual Conference of the International-Speech-Communication-Association (INTERSPEECH 2017)-
dc.identifier.conferencecountrySW-
dc.identifier.conferencelocationStockholm University-
dc.identifier.doi10.21437/Interspeech.2017-783-
dc.contributor.localauthorKim, Hoi-Rin-
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0