SVM Based Speaker Verification Using Sparse Maximum A Posteriori Adaptation

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 411
  • Download : 2
Modern speaker verification systems based on support vector machines (SVMs) use Gaussian mixture model (GMM) supervectors as their input feature vectors, and the maximum a posteriori (MAP) adaptation is a conventional method for generating speaker-dependent GMMs by adapting a universal background model (UBM). MAP adaptation requires the appropriate amount of input utterance due to the number of model parameters to be estimated. On the other hand, with limited utterances, unreliable MAP adaptation can be performed, which causes adaptation noise even though the Bayesian priors used in the MAP adaptation smooth the movements between the UBM and speaker dependent GMMs. This paper proposes a sparse MAP adaptation method, which is known to perform well in the automatic speech recognition area. By introducing sparse MAP adaptation to the GMM-SVM-based speaker verification system, the adaptation noise can be mitigated effectively. The proposed method utilizes the L0 norm as a regularizer to induce sparsity. The experimental results on the TIMIT database showed that the sparse MAP-based GMM-SVM speaker verification system yields a 42.6% relative reduction in the equal error rate with few additional computations.
Issue Date

IEIE Transactions on Smart Processing and Computing, v.2, no.5, pp.277 - 281

Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item


  • mendeley


rss_1.0 rss_2.0 atom_1.0