DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Kim, Hoi-Rin | - |
dc.contributor.advisor | 김회린 | - |
dc.contributor.author | Kim, Young-Gwan | - |
dc.contributor.author | 김영관 | - |
dc.date.accessioned | 2011-12-14T02:29:30Z | - |
dc.date.available | 2011-12-14T02:29:30Z | - |
dc.date.issued | 2010 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=419099&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/40102 | - |
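dc.description | Thesis (Master's) - KAIST (Korea Advanced Institute of Science and Technology) : Department of Information and Communications Engineering, 2010.2, [ viii, 49 p. ] | - |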
dc.description.abstract | The statistical model-based voice activity detector (SMVAD) is an algorithm that detects speech regions in an input signal and is robust across various noise conditions; it uses statistical models of noise and noisy speech, such as the complex Gaussian probability density function (PDF). The decision rule of the SMVAD is based on the likelihood ratio test (LRT). However, the LRT-based decision rule may cause detection errors because of the statistical properties of noise and speech signals. In this paper, we analyze why these detection errors occur. To reduce them, we propose two modified decision rules that use reliable likelihood ratios (LRs), where reliability is determined by the spectral power of each frequency bin. We also propose a weighting scheme that takes into account the spectral characteristics of noise and speech signals. In addition, to reduce the spectral variation within the same type of noise signal, we propose a spectral smoothing method for the input signal and explain its effects. The proposed methods are evaluated using receiver operating characteristic (ROC) curves and compared with three conventional methods in various noise environments. In most noise conditions, the proposed methods outperform the conventional methods. The experimental results also show that the proposed weighting scheme, which is applied to each LR, provides the most stable performance improvement for the SMVAD. | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Spectral smoothing | - |
dc.subject | Reliability of likelihood ratio | - |
dc.subject | Voice activity detector | - |
dc.subject | Statistical model | - |
dc.subject | Likelihood ratio weighting | - |
dc.subject | 우도비 가중치 | - |
dc.subject | 스펙트럼 평탄화 | - |
dc.subject | 우도비의 신뢰도 | - |
dc.subject | 음성검출기 | - |
dc.subject | 통계모델 | - |
dc.title | Improvement of Statistical model-based noise-robust voice activity detector | - |
dc.title.alternative | 잡음에 강인한 통계모델기반 음성검출기의 개선 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 419099/325007 | - |
dc.description.department | KAIST (Korea Advanced Institute of Science and Technology) : Department of Information and Communications Engineering | - |
dc.identifier.uid | 020084206 | - |
dc.contributor.localauthor | Kim, Hoi-Rin | - |
dc.contributor.localauthor | 김회린 | - |
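
The abstract describes a decision rule built on per-bin likelihood ratios under a complex Gaussian model. As a rough illustration only, the sketch below shows the textbook form of such a test: for each frequency bin, the log LR is `gamma * xi / (1 + xi) - log(1 + xi)`, where `gamma` is the a posteriori SNR and `xi` the a priori SNR. The function names, the `weights` argument, and the default threshold are hypothetical and are not taken from the thesis; in particular, the uniform default weights stand in for the thesis's weighting scheme, whose details are not given in this record.

```python
import numpy as np

def log_likelihood_ratios(noisy_power, noise_power, xi):
    """Per-bin log LR for speech-present vs. noise-only hypotheses.

    Assumes zero-mean complex Gaussian spectra. `xi` (a priori SNR) is
    assumed to be supplied by some external estimator.
    """
    gamma = noisy_power / noise_power           # a posteriori SNR per bin
    return gamma * xi / (1.0 + xi) - np.log1p(xi)

def is_speech(noisy_power, noise_power, xi, threshold=0.0, weights=None):
    """Compare a weighted average of the per-bin log LRs with a threshold.

    `weights` is a hypothetical placeholder; uniform weights reduce this
    to the plain geometric-mean LRT.
    """
    llr = log_likelihood_ratios(noisy_power, noise_power, xi)
    if weights is None:
        weights = np.full(llr.shape, 1.0 / llr.size)
    return float(np.dot(weights, llr)) > threshold

# Toy frames with 4 frequency bins (made-up numbers, for illustration).
noise = np.ones(4)
noise_only_frame = is_speech(np.ones(4), noise, xi=np.full(4, 0.1))
speech_frame = is_speech(np.full(4, 10.0), noise, xi=np.full(4, 9.0))
```

Sweeping `threshold` over a range of values and recording the detection and false-alarm rates is how an ROC curve of the kind mentioned in the abstract would typically be traced.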