DSpace at KOASAS: On improving the excitation signal in low-rate CELP coding

On improving the excitation signal in low-rate CELP coding (Korean title, translated: A study on improving the excitation signal of code-excited linear prediction coders at low bit rates)

Kwon, Chul-Hong / 권철홍

The main objective of this dissertation is to bring the bit rate of a CELP coder down to 4.8 kbits/s and lower while maintaining good speech quality. To this end, the work focuses on three major issues: class-dependent modeling, an improved weighting function, and an improved excitation signal. For class-dependent modeling, we propose two new models that classify speech segments and use a different coding structure for each class. For the weighting function, we propose a function that suppresses noise between the harmonics of the speech spectrum. For excitation modeling, we propose an excitation source with a peaky pulse characteristic.

First, we propose a CELP-based mixed source model (C-MSM) coder at 3 kbits/s. The coder classifies speech segments into three types: voiced, unvoiced, and mixed. The class decision for each speech segment and the voiced/unvoiced determination for each frequency band are made by minimizing the perceptually weighted mean-squared error between the original speech and the corresponding reconstructed speech. The excitation for a voiced frame is generated from an adaptive source, which is the output of a long-term predictor. The excitation for an unvoiced frame is generated from a stochastic source, which is a scaled code vector from a Gaussian codebook. For a mixed frame, the proposed coder uses a mixed source that combines a lowpass-filtered adaptive source and a highpass-filtered stochastic source. Simulation results show that the mixed source greatly reduces the buzzy quality associated with conventional LPC vocoders. According to listening tests, the proposed coder at 3 kbits/s is clearly superior to conventional LPC vocoders and is comparable to 4.8 kbits/s CELP coders.

Second, we propose an improved weighting function in the error criterion. In general, the performance of a speech coder depends heavily on the choice of weighting function in the error criterion. Previous methods of...
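The three excitation types named in the abstract (adaptive, stochastic, and mixed) and the error-minimizing class decision can be illustrated with a small sketch. This is not the thesis's implementation: the abstract gives no filter designs, frame sizes, codebook details, or weighting filter, so the moving-average lowpass, its complementary highpass, the identity `weight` placeholder, and every function and parameter name below are assumptions chosen for illustration. The per-band voiced/unvoiced decision is not modeled.

```python
import random

def lowpass(x, taps=5):
    """Causal moving-average FIR lowpass (assumed stand-in filter)."""
    return [sum(x[max(0, n - taps + 1): n + 1]) / taps for n in range(len(x))]

def highpass(x, taps=5):
    """Complementary highpass: the input minus its lowpass part."""
    return [a - b for a, b in zip(x, lowpass(x, taps))]

def adaptive_source(past_excitation, lag, gain, n):
    """Long-term predictor output: past excitation repeated at the pitch lag.
    Requires 1 <= lag <= len(past_excitation)."""
    buf = list(past_excitation)
    start = len(buf)
    for _ in range(n):
        buf.append(buf[-lag])
    return [gain * v for v in buf[start:]]

def stochastic_source(codebook, index, gain):
    """Scaled code vector taken from a Gaussian codebook."""
    return [gain * v for v in codebook[index]]

def mixed_source(adaptive, stochastic, taps=5):
    """Lowpass-filtered adaptive source plus highpass-filtered stochastic source."""
    low = lowpass(adaptive, taps)
    high = highpass(stochastic, taps)
    return [a + b for a, b in zip(low, high)]

def synthesize(excitation, lpc):
    """All-pole LPC synthesis: s[n] = e[n] - sum_k a_k * s[n-k]."""
    out = []
    for n, e in enumerate(excitation):
        acc = e
        for k, a in enumerate(lpc, start=1):
            if n - k >= 0:
                acc -= a * out[n - k]
        out.append(acc)
    return out

def mse(a, b):
    """Mean-squared error between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)

def classify_frame(target, candidates, lpc, weight=lambda s: s):
    """Pick the class whose synthesized speech has the lowest weighted MSE.
    `weight` is an identity placeholder for a perceptual weighting filter."""
    errors = {
        name: mse(weight(target), weight(synthesize(exc, lpc)))
        for name, exc in candidates.items()
    }
    return min(errors, key=errors.get), errors

# Hypothetical frame: 40 samples, 8-entry Gaussian codebook, first-order LPC.
random.seed(0)
frame_len = 40
past = [random.gauss(0.0, 1.0) for _ in range(80)]
codebook = [[random.gauss(0.0, 1.0) for _ in range(frame_len)] for _ in range(8)]
lpc = [-0.9]

voiced = adaptive_source(past, lag=32, gain=0.9, n=frame_len)
unvoiced = stochastic_source(codebook, index=3, gain=0.5)
candidates = {
    "voiced": voiced,
    "unvoiced": unvoiced,
    "mixed": mixed_source(voiced, unvoiced),
}
target = synthesize(candidates["mixed"], lpc)  # stand-in for the original speech
best_class, errors = classify_frame(target, candidates, lpc)
```

In this toy setup the target is built from the mixed excitation, so the decision falls on the mixed class. A real coder would also search the pitch lag, codebook index, and gains inside the same error-minimization loop; here they are fixed by hand.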

Advisors: Un, Chong-Kwan / 은종관

Description: Korea Advanced Institute of Science and Technology (KAIST): Department of Electrical and Electronic Engineering

Publisher: Korea Advanced Institute of Science and Technology (KAIST)

Issue Date: 1994

Identifier: 69661/325007 / 000875030

Language: eng

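Description: Doctoral thesis (Ph.D.) - Korea Advanced Institute of Science and Technology (KAIST): Department of Electrical and Electronic Engineering, 1994.2, [vi, 124 p.]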

URI: http://hdl.handle.net/10203/36233

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=69661&flag=dissertation

Appears in Collection: EE-Theses_Ph.D. (Doctoral theses)

Files in This Item: There are no files associated with this item.

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.
