DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Hahn, Min-Soo | - |
dc.contributor.advisor | 한민수 | - |
dc.contributor.author | Beack, Seung-Kwon | - |
dc.contributor.author | 백승권 | - |
dc.date.accessioned | 2011-12-28T02:43:52Z | - |
dc.date.available | 2011-12-28T02:43:52Z | - |
dc.date.issued | 2005 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392580&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/54551 | - |
dc.description | 학위논문(박사) - 한국정보통신대학교 : 공학부, 2005, [ xi, 144 p. ] | - |
dc.description.abstract | The first issue of this thesis, new spatial audio coding schemes are proposed as a multichannel audio coding scheme. The SAC is a process to represent multichannel audio signals as down-mixed signal with spatial cues. Recently, binaural cue coding (BCC) has been introduced and becomes an important scheme for spatial audio coding. The inter-channel level difference (ICLD) as one of spatial cues of the BCC plays a pivotal role to remove a lot of redundant information. The accuracy of the ICLD, however, can be easily distorted by a quantization process. Instead of the ICLD, a new representation method of ICLD is proposed and it dramatically overcomes the quantization distortion. Another proposed scheme, global vector split based virtual source location information is newly presented as a SAC scheme. The GS-VSLI is analyzed on the semicircle plane and represented as angles. Spectral distortion measurement is conducted to confirm the usefulness of the GS-VSLI. As the second issue of the thesis, audio source separation techniques are dealt with. The object-based audio rendering is a method in order to make an auditory scene automatically. The core technique to realize object based audio processing is a blind source separation which makes multitude audio separated into object audio. For the robustness of our algorithm, the frequency-domain block-based multichannel blind deconvolution (MBD) with a normalization matrix is proposed. The normalization is designed to overcome the intrinsic problems of the time-domain MBD such as the whitening effect and the slow convergence. The experimental results confirm that the proposed MBD algorithm is superior to the previous works. | eng |
dc.language | eng | - |
dc.publisher | 한국정보통신대학교 | - |
dc.subject | Virutual Source Location Information | - |
dc.subject | Binaural Cue Coding | - |
dc.subject | MPEG-4 Spatial Audio Coding | - |
dc.subject | Blind Source Separation | - |
dc.subject | 블라인드 소스 분리 | - |
dc.subject | 가상 음원 위치 정보 | - |
dc.subject | 바이노럴 큐 코딩 | - |
dc.subject | MPEG-4 공간 오디오 코딩 | - |
dc.title | Multi-channel audio processing techniques | - |
dc.title.alternative | 음상정보를 이용한 공간 오디오 코딩 기술과 주파수 영역 오디오 음원분리 기술 기반의 다채널 오디오 처리 기술 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 392580/225023 | - |
dc.description.department | 한국정보통신대학교 : 공학부, | - |
dc.identifier.uid | 020015320 | - |
dc.contributor.localauthor | Hahn, Min-Soo | - |
dc.contributor.localauthor | 한민수 | - |
dc.title.subtitle | angle information based spatial audio coding and frequency domain based audio source separation | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.