Multi-channel audio processing techniques : angle information based spatial audio coding and frequency domain based audio source separation음상정보를 이용한 공간 오디오 코딩 기술과 주파수 영역 오디오 음원분리 기술 기반의 다채널 오디오 처리 기술

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 523
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorHahn, Min-Soo-
dc.contributor.advisor한민수-
dc.contributor.authorBeack, Seung-Kwon-
dc.contributor.author백승권-
dc.date.accessioned2011-12-28T02:43:52Z-
dc.date.available2011-12-28T02:43:52Z-
dc.date.issued2005-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=392580&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/54551-
dc.description학위논문(박사) - 한국정보통신대학교 : 공학부, 2005, [ xi, 144 p. ]-
dc.description.abstractThe first issue of this thesis, new spatial audio coding schemes are proposed as a multichannel audio coding scheme. The SAC is a process to represent multichannel audio signals as down-mixed signal with spatial cues. Recently, binaural cue coding (BCC) has been introduced and becomes an important scheme for spatial audio coding. The inter-channel level difference (ICLD) as one of spatial cues of the BCC plays a pivotal role to remove a lot of redundant information. The accuracy of the ICLD, however, can be easily distorted by a quantization process. Instead of the ICLD, a new representation method of ICLD is proposed and it dramatically overcomes the quantization distortion. Another proposed scheme, global vector split based virtual source location information is newly presented as a SAC scheme. The GS-VSLI is analyzed on the semicircle plane and represented as angles. Spectral distortion measurement is conducted to confirm the usefulness of the GS-VSLI. As the second issue of the thesis, audio source separation techniques are dealt with. The object-based audio rendering is a method in order to make an auditory scene automatically. The core technique to realize object based audio processing is a blind source separation which makes multitude audio separated into object audio. For the robustness of our algorithm, the frequency-domain block-based multichannel blind deconvolution (MBD) with a normalization matrix is proposed. The normalization is designed to overcome the intrinsic problems of the time-domain MBD such as the whitening effect and the slow convergence. The experimental results confirm that the proposed MBD algorithm is superior to the previous works.eng
dc.languageeng-
dc.publisher한국정보통신대학교-
dc.subjectVirutual Source Location Information-
dc.subjectBinaural Cue Coding-
dc.subjectMPEG-4 Spatial Audio Coding-
dc.subjectBlind Source Separation-
dc.subject블라인드 소스 분리-
dc.subject가상 음원 위치 정보-
dc.subject바이노럴 큐 코딩-
dc.subjectMPEG-4 공간 오디오 코딩-
dc.titleMulti-channel audio processing techniques-
dc.title.alternative음상정보를 이용한 공간 오디오 코딩 기술과 주파수 영역 오디오 음원분리 기술 기반의 다채널 오디오 처리 기술-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN392580/225023-
dc.description.department한국정보통신대학교 : 공학부, -
dc.identifier.uid020015320-
dc.contributor.localauthorHahn, Min-Soo-
dc.contributor.localauthor한민수-
dc.title.subtitleangle information based spatial audio coding and frequency domain based audio source separation-
Appears in Collection
School of Engineering-Theses_Ph.D(공학부 박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0