DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Oh, Yung-Hwan | - |
dc.contributor.advisor | 오영환 | - |
dc.contributor.author | Park, Jeong-Sik | - |
dc.contributor.author | 박정식 | - |
dc.date.accessioned | 2011-12-13T05:27:16Z | - |
dc.date.available | 2011-12-13T05:27:16Z | - |
dc.date.issued | 2010 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=418703&flag=dissertation | - |
dc.identifier.uri | http://hdl.handle.net/10203/33288 | - |
dc.description | 학위논문(박사) - 한국과학기술원 : 전산학과, 2010.2, [ viii, 73 p. ] | - |
dc.description.abstract | Speech processing technology has been continuously advanced as an essential field of artificial intelligence for the last few decades. Researchers are investigating various approaches to make more intelligent and user-friendly applications, which assist and understand humans. Especially, service robot industry remains critical challenges to advance the human-robot interaction and they search for the solutions in voice interface. The most representative technology to enable intelligent machines to comprehend users` emotional state and interact with them is Speech Emotion Recognition (SER), which automatically identifies users` feelings and conditions from their spoken speech. Although many researchers have applied various technical approaches developed for speech recognition and speaker identification to SER system, they should take into account two critical issues. The first one is noisy environments. Many applicable devices of SER such as mobile devices and service robots are inevitably exposed to various background noises. As studied on speech recognition, noise contaminated speech may significantly degrade the recognition performance, and this drawback should be handled in SER equivalently. The next issue is the domain-oriented ambiguity: acoustically similar characteristics between emotions and variable speaker characteristics due to different user speaking styles. Each of these characteristics may cause a substantial amount of overlap between emotion models in feature vector space, thus decreasing SER accuracy. This dissertation aims at preserving the SER performance from each issue mentioned above. To address noisy environmental issue, this dissertation proposes an efficient front-end of SER system. The proposed front-end is based on adaptive comb filtering. Whereas conventional adaptive comb filtering reduces noise components remarkably, it is rarely effective in reducing severe noises. Furthermore, due to the uniformly distributed frequency response... | eng |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Adaptive comb filtering | - |
dc.subject | Noisy environments | - |
dc.subject | Speech emotion recognition | - |
dc.subject | Feature vector classification | - |
dc.subject | 특징 벡터 분류 | - |
dc.subject | 적응 콤 필터링 | - |
dc.subject | 잡음 환경 | - |
dc.subject | 음성 감정 인식 | - |
dc.title | Speech emotion recognition in noisy environments using adaptive comb filtering and feature vector classification | - |
dc.title.alternative | 적응 콤 필터링과 특징 벡터 분류 기법을 이용한 잡음 환경에서의 음성 감정 인식 | - |
dc.type | Thesis(Ph.D) | - |
dc.identifier.CNRN | 418703/325007 | - |
dc.description.department | 한국과학기술원 : 전산학과, | - |
dc.identifier.uid | 020035126 | - |
dc.contributor.localauthor | Oh, Yung-Hwan | - |
dc.contributor.localauthor | 오영환 | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.