Speech emotion recognition in noisy environments using adaptive comb filtering and feature vector classification적응 콤 필터링과 특징 벡터 분류 기법을 이용한 잡음 환경에서의 음성 감정 인식

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 801
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisorOh, Yung-Hwan-
dc.contributor.advisor오영환-
dc.contributor.authorPark, Jeong-Sik-
dc.contributor.author박정식-
dc.date.accessioned2011-12-13T05:27:16Z-
dc.date.available2011-12-13T05:27:16Z-
dc.date.issued2010-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=418703&flag=dissertation-
dc.identifier.urihttp://hdl.handle.net/10203/33288-
dc.description학위논문(박사) - 한국과학기술원 : 전산학과, 2010.2, [ viii, 73 p. ]-
dc.description.abstractSpeech processing technology has been continuously advanced as an essential field of artificial intelligence for the last few decades. Researchers are investigating various approaches to make more intelligent and user-friendly applications, which assist and understand humans. Especially, service robot industry remains critical challenges to advance the human-robot interaction and they search for the solutions in voice interface. The most representative technology to enable intelligent machines to comprehend users` emotional state and interact with them is Speech Emotion Recognition (SER), which automatically identifies users` feelings and conditions from their spoken speech. Although many researchers have applied various technical approaches developed for speech recognition and speaker identification to SER system, they should take into account two critical issues. The first one is noisy environments. Many applicable devices of SER such as mobile devices and service robots are inevitably exposed to various background noises. As studied on speech recognition, noise contaminated speech may significantly degrade the recognition performance, and this drawback should be handled in SER equivalently. The next issue is the domain-oriented ambiguity: acoustically similar characteristics between emotions and variable speaker characteristics due to different user speaking styles. Each of these characteristics may cause a substantial amount of overlap between emotion models in feature vector space, thus decreasing SER accuracy. This dissertation aims at preserving the SER performance from each issue mentioned above. To address noisy environmental issue, this dissertation proposes an efficient front-end of SER system. The proposed front-end is based on adaptive comb filtering. Whereas conventional adaptive comb filtering reduces noise components remarkably, it is rarely effective in reducing severe noises. Furthermore, due to the uniformly distributed frequency response...eng
dc.languageeng-
dc.publisher한국과학기술원-
dc.subjectAdaptive comb filtering-
dc.subjectNoisy environments-
dc.subjectSpeech emotion recognition-
dc.subjectFeature vector classification-
dc.subject특징 벡터 분류-
dc.subject적응 콤 필터링-
dc.subject잡음 환경-
dc.subject음성 감정 인식-
dc.titleSpeech emotion recognition in noisy environments using adaptive comb filtering and feature vector classification-
dc.title.alternative적응 콤 필터링과 특징 벡터 분류 기법을 이용한 잡음 환경에서의 음성 감정 인식-
dc.typeThesis(Ph.D)-
dc.identifier.CNRN418703/325007 -
dc.description.department한국과학기술원 : 전산학과, -
dc.identifier.uid020035126-
dc.contributor.localauthorOh, Yung-Hwan-
dc.contributor.localauthor오영환-
Appears in Collection
CS-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0