Controllable waveform-domain diffusion model for event-guided foley sound synthesis제어 가능한 이벤트 가이딩 폴리 사운드 합성을 위한 웨이브폼 도메인에서의 디퓨전 모델 활용

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 3
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor남주한-
dc.contributor.authorChung, Yoonjin-
dc.contributor.author정윤진-
dc.date.accessioned2024-07-25T19:30:47Z-
dc.date.available2024-07-25T19:30:47Z-
dc.date.issued2023-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045731&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/320543-
dc.description학위논문(석사) - 한국과학기술원 : 김재철AI대학원, 2023.8,[iv, 29 p. :]-
dc.description.abstractThis paper addresses the challenge of generating realistic and event-aligned Foley sound effects, which play a crucial role in enhancing the immersive experience of various media forms. We propose a generative audio synthesis system that incorporates sound class category and event timing conditions to generate appropriate waveforms. To preserve temporal information and enhance synchronization with specific events, we introduce Block-FiLM, a block-wise feature linear modulation method. Our approach is demonstrated to significantly improve the quality and alignment of generated sounds by experiments and ablation studies. Evaluation results based on objective metrics and subjective listening tests confirm the effectiveness of our approach. Overall, this work contributes to the advancement of Foley sound synthesis and indicates the potential of generative models for automating and streamlining sound production in various domains.-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject폴리 사운드 합성▼a타이밍 가이던스▼a웨이브폼 도메인 디퓨전-
dc.subjectFoley sound synthesis▼aTiming guidance▼aWaveform domain diffusion-
dc.titleControllable waveform-domain diffusion model for event-guided foley sound synthesis-
dc.title.alternative제어 가능한 이벤트 가이딩 폴리 사운드 합성을 위한 웨이브폼 도메인에서의 디퓨전 모델 활용-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :김재철AI대학원,-
dc.contributor.alternativeauthorNam, Juhan-
Appears in Collection
AI-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0