Audio-driven 3D visual dubbing with mask generation (Korean title: 마스크 생성 기법을 활용한 음성 기반 3D 비주얼 더빙)

This thesis presents a method for generating a new dubbed animation of a 3D character from the character's facial animation and a voice audio track. The proposed approach aims to preserve the facial expressions of the input animation while producing mouth movements that match the voice audio. Deep learning is used to extract expression information from the input facial animation and to generate mouth movement information from the voice audio. A new animation is generated from these two sources of information, and an additional mechanism that compares the expression information of the input facial animation with that of the output dubbed animation is introduced to improve performance. In addition, a masking process is trained to separate the region around the mouth of the 3D facial model from the rest of the face, so that this division is predicted without manual intervention.
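The role of the mouth-region mask described in the abstract can be illustrated with a small sketch. This is not the thesis implementation: it assumes, purely for illustration, that each frame is a list of per-vertex 3D offsets, that the mask is a per-vertex weight in [0, 1] (1 for the mouth region), and that the expression comparison is a mask-weighted squared error. The function names `blend_with_mask` and `expression_consistency` are hypothetical, and the mask is passed in by hand here, whereas the thesis trains a model to predict it.

```python
from typing import List, Sequence, Tuple

Vec3 = Tuple[float, float, float]


def _clamp01(w: float) -> float:
    """Clamp a mask weight to the range [0, 1]."""
    return min(1.0, max(0.0, w))


def blend_with_mask(
    expr_offsets: Sequence[Vec3],
    mouth_offsets: Sequence[Vec3],
    mask: Sequence[float],
) -> List[Vec3]:
    """Blend two sets of per-vertex offsets (illustrative assumption).

    mask = 1 takes the audio-driven mouth offset,
    mask = 0 keeps the offset from the input expression animation.
    """
    if not (len(expr_offsets) == len(mouth_offsets) == len(mask)):
        raise ValueError("all inputs must have one entry per vertex")
    blended: List[Vec3] = []
    for e, m, w in zip(expr_offsets, mouth_offsets, mask):
        w = _clamp01(w)
        blended.append(
            (
                w * m[0] + (1.0 - w) * e[0],
                w * m[1] + (1.0 - w) * e[1],
                w * m[2] + (1.0 - w) * e[2],
            )
        )
    return blended


def expression_consistency(
    input_offsets: Sequence[Vec3],
    output_offsets: Sequence[Vec3],
    mask: Sequence[float],
) -> float:
    """Weighted mean squared difference outside the mouth region.

    A hypothetical stand-in for the comparison between the input and
    output expression information; vertices are weighted by (1 - mask).
    """
    total = 0.0
    weight = 0.0
    for a, b, w in zip(input_offsets, output_offsets, mask):
        k = 1.0 - _clamp01(w)
        total += k * sum((x - y) ** 2 for x, y in zip(a, b))
        weight += k
    return total / weight if weight > 0.0 else 0.0


if __name__ == "__main__":
    expr = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]   # from the input animation
    mouth = [(0.0, 0.0, 0.0), (0.0, 0.0, 2.0)]  # driven by the audio
    mask = [0.0, 1.0]                           # second vertex is "mouth"
    out = blend_with_mask(expr, mouth, mask)
    print(out)
    print(expression_consistency(expr, out, mask))
```

With a hard 0/1 mask, the non-mouth vertices are copied unchanged from the input animation, so the consistency value is zero. A soft mask would trade off the two sources near the boundary of the mouth region.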
Advisors
Noh, Junyong (노준용)
Description
Korea Advanced Institute of Science and Technology (KAIST): Graduate School of Culture Technology
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
2023
Identifier
325007
Language
eng
Description

Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): Graduate School of Culture Technology, 2023.8, [iv, 21 p.]

Keywords

Visual dubbing; Audio-driven facial animation; Audio processing; Deep learning

URI
http://hdl.handle.net/10203/320586
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045774&flag=dissertation
Appears in Collection
GCT-Theses_Master (Master's theses)
Files in This Item
There are no files associated with this item.
