DSpace at KOASAS: Audio-driven 3D visual dubbing with mask generation

DSpace at KOASAS

College of Liberal Arts and Convergence Science(인문사회융합과학대학)Graduate School of Culture Technology(문화기술대학원)GCT-Theses_Master(석사논문)

Audio-driven 3D visual dubbing with mask generation마스크 생성 기법을 활용한 음성 기반 3D 비주얼 더빙

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 2
Download : 0

Export

Na, Hyeonho / 나현호

This paper discusses the method of generating a new 3D character dubbing animation using 3D character facial expressions and voice audio. The proposed approach aims to create an animation that preserves the facial expressions of the input animation while incorporating appropriate mouth movements for the voice audio. Deep learning is employed to extract expression information from the input facial animation and generate mouth movement information from the provided voice audio. Based on this information, a new animation is generated, and an additional mechanism is introduced to compare the expression information between the input facial animation and the output dubbed animation, thereby enhancing performance. Additionally, a masking process is trained to separate the regions surrounding the mouth of the 3D facial model from other areas, enabling the prediction of this division without manual intervention.

Advisors: 노준용 researcher

Description: 한국과학기술원 :문화기술대학원,

Publisher: 한국과학기술원

Issue Date: 2023

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 문화기술대학원, 2023.8,[iv, 21 p. :]

Keywords: 비주얼 더빙▼a음성 기반 얼굴 애니메이션▼a오디오 프로세싱▼a딥러닝; VIsual dubbing▼aAudio-driven facial animation▼aAudio processing▼aDeep-learning

URI: http://hdl.handle.net/10203/320586

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045774&flag=dissertation

Appears in Collection: GCT-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Audio-driven 3D visual dubbing with mask generation마스크 생성 기법을 활용한 음성 기반 3D 비주얼 더빙

KOASAS

Communities & Collections