DSpace at KOASAS: (A) study on visual systematic generalization via one-step image generation world model

DSpace at KOASAS

College of Engineering(공과대학)School of Computing(전산학부)CS-Theses_Master(석사논문)

(A) study on visual systematic generalization via one-step image generation world model이미지 생성 월드모델을 통한 시각정보의 체계적 일반화 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 5
Download : 0

Export

Kim, Yeongbin / 김영빈

Systematic compositionality, or the ability to adapt to novel situations by creating a mental model of the world using reusable pieces of knowledge, remains a significant challenge in machine learning. While there has been considerable progress in the language domain, efforts towards systematic visual imagination, or envisioning the dynamical implications of a visual observation, are in their infancy. We introduce the Systematic Visual Imagination Benchmark (SVIB), the first benchmark designed to address this problem head-on. SVIB offers a novel framework for a minimal world modeling problem, where models are evaluated based on their ability to generate one-step image-to-image transformations under a latent world dynamics. The framework provides benefits such as the possibility to jointly optimize for systematic perception and imagination, a range of difficulty levels, and the ability to control the fraction of possible factor combinations used during training. We provide a comprehensive evaluation of various baseline models on SVIB, offering insight into the current state-of-the-art in systematic visual imagination.

Advisors: 안성진 researcher

Description: 한국과학기술원 :전산학부,

Publisher: 한국과학기술원

Issue Date: 2024

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 전산학부, 2024.2,[iv, 40 p. :]

Keywords: 체계적 구성성▼a체계적 일반화▼a벤치마크▼a월드 모델링; Systematic compositionality▼aVisual imagination▼aBenchmark▼aWorld modeling

URI: http://hdl.handle.net/10203/321673

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097253&flag=dissertation

Appears in Collection: CS-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

(A) study on visual systematic generalization via one-step image generation world model이미지 생성 월드모델을 통한 시각정보의 체계적 일반화 연구

KOASAS

Communities & Collections