Image translation and manipulation with disentangled representations분리된 표현을 통한 이미지 변환 및 조작

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 144
  • Download : 0
This paper proposes various methods to improve controllability during the generation process by disentangling the representation in the generation process. The study presents five methodologies. Firstly, by introducing new structural elements to StyleGAN, we divided the generation process into spatially invariant style control and spatial content control parts, thus achieving improved generation controllability. Secondly, a new methodology for image style transfer using text conditions was proposed, allowing the change of image texture information according to the given text condition while maintaining the structural components of the image. Thirdly, we proposed a new methodology for fine-tuning StyleGAN, enabling the generation of a new model capable of translating the images into the style of a given single-shot target image. Fourthly, we proposed a novel image translation strategy using a diffusion-based generative model that maintains structural content information during the sampling process while only changing style information. Lastly, leveraging the most recent text-to-image diffusion model, we extended the advantages of representation disentanglement, which was the goal of this research, to other tasks by combining it with image translation and concept personalization tasks.
Advisors
예종철researcher
Description
한국과학기술원 :바이오및뇌공학과,
Publisher
한국과학기술원
Issue Date
2024
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 바이오및뇌공학과, 2024.8,[xiv,127 p. :]

Keywords

Computer vision; Image Translation; Style Transfer; Generative Model; Disentangled Representation; 컴퓨터 비전; 이미지 변환; 스타일 변환; 생성모델; 분리된 표현

URI
http://hdl.handle.net/10203/332016
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1109660&flag=dissertation
Appears in Collection
AI-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0