Deep attention models for visual recognition and completion시각 인식 및 합성을 위한 딥러닝 기반 어텐션 모델

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 101
  • Download : 0
The core of human vision is undoubtedly the perception ability, processing and translating the input visuals into structured, symbolic representations that provide the basis for higher-level understanding (visual recognition). Interestingly, this primary function is also designed to be robust on external interruptions. For example, humans can perceive and extract key information from the input, even if they are blurred, occluded, or damaged. We have a strong reasoning ability to examine the input visual signals against the corruptions (visual completion). Indeed, visual recognition and completion are crucial for understanding and interacting with our dynamic visual world. In this thesis, we first explore the effective ways to endow the machine with these two important human abilities and identify the key components to achieve this goal. Moreover, we investigate the synergy of bridging these two seemingly independent fields to further empower the machine vision ability and note the initial signals in this new direction.
Advisors
Kweon, In Soresearcher권인소researcher
Description
한국과학기술원 :전기및전자공학부,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(박사) - 한국과학기술원 : 전기및전자공학부, 2023.2,[x, 104 p. :]

Keywords

시각인식▼a시각합성▼a합성에 의한 인식; Visual recognition▼aVisual completion▼aRecognition by completion

URI
http://hdl.handle.net/10203/309066
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1030563&flag=dissertation
Appears in Collection
EE-Theses_Ph.D.(박사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0