Multi-modal depth estimation from misaligned thermal and RGB images비정렬 열 영상과 자연 영상으로부터의 다중 모달 깊이 추정

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 5
  • Download : 0
DC FieldValueLanguage
dc.contributor.advisor김문철-
dc.contributor.authorKwon, Byeongjun-
dc.contributor.author권병준-
dc.date.accessioned2024-07-30T19:31:27Z-
dc.date.available2024-07-30T19:31:27Z-
dc.date.issued2024-
dc.identifier.urihttp://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1097161&flag=dissertationen_US
dc.identifier.urihttp://hdl.handle.net/10203/321589-
dc.description학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2024.2,[v, 45 p. :]-
dc.description.abstractDepth estimation is a research area that focuses on predicting the depth of each pixel in an input image when matched to a 3D space. Research on depth estimation is highly applicable in various fields such as autonomous driving and virtual reality. Particularly in recent years, it has become a crucial study in the field of autonomous driving and robot vision. In this thesis, we propose an effective deep learning method for estimating the depth of images by simultaneously utilizing thermal and RGB images, which are actively studied for enhancing driver and pedestrian safety through automatic pedestrian detection in autonomous driving. In this thesis, we propose the method that complementarily predicts the depth from misaligned thermal and RGB images. Specifically, to utilize consistent information from thermal images captured during nighttime and RGB images representing consistent information during daytime, we propose: (i) feature extraction from misaligned thermal and RGB images and their Cross-fusion module, (ii) a shared encoder and decoder structure for multi-modal image input, and (iii) Multi-objective training strategy for simultaneous supervised training from multi-modal supervision. In particular, we use cross-attention methods to effectively extract features for depth prediction from corresponding positions in misaligned thermal and RGB images. Through various experiments, our proposed method demonstrates its effectiveness, achieving performance improvements of 7%-points and 4%-points, respectively, compared to using only each modal input (thermal or RGB images).-
dc.languageeng-
dc.publisher한국과학기술원-
dc.subject깊이 추정▼a멀티 모달▼a비정렬▼a강인함-
dc.subjectDepth estimation (DE)▼aMulti-modal▼aMisalignment▼aRobustness-
dc.titleMulti-modal depth estimation from misaligned thermal and RGB images-
dc.title.alternative비정렬 열 영상과 자연 영상으로부터의 다중 모달 깊이 추정-
dc.typeThesis(Master)-
dc.identifier.CNRN325007-
dc.description.department한국과학기술원 :전기및전자공학부,-
dc.contributor.alternativeauthorKim, Munchurl-
Appears in Collection
EE-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0