DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | Yoo, Chang Dong | - |
dc.contributor.advisor | 유창동 | - |
dc.contributor.author | Nguyen, Van Cao | - |
dc.date.accessioned | 2021-05-13T19:34:41Z | - |
dc.date.available | 2021-05-13T19:34:41Z | - |
dc.date.issued | 2020 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=911426&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/284796 | - |
dc.description | 학위논문(석사) - 한국과학기술원 : 전기및전자공학부, 2020.2,[iii, 22 p. :] | - |
dc.description.abstract | Scene graph generation is the task of detecting objects and their pairwise visual relationships in an image. This task is challenging because it requires not only detecting object instances of various categories but also high-level understandings of visual relation. Besides accurate object detection, a high-performance predicate classifier is critical for an applicable scene graph generation model for real-world systems. To the best of our knowledge, all prior works in scene graph generation align a union region of two objects and flatten it into a vector without any processing in the middle. In this work, we propose a Union Region Attention module to focus on objects and the interaction between them in union image regions. Namely, this module generates a spatial attention mask for the union region of two objects by using embedding features from their labels, appearances, and spatial configuration. The mask puts higher weights on important points of objects than irrelevant points in the union region. Despite its simplicity, the model with our proposed module outperforms other state-of-the-art predicate prediction methods. Experiments on Visual Genome confirm the efficacy of our module. We also conduct ablation studies to provide insight into the operation of each component of our union region attention module. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | Scene Graph Generation▼aPredicate Prediction▼aGuided Attention▼aComputer Vision▼aDeep Learning | - |
dc.subject | 장면 그래프 생성▼a술어 예측▼a지도된 집중▼a컴퓨터 비전▼a딥 러닝 | - |
dc.title | Scene graph generation with attended predicate prediction | - |
dc.title.alternative | 주의 집중된 관계 추론을 통한 장면 그래프 생성 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 :전기및전자공학부, | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.