Learning 3D object decomposition via natural language descriptions자연어를 활용한 3차원 객체 분할 학습

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 149
  • Download : 0
We introduce PartGlot, a neural framework and associated architectures for learning semantic part segmentation of 3D shape geometry, based solely on part referential language. We exploit the fact that linguistic descriptions of a shape can provide priors on the shape’s parts – as natural language has evolved to reflect human perception of the compositional structure of objects, essential to their recognition and use. For training we use ShapeGlot’s paired geometry / language data collected via a reference game where a speaker produces an utterance to differentiate a target shape from two distractors and the listener has to find the target based on this utterance. Our network is designed to solve this target multi-modal recognition problem, by carefully incorporating a Transformer-based attention module so that the output attention can precisely highlight the semantic part or parts described in the language. Remarkably, the network operates without any direct supervision on the 3D geometry itself. Furthermore, we also demonstrate that the learned part information is generalizable to shape classes unseen during training. Our approach opens the possibility of learning 3D shape parts from language alone, without the need for large-scale part geometry annotations, thus facilitating annotation acquisition. This thesis is written based on a published paper that the candidate wrote as the first author.
Advisors
Sung, Minhyukresearcher성민혁researcher
Description
한국과학기술원 :전산학부,
Publisher
한국과학기술원
Issue Date
2023
Identifier
325007
Language
eng
Description

학위논문(석사) - 한국과학기술원 : 전산학부, 2023.2,[iv, 40 p. :]

Keywords

3D Object Decomposition▼aNatural Language Processing▼aMulti-Modal Learning▼aComputer Vision▼aComputer Graphics▼aDeep Neural Network▼aDeep Learning; 3차원 객체 분할▼a자연어 처리▼a멀티 모달 학습▼a컴퓨터 비전▼a컴퓨터 그래픽스▼a심층 신경망▼a딥러닝

URI
http://hdl.handle.net/10203/309544
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1032959&flag=dissertation
Appears in Collection
CS-Theses_Master(석사논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0