PartGlot: Learning Shape Part Segmentation from Language Reference Games

Cited 3 time in webofscience Cited 0 time in scopus
  • Hit : 503
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorKoo, Juilko
dc.contributor.authorHuang, Ianko
dc.contributor.authorAchlioptas, Panosko
dc.contributor.authorGuibas, Leonidasko
dc.contributor.authorSung, Minhyukko
dc.date.accessioned2022-11-15T06:00:51Z-
dc.date.available2022-11-15T06:00:51Z-
dc.date.created2022-11-13-
dc.date.created2022-11-13-
dc.date.created2022-11-13-
dc.date.created2022-11-13-
dc.date.issued2022-06-
dc.identifier.citation2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.16484 - 16493-
dc.identifier.issn1063-6919-
dc.identifier.urihttp://hdl.handle.net/10203/299640-
dc.description.abstractWe introduce PartGlot, a neural framework and associated architectures for learning semantic part segmentation of 3D shape geometry, based solely on part referential language. We exploit the fact that linguistic descriptions of a shape can provide priors on the shape's parts - as natural language has evolved to reflect human perception of the compositional structure of objects, essential to their recognition and use. For training we use ShapeGlot's paired geometry /language data collected via a reference game where a speaker produces an utterance to differentiate a target shape from two distractors and the listener has to find the target based on this utterance [3]. Our network is designed to solve this target multi-modal recognition problem, by carefully incorporating a Transformer-based attention module so that the output attention can precisely highlight the semantic part or parts described in the language. Remarkably, the network operates without any direct supervision on the 3D geometry itself. Furthermore, we also demonstrate that the learned part information is generaliz-able to shape classes unseen during training. Our approach opens the possibility of learning 3D shape parts from language alone, without the need for large-scale part geometry annotations, thus facilitating annotation acquisition. The code is available at https://github.com/63days/PartGlot.-
dc.languageEnglish-
dc.publisherCVF and IEEE Computer Society-
dc.titlePartGlot: Learning Shape Part Segmentation from Language Reference Games-
dc.typeConference-
dc.identifier.wosid000870783002029-
dc.identifier.scopusid2-s2.0-85141766725-
dc.type.rimsCONF-
dc.citation.beginningpage16484-
dc.citation.endingpage16493-
dc.citation.publicationname2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)-
dc.identifier.conferencecountryUS-
dc.identifier.conferencelocationNew Orleans, LA-
dc.identifier.doi10.1109/cvpr52688.2022.01601-
dc.contributor.localauthorSung, Minhyuk-
dc.contributor.nonIdAuthorHuang, Ian-
dc.contributor.nonIdAuthorAchlioptas, Panos-
dc.contributor.nonIdAuthorGuibas, Leonidas-
Appears in Collection
CS-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.
This item is cited by other documents in WoS
⊙ Detail Information in WoSⓡ Click to see webofscience_button
⊙ Cited 3 items in WoS Click to see citing articles in records_button

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0