Prompt-guided DETR with RoI-pruned masked attention for open-vocabulary object detection

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 6
  • Download : 0
DC FieldValueLanguage
dc.contributor.authorSong, Hwanjunko
dc.contributor.authorBang, Jihwanko
dc.date.accessioned2024-08-29T02:00:07Z-
dc.date.available2024-08-29T02:00:07Z-
dc.date.created2024-06-11-
dc.date.issued2024-11-
dc.identifier.citationPATTERN RECOGNITION, v.155-
dc.identifier.issn0031-3203-
dc.identifier.urihttp://hdl.handle.net/10203/322447-
dc.description.abstractPrompt-OVD is an efficient and effective DETR-based framework for open -vocabulary object detection that utilizes class embeddings from CLIP as prompts, guiding the Transformer decoder to detect objects in base and novel classes. Additionally, our RoI-pruned masked attention helps leverage the zero -shot classification ability of the Vision Transformer -based CLIP, resulting in improved detection performance at a minimal computational cost. Our experiments on the OV-COCO and OV-LVIS datasets demonstrate that Prompt-OVD achieves an impressive 21.2 times faster inference speed than the first end -to -end open -vocabulary detection method (OVDETR), while also achieving higher APs than four two -stage methods operating within similar inference time ranges. We release the code at https://github.com/DISL-Lab/Prompt-OVD.-
dc.languageEnglish-
dc.publisherELSEVIER SCI LTD-
dc.titlePrompt-guided DETR with RoI-pruned masked attention for open-vocabulary object detection-
dc.typeArticle-
dc.identifier.wosid001257471900001-
dc.identifier.scopusid2-s2.0-85195806366-
dc.type.rimsART-
dc.citation.volume155-
dc.citation.publicationnamePATTERN RECOGNITION-
dc.identifier.doi10.1016/j.patcog.2024.110648-
dc.contributor.localauthorSong, Hwanjun-
dc.description.isOpenAccessN-
dc.type.journalArticleArticle-
dc.subject.keywordAuthorOVD-
dc.subject.keywordAuthorTransformer-
dc.subject.keywordAuthorObject detection-
dc.subject.keywordAuthorOpen-vocabulary detection-
Appears in Collection
IE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0