Sequential Image-based 3D Object Detection with Location Refinement

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 103
  • Download : 0
Recent advances in object detection tasks enable the detection network to predict 3D objects from a monocular image, but the performance of monocular 3D object detectors is inferior due to the depth information lost in the image. Most monocular 3D detectors do not utilize sequential information from multi-frame images, even though the object’s temporal motion is very informative for 3D object detection. In this paper, we propose a sequential image-based 3D object detection architecture that focuses on improving the localization performance of 3D detectors using temporal information for autonomous driving applications. To this end, the proposed network is trained with a pair of sequential images to predict 3D objects with their localization uncertainties on each image. Afterward, the object detected from sequential images is associated, and paired object features are fed to the sub-network to predict the depth displacement between frames. Finally, paired objects and their predicted depths and depth displacement are refined to minimize residuals between predictions and output the final 3D location of objects. The experimental results on challenging the nuScenes dataset demonstrate that our method improves the performance of the 3D detector by reducing the localization error.
Publisher
IEEE
Issue Date
2022-09-29
Language
English
Citation

2022 26th International Conference on Pattern Recognition, ICPR 2022, pp.3625 - 3631

ISSN
1051-4651
DOI
10.1109/ICPR56361.2022.9956157
URI
http://hdl.handle.net/10203/302173
Appears in Collection
GT-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0