DSpace at KOASAS: Multi-Modal Place Recognition via Vectorized HD Maps and Images Fusion for Autonomous Driving

DSpace at KOASAS

College of Engineering(공과대학)Cho Chun Shik Graduate School for Mobility(조천식모빌리티대학원)GT-Journal Papers(저널논문)

Multi-Modal Place Recognition via Vectorized HD Maps and Images Fusion for Autonomous Driving

Cited 0 time in webofscience

Cited 0 time in

Hit : 74
Download : 0

Export

Jeong, Hyeonjun / Shin, Juyeb / Rameau, Francois / Kum, Dongsuk researcher

The deployment of autonomous vehicles and mobile robots requires light, fast, and robust visual place recognition strategies. While visual place recognition has proven effective in favorable conditions, its performance quickly drops when faced with abundant visual cues, such as repeating image patterns commonly found in driving environments. To address this problem, a new representation that incorporates geometric cues with structural semantics can also be utilized to find the position of an agent to distribute the reliance on visual cues. In this letter, we present the first multi-modal place recognition for autonomous driving that utilizes both images and vectorized HD maps. The vectorized HD maps have the advantage of being lightweight and providing geometric cues with structural semantics, making them particularly well-suited for place recognition. To accomplish this, we employ a hierarchical graph neural network to extract a compact and robust descriptor from a local vectorized map that can be captured from surrounding images. Although HD maps provide concise geometric cues with structural semantics, they sometimes do not provide sufficient features for place recognition, contrary to images. To cope with this limitation, we propose to adaptively fuse both descriptors extracted from maps and images in order to combine the best complementary aspects of each modality via a transformer-based solution. Extensive experiments on large-scale driving datasets, NuScenes and Argoverse2, demonstrate that our multi-modal visual localization outperforms visual-only approaches. Specifically, ours improves the baseline up to 6.48%p in Recall@1 with less than 10 ms additional computation.

Publisher: IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Issue Date: 2024-05

Language: English

Article Type: Article

Citation: IEEE ROBOTICS AND AUTOMATION LETTERS, v.9, no.5, pp.4710 - 4717

ISSN: 2377-3766

DOI: 10.1109/lra.2024.3374193

URI: http://hdl.handle.net/10203/319049

Appears in Collection: GT-Journal Papers(저널논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Multi-Modal Place Recognition via Vectorized HD Maps and Images Fusion for Autonomous Driving

KOASAS

Communities & Collections