Semantic-guided de-attention with sharpened triplet marginal loss for visual place recognition

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 453
  • Download : 0
Thanks to Earth-level Street View images from Google Maps, a visual image geo-localization can estimate the coarse location of a query image with a visual place recognition process. However, this can get very challenging when non-static objects change with time, severely degrading image retrieval accuracy. We address the problem of city-scale visual place recognition in complex urban environments crowded with non-static clutters. To this end, we first analyze what clutters degrade similarity matching between the query and database images. Second, we design a self-supervised trainable de-attention module that pre-vents the network from focusing on non-static objects in an input image. In addition, we propose a novel triplet marginal loss called sharpened triplet marginal loss to make feature descriptors more discriminative. Lastly, due to the lack of geo-tagged public datasets with a high density of non-static objects, we propose a clutter augmentation method to evaluate our approach. The experimental results show that our model has notably improved over the existing attention methods in geo-localization tasks on the public bench-mark datasets and on their augmented versions with high population and traffic. Our code is available at https://github.com/ccsmm78/deattention _ with _ stml _ for _ vpr .(c) 2023 The Author(s). Published by Elsevier Ltd.This is an open access article under the CC BY-NC-ND license ( http://creativecommons.org/licenses/by-nc-nd/4.0/ )
Publisher
ELSEVIER SCI LTD
Issue Date
2023-09
Language
English
Article Type
Article
Citation

PATTERN RECOGNITION, v.141

ISSN
0031-3203
DOI
10.1016/j.patcog.2023.109645
URI
http://hdl.handle.net/10203/307421
Appears in Collection
EE-Journal Papers(저널논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0