Natural Language Representation as Features for Place Recognition

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 86
  • Download : 0
Visual information is rich in content, and robots require computer vision techniques to encode images into information to utilize the images. Robot vision transforms the image into descriptors using predefined patterns, whether defined by handcrafted or learned methods. However, the image descriptors are not explainable to human intelligence and limit human-robot interaction upon vision tasks. On the other hand, recent studies have discovered an efficient and expandable method of transforming an image into natural language forms. With visual transformers, the context in an image is translated into natural language representations. To create an image representation both understandable to humans and artificial intelligence, in this paper, we present a method of using the language-image model as natural representations for robotic place recognition tasks.
Publisher
Institute of Electrical and Electronics Engineers Inc.
Issue Date
2022-07-04
Language
English
Citation

19th International Conference on Ubiquitous Robots, UR 2022, pp.284 - 287

DOI
10.1109/UR55393.2022.9826253
URI
http://hdl.handle.net/10203/298432
Appears in Collection
EE-Conference Papers(학술회의논문)
Files in This Item
There are no files associated with this item.

qr_code

  • mendeley

    citeulike


rss_1.0 rss_2.0 atom_1.0