Learning-based object detection and 6D pose estimation toward vision-based underwater robotic grasping = 수중 로봇의 물체 파지를 위한 인공지능 기반 물체 인식 및 자세 추정 방법

Cited 0 time in webofscience Cited 0 time in scopus
  • Hit : 75
  • Download : 0
Our final goal is grasping the object in underwater based on the RGB camera installed in AUV. In this thesis, we made the base framework for grasping the object. So we propose the method of improving underwater object detection and pose estimation sequentially. Firstly, underwater images are affected by the various optical variation such as color distortion, intensity degeneration, haze, and so on. So to recognize the object in underwater images, we have to add additional process to remove the optical variation for the accurate object detection. Especially, in deep learning-based object detection model, the training set applied above process is the most effective to obtain the outstanding performance. In this thesis, we propose the novel method of generating the underwater dataset. This dataset reflects the various optical conditions which are color distortion, intensity degeneration, haze effect. Also, the object occlusion is included in our dataset generation process. In the experiment, we evaluate the suitability of our dataset for the underwater environment to determine if our dataset reflects the underwater environment. As the next step, we introduce a rotational primitive prediction based 6D object pose estimation using a single image as an input. Our approach initially trains a Variational AutoEncoder (VAE) to learn the code for each object, which is then further refined by a novel rotational primitive decoder. Doing so substantially improves the orientation estimation in a direct regression fashion as well as overall pose estimation performance. To better capture the representation of the learned code, we concatenate the sampled codes prior to the orientation estimation. Lastly, translation is inferred using an object relocalization module. Because of the enhanced rotational discriminative code, high accuracy is achieved for symmetric and occluded objects. In addition, to make a more accurate pose estimation result, we propose RGB-based pose refinement network.
Kim, Ayoungresearcher김아영researcher
한국과학기술원 :로봇공학학제전공,
Issue Date

학위논문(석사) - 한국과학기술원 : 로봇공학학제전공, 2020.2,[iv, 47 p. :]


Underwater Environment▼aObject Detection▼a6D Object Pose Estimation▼aDeep Learning▼aauto encoder; 수중환경▼a물체 인식▼a물체 자세▼a딥러닝▼a오토엔코더

Appears in Collection
Files in This Item
There are no files associated with this item.


  • mendeley


rss_1.0 rss_2.0 atom_1.0