DSpace at KOASAS: Deep reinforcement learning using time-series data for collision avoidance of UAV in high-speed environments

DSpace at KOASAS

College of Engineering(공과대학)School of Mechanical and Aerospace Engineering(기계항공공학부)Dept. of Aerospace Engineering(항공우주공학과)AE-Theses_Master(석사논문)

Deep reinforcement learning using time-series data for collision avoidance of UAV in high-speed environments무인기 고속 운용 환경에서의 충돌 회피 기동을 위한 시계열 데이터 기반 심층 강화학습 연구

Cited 0 time in webofscience

Cited 0 time in scopus

Hit : 3
Download : 0

Export

Lim, Chulsoo / 임철수

With urbanization and acceleration, the demand for mission performance within cities and indoors is increasing. One of the key elements for successful indoor flight is the ability to perform real-time data-based collision avoidance. While optimization-based algorithms and rule-based approaches have been proposed for collision avoidance, they have shown limitations in their applicability to real flight environments and the requirement for users to define all scenarios. Deep reinforcement learning (DRL) is a methodology that uses neural network structures for unmanned aerial vehicles to learn autonomously based on their actions, states, and reward states. Deep reinforcement learning learns to tackle partially observable Markov decision processes without users having to define every scenario. In this study, we propose a deep reinforcement learning algorithm that can be applied in random environments, incorporating sequential depth map data merged into a 2D representation to enhance the visual information the reinforcement learning agent perceives. Additionally, we include sequential linear velocity data to better understand high-speed environments as an additional input to the network. The deep reinforcement learning network utilized Proximal Policy Optimization (PPO). The depth map data is processed through a Convolution Neural Network (CNN) for feature extraction, while the linear velocity information is combined with the image network after flattening, which completes feature extraction through a Multi-Layer Perceptron (MLP) network. The network employed in this study can be operated in an end-to-end environment. Furthermore, the validity of this algorithm was verified through simulations.

Advisors: 방효충 researcher

Description: 한국과학기술원 :항공우주공학과,

Publisher: 한국과학기술원

Issue Date: 2023

Identifier: 325007

Language: eng

Description: 학위논문(석사) - 한국과학기술원 : 항공우주공학과, 2023.8,[v, 43 p. :]

Keywords: 심층 강화 학습▼a충돌 회피▼a무인기▼a고속 비행 환경▼a시계열 데이터; Deep reinforcement learning (DRL)▼aCollision avoidance▼aUAV▼aHigh-speed environments▼aTime-series data

URI: http://hdl.handle.net/10203/320760

Link: http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045990&flag=dissertation

Appears in Collection: AE-Theses_Master(석사논문)

Files in This Item: There are no files associated with this item.

Display Full Item Record

qr_code

트윗하기

KOASAS

Knowledge Service Development Team, KAIST 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea. T. 82-42-350-4493 Email. koasas@kaist.ac.kr
Copyright © 2016. Korea Advanced Institute of Science and Technology. All Rights Reserved.

KOASAS

KOASAS

Browse

Deep reinforcement learning using time-series data for collision avoidance of UAV in high-speed environments무인기 고속 운용 환경에서의 충돌 회피 기동을 위한 시계열 데이터 기반 심층 강화학습 연구

KOASAS

Communities & Collections