DC Field | Value | Language |
---|---|---|
dc.contributor.advisor | 방효충 | - |
dc.contributor.author | Lim, Chulsoo | - |
dc.contributor.author | 임철수 | - |
dc.date.accessioned | 2024-07-26T19:30:12Z | - |
dc.date.available | 2024-07-26T19:30:12Z | - |
dc.date.issued | 2023 | - |
dc.identifier.uri | http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045990&flag=dissertation | en_US |
dc.identifier.uri | http://hdl.handle.net/10203/320760 | - |
dc.description | Master's thesis - KAIST : Department of Aerospace Engineering, 2023.8, [v, 43 p.] | - |
dc.description.abstract | With accelerating urbanization, the demand for mission performance within cities and indoors is increasing. One of the key capabilities for successful indoor flight is real-time, sensor-data-based collision avoidance. Optimization-based and rule-based collision-avoidance algorithms have been proposed, but they are limited in their applicability to real flight environments and require users to define every scenario in advance. Deep reinforcement learning (DRL) is a methodology in which an unmanned aerial vehicle learns autonomously through a neural network from its actions, states, and rewards, and it can handle partially observable Markov decision processes without users having to define every scenario. In this study, we propose a DRL algorithm applicable in randomized environments that merges sequential depth-map data into a 2D representation, enriching the visual information the reinforcement learning agent perceives. In addition, sequential linear-velocity data is supplied as a further network input so the agent can better handle high-speed environments. The DRL algorithm employed is Proximal Policy Optimization (PPO). The depth-map data is processed by a Convolutional Neural Network (CNN) for feature extraction, while the linear-velocity information is flattened and combined with the image features, after which a Multi-Layer Perceptron (MLP) completes the feature extraction. The resulting network operates end-to-end, and the validity of the algorithm was verified through simulations. | - |
dc.language | eng | - |
dc.publisher | 한국과학기술원 | - |
dc.subject | 심층 강화 학습; 충돌 회피; 무인기; 고속 비행 환경; 시계열 데이터 | - |
dc.subject | Deep reinforcement learning (DRL); Collision avoidance; UAV; High-speed environments; Time-series data | - |
dc.title | Deep reinforcement learning using time-series data for collision avoidance of UAV in high-speed environments | - |
dc.title.alternative | 무인기 고속 운용 환경에서의 충돌 회피 기동을 위한 시계열 데이터 기반 심층 강화학습 연구 | - |
dc.type | Thesis(Master) | - |
dc.identifier.CNRN | 325007 | - |
dc.description.department | 한국과학기술원 : 항공우주공학과 | - |
dc.contributor.alternativeauthor | Bang, Hyochoong | - |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
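The abstract describes a fusion architecture: stacked depth maps pass through a CNN for feature extraction, the flattened linear-velocity history is concatenated with those image features, and an MLP head produces the action. The pure-NumPy sketch below illustrates that fusion under assumed shapes and layer sizes (16x16 depth frames, a single shared convolution kernel, a 32-unit hidden layer, a 3-D action); these are illustrative assumptions, not the thesis's actual PPO network or hyperparameters.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv2d(x, kernel, stride=2):
    """Valid convolution of a single-channel image with one kernel."""
    kh, kw = kernel.shape
    h, w = x.shape
    out_h = (h - kh) // stride + 1
    out_w = (w - kw) // stride + 1
    out = np.empty((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            patch = x[i * stride:i * stride + kh, j * stride:j * stride + kw]
            out[i, j] = np.sum(patch * kernel)
    return out

def relu(x):
    return np.maximum(x, 0.0)

def forward(depth_stack, vel_stack, params):
    # Depth branch: convolve each stacked frame with a shared kernel.
    feats = [relu(conv2d(frame, params["kernel"])) for frame in depth_stack]
    img_feat = np.concatenate([f.ravel() for f in feats])
    # Velocity branch: flatten the time series of linear velocities.
    vel_feat = vel_stack.ravel()
    # Fusion: concatenate both branches and pass through a small MLP head.
    fused = np.concatenate([img_feat, vel_feat])
    hidden = relu(params["W1"] @ fused + params["b1"])
    return params["W2"] @ hidden + params["b2"]  # e.g. a velocity command

# Four stacked 16x16 depth frames and four 3-D linear-velocity samples.
depth_stack = rng.random((4, 16, 16))
vel_stack = rng.random((4, 3))

# Conv output per frame is 7x7 for a 4x4 kernel at stride 2 on a 16x16 input.
feat_dim = 4 * 7 * 7 + 4 * 3
params = {
    "kernel": rng.standard_normal((4, 4)) * 0.1,
    "W1": rng.standard_normal((32, feat_dim)) * 0.05,
    "b1": np.zeros(32),
    "W2": rng.standard_normal((3, 32)) * 0.05,
    "b2": np.zeros(3),
}

action = forward(depth_stack, vel_stack, params)
print(action.shape)  # (3,)
```

In the thesis's end-to-end setting, both branches and the MLP head would be trained jointly by PPO; this sketch only shows the forward pass of the observation-fusion step.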