Deep reinforcement learning using time-series data for collision avoidance of UAV in high-speed environments

As urbanization accelerates, demand is growing for unmanned aerial vehicles (UAVs) that can carry out missions within cities and indoors. One of the key requirements for successful indoor flight is the ability to avoid collisions in real time based on sensor data. Optimization-based algorithms and rule-based approaches have been proposed for collision avoidance, but they are difficult to apply in real flight environments and require users to define every scenario in advance. Deep reinforcement learning (DRL) uses neural networks to let a UAV learn autonomously from its states, actions, and rewards, and it can tackle partially observable Markov decision processes (POMDPs) without users having to define every scenario. In this study, we propose a DRL algorithm that can be applied in random environments. Sequential depth maps are merged into a single 2D representation to enrich the visual information the agent perceives, and sequential linear velocity data is supplied as an additional network input so that the agent can better interpret high-speed environments. The agent is trained with Proximal Policy Optimization (PPO). The depth map data is passed through a Convolutional Neural Network (CNN) for feature extraction and flattened; the linear velocity information is then combined with the flattened image features, and a Multi-Layer Perceptron (MLP) completes the feature extraction. The network can be operated end to end. The validity of the algorithm was verified through simulations.
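The data flow described in the abstract can be illustrated with a minimal NumPy sketch. It is not the thesis implementation: the side-by-side tiling of depth frames, the single convolution layer, all layer sizes, and the function names (`merge_depth_frames`, `conv2d_relu`, `init_params`, `policy_forward`, `ppo_clip_objective`) are assumptions chosen for illustration. Only the overall structure (merged depth maps into a CNN, flatten, concatenate velocity history, MLP, PPO objective) follows the abstract.

```python
import numpy as np

def merge_depth_frames(frames):
    """Tile T depth maps of shape (H, W) side by side into one (H, T*W) image.
    The tiling layout is an assumption; the abstract only says 'merged into 2D'."""
    return np.concatenate(frames, axis=1)

def conv2d_relu(image, kernels, stride=2):
    """Valid cross-correlation of a (H, W) image with (C, k, k) kernels, then ReLU."""
    c, k, _ = kernels.shape
    h_out = (image.shape[0] - k) // stride + 1
    w_out = (image.shape[1] - k) // stride + 1
    out = np.empty((c, h_out, w_out))
    for i in range(h_out):
        for j in range(w_out):
            patch = image[i * stride:i * stride + k, j * stride:j * stride + k]
            out[:, i, j] = np.tensordot(kernels, patch, axes=([1, 2], [0, 1]))
    return np.maximum(out, 0.0)

def init_params(num_frames, height, width, channels=4, k=3, stride=2,
                hidden=32, vel_dim=3, act_dim=3, seed=0):
    """Random weights for the sketch; every size here is hypothetical."""
    rng = np.random.default_rng(seed)
    h_out = (height - k) // stride + 1
    w_out = (num_frames * width - k) // stride + 1
    fused_dim = channels * h_out * w_out + num_frames * vel_dim
    return {
        "kernels": rng.normal(0.0, 0.1, (channels, k, k)),
        "w1": rng.normal(0.0, 0.1, (hidden, fused_dim)),
        "b1": np.zeros(hidden),
        "w2": rng.normal(0.0, 0.1, (act_dim, hidden)),
        "b2": np.zeros(act_dim),
    }

def policy_forward(depth_frames, velocities, params):
    """CNN features are flattened, joined with the velocity history, then fed to an MLP."""
    image = merge_depth_frames(depth_frames)
    features = conv2d_relu(image, params["kernels"]).ravel()
    fused = np.concatenate([features, np.asarray(velocities).ravel()])
    hidden = np.tanh(params["w1"] @ fused + params["b1"])
    return params["w2"] @ hidden + params["b2"]  # e.g. mean of a Gaussian action

def ppo_clip_objective(ratio, advantage, eps=0.2):
    """Standard PPO clipped surrogate: mean of min(r*A, clip(r, 1-eps, 1+eps)*A)."""
    clipped = np.clip(ratio, 1.0 - eps, 1.0 + eps)
    return np.mean(np.minimum(ratio * advantage, clipped * advantage))

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    frames = [rng.random((16, 16)) for _ in range(4)]   # 4 sequential depth maps
    velocities = rng.random((4, 3))                     # 4 sequential (vx, vy, vz)
    params = init_params(num_frames=4, height=16, width=16)
    print(policy_forward(frames, velocities, params).shape)
```

Concatenating the velocity history only after the image features are flattened keeps the convolutional layers purely visual, which matches the ordering the abstract describes. A practical implementation would more likely use a deep learning framework with several convolution layers and a separate value head.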
Advisors
Bang, Hyochoong (방효충)
Description
Korea Advanced Institute of Science and Technology (KAIST): Department of Aerospace Engineering
Publisher
Korea Advanced Institute of Science and Technology (KAIST)
Issue Date
2023
Identifier
325007
Language
eng
Description

Thesis (Master's) - Korea Advanced Institute of Science and Technology (KAIST): Department of Aerospace Engineering, 2023.8, [v, 43 p.]

Keywords

Deep reinforcement learning (DRL); Collision avoidance; UAV; High-speed environments; Time-series data

URI
http://hdl.handle.net/10203/320760
Link
http://library.kaist.ac.kr/search/detail/view.do?bibCtrlNo=1045990&flag=dissertation
Appears in Collection
AE-Theses_Master (Master's theses)
Files in This Item
There are no files associated with this item.
