Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning

Bibliographic Details
Main Author: De Almeida, Aline Gabriel [UNESP]
Publication Date: 2023
Other Authors: Colombini, Esther Luna, Da Silva Simoes, Alexandre [UNESP]
Format: Conference object
Language: eng
Source: Repositório Institucional da UNESP
Download full: http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034
https://hdl.handle.net/11449/297822
Summary: Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.
id UNSP_e5a8dfac668d45908a986967a86d2eeb
oai_identifier_str oai:repositorio.unesp.br:11449/297822
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement LearningProximal Policy Optimization (PPO)Reinforcement LearningTiltrotorTwin-Delayed Deep Deterministic Policy Gradient (TD3)Unmanned Aerial Vehicle (UAV)Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.Inst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)Institute of Computing University of Campinas (Unicamp)Inst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)Universidade Estadual Paulista (UNESP)Universidade Estadual de Campinas (UNICAMP)De Almeida, Aline Gabriel [UNESP]Colombini, Esther LunaDa Silva Simoes, Alexandre [UNESP]2025-04-29T18:07:49Z2023-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject107-112http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112.https://hdl.handle.net/11449/29782210.1109/LARS/SBR/WRE59448.2023.103330342-s2.0-85181114602Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023info:eu-repo/semantics/openAccess2025-04-30T13:53:20Zoai:repositorio.unesp.br:11449/297822Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462025-04-30T13:53:20Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
spellingShingle Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
De Almeida, Aline Gabriel [UNESP]
Proximal Policy Optimization (PPO)
Reinforcement Learning
Tiltrotor
Twin-Delayed Deep Deterministic Policy Gradient (TD3)
Unmanned Aerial Vehicle (UAV)
title_short Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_full Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_fullStr Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_full_unstemmed Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_sort Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
author De Almeida, Aline Gabriel [UNESP]
author_facet De Almeida, Aline Gabriel [UNESP]
Colombini, Esther Luna
Da Silva Simoes, Alexandre [UNESP]
author_role author
author2 Colombini, Esther Luna
Da Silva Simoes, Alexandre [UNESP]
author2_role author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (UNESP)
Universidade Estadual de Campinas (UNICAMP)
dc.contributor.author.fl_str_mv De Almeida, Aline Gabriel [UNESP]
Colombini, Esther Luna
Da Silva Simoes, Alexandre [UNESP]
dc.subject.por.fl_str_mv Proximal Policy Optimization (PPO)
Reinforcement Learning
Tiltrotor
Twin-Delayed Deep Deterministic Policy Gradient (TD3)
Unmanned Aerial Vehicle (UAV)
topic Proximal Policy Optimization (PPO)
Reinforcement Learning
Tiltrotor
Twin-Delayed Deep Deterministic Policy Gradient (TD3)
Unmanned Aerial Vehicle (UAV)
description Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.
publishDate 2023
dc.date.none.fl_str_mv 2023-01-01
2025-04-29T18:07:49Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034
Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112.
https://hdl.handle.net/11449/297822
10.1109/LARS/SBR/WRE59448.2023.10333034
2-s2.0-85181114602
url http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034
https://hdl.handle.net/11449/297822
identifier_str_mv Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112.
10.1109/LARS/SBR/WRE59448.2023.10333034
2-s2.0-85181114602
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 107-112
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv repositoriounesp@unesp.br
_version_ 1834482577513119744