Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
Main Author: | |
---|---|
Publication Date: | 2023 |
Other Authors: | , |
Format: | Conference object |
Language: | eng |
Source: | Repositório Institucional da UNESP |
Download full: | http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034 https://hdl.handle.net/11449/297822 |
Summary: | Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs. |
id |
UNSP_e5a8dfac668d45908a986967a86d2eeb |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/297822 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement LearningProximal Policy Optimization (PPO)Reinforcement LearningTiltrotorTwin-Delayed Deep Deterministic Policy Gradient (TD3)Unmanned Aerial Vehicle (UAV)Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.Inst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)Institute of Computing University of Campinas (Unicamp)Inst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)Universidade Estadual Paulista (UNESP)Universidade Estadual de Campinas (UNICAMP)De Almeida, Aline Gabriel [UNESP]Colombini, Esther LunaDa Silva Simoes, Alexandre [UNESP]2025-04-29T18:07:49Z2023-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject107-112http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112.https://hdl.handle.net/11449/29782210.1109/LARS/SBR/WRE59448.2023.103330342-s2.0-85181114602Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023info:eu-repo/semantics/openAccess2025-04-30T13:53:20Zoai:repositorio.unesp.br:11449/297822Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462025-04-30T13:53:20Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
dc.title.none.fl_str_mv |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
title |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
spellingShingle |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning De Almeida, Aline Gabriel [UNESP] Proximal Policy Optimization (PPO) Reinforcement Learning Tiltrotor Twin-Delayed Deep Deterministic Policy Gradient (TD3) Unmanned Aerial Vehicle (UAV) |
title_short |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
title_full |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
title_fullStr |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
title_full_unstemmed |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
title_sort |
Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning |
author |
De Almeida, Aline Gabriel [UNESP] |
author_facet |
De Almeida, Aline Gabriel [UNESP] Colombini, Esther Luna Da Silva Simoes, Alexandre [UNESP] |
author_role |
author |
author2 |
Colombini, Esther Luna Da Silva Simoes, Alexandre [UNESP] |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (UNESP) Universidade Estadual de Campinas (UNICAMP) |
dc.contributor.author.fl_str_mv |
De Almeida, Aline Gabriel [UNESP] Colombini, Esther Luna Da Silva Simoes, Alexandre [UNESP] |
dc.subject.por.fl_str_mv |
Proximal Policy Optimization (PPO) Reinforcement Learning Tiltrotor Twin-Delayed Deep Deterministic Policy Gradient (TD3) Unmanned Aerial Vehicle (UAV) |
topic |
Proximal Policy Optimization (PPO) Reinforcement Learning Tiltrotor Twin-Delayed Deep Deterministic Policy Gradient (TD3) Unmanned Aerial Vehicle (UAV) |
description |
Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-01-01 2025-04-29T18:07:49Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034 Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112. https://hdl.handle.net/11449/297822 10.1109/LARS/SBR/WRE59448.2023.10333034 2-s2.0-85181114602 |
url |
http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034 https://hdl.handle.net/11449/297822 |
identifier_str_mv |
Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112. 10.1109/LARS/SBR/WRE59448.2023.10333034 2-s2.0-85181114602 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
107-112 |
dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
repositoriounesp@unesp.br |
_version_ |
1834482577513119744 |