Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning

De Almeida, Aline Gabriel [UNESP]; Colombini, Esther Luna; Da Silva Simoes, Alexandre [UNESP]

Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning

Detalhes bibliográficos
Autor(a) principal:	De Almeida, Aline Gabriel [UNESP]
Data de Publicação:	2023
Outros Autores:	Colombini, Esther Luna, Da Silva Simoes, Alexandre [UNESP]
Tipo de documento:	Artigo de conferência
Idioma:	eng
Título da fonte:	Repositório Institucional da UNESP
Texto Completo:	http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034 https://hdl.handle.net/11449/297822
Resumo:	Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.

Metadados do item

id	UNSP_e5a8dfac668d45908a986967a86d2eeb
oai_identifier_str	oai:repositorio.unesp.br:11449/297822
network_acronym_str	UNSP
network_name_str	Repositório Institucional da UNESP
repository_id_str	2946
spelling	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement LearningProximal Policy Optimization (PPO)Reinforcement LearningTiltrotorTwin-Delayed Deep Deterministic Policy Gradient (TD3)Unmanned Aerial Vehicle (UAV)Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.Inst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)Institute of Computing University of Campinas (Unicamp)Inst. of Science and Tech. of Sorocaba São Paulo State University (Unesp)Universidade Estadual Paulista (UNESP)Universidade Estadual de Campinas (UNICAMP)De Almeida, Aline Gabriel [UNESP]Colombini, Esther LunaDa Silva Simoes, Alexandre [UNESP]2025-04-29T18:07:49Z2023-01-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject107-112http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112.https://hdl.handle.net/11449/29782210.1109/LARS/SBR/WRE59448.2023.103330342-s2.0-85181114602Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengProceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023info:eu-repo/semantics/openAccess2025-11-15T05:09:02Zoai:repositorio.unesp.br:11449/297822Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462025-11-15T05:09:02Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
spellingShingle	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning De Almeida, Aline Gabriel [UNESP] Proximal Policy Optimization (PPO) Reinforcement Learning Tiltrotor Twin-Delayed Deep Deterministic Policy Gradient (TD3) Unmanned Aerial Vehicle (UAV)
title_short	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_full	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_fullStr	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_full_unstemmed	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
title_sort	Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning
author	De Almeida, Aline Gabriel [UNESP]
author_facet	De Almeida, Aline Gabriel [UNESP] Colombini, Esther Luna Da Silva Simoes, Alexandre [UNESP]
author_role	author
author2	Colombini, Esther Luna Da Silva Simoes, Alexandre [UNESP]
author2_role	author author
dc.contributor.none.fl_str_mv	Universidade Estadual Paulista (UNESP) Universidade Estadual de Campinas (UNICAMP)
dc.contributor.author.fl_str_mv	De Almeida, Aline Gabriel [UNESP] Colombini, Esther Luna Da Silva Simoes, Alexandre [UNESP]
dc.subject.por.fl_str_mv	Proximal Policy Optimization (PPO) Reinforcement Learning Tiltrotor Twin-Delayed Deep Deterministic Policy Gradient (TD3) Unmanned Aerial Vehicle (UAV)
topic	Proximal Policy Optimization (PPO) Reinforcement Learning Tiltrotor Twin-Delayed Deep Deterministic Policy Gradient (TD3) Unmanned Aerial Vehicle (UAV)
description	Unmanned Aerial Vehicles (UAVs) have gained significant attention in various domains due to their versatility and potential applications. Effective control of UAVs is crucial for achieving desired flight behaviors and optimizing their performance. This paper presents a comprehensive exploration of learning-based approaches for controlling UAVs with fixed-rotors and tiltrotors, specifically focusing on the Proximal Policy Optimization (PPO) and Twin-Delayed Deep Deterministic Policy Gradient (TD3) algorithms. The study aims to compare and evaluate the efficacy of these two state-of-the-art reinforcement learning algorithms in controlling UAVs with varying designs and control complexities. By utilizing PPO and TD3, we address the challenges associated with maneuvering UAVs in dynamic environments and achieving precise control under different flight conditions. We conducted extensive simulations to assess the performance of PPO and TD3 algorithms in diverse UAV scenarios, considering multiple design configurations and control requirements. The evaluation criteria encompassed stability, robustness, trajectory tracking accuracy, and control efficiency. Results demonstrate the suitability and effectiveness of both PPO and TD3 in controlling UAVs.
publishDate	2023
dc.date.none.fl_str_mv	2023-01-01 2025-04-29T18:07:49Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/conferenceObject
format	conferenceObject
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034 Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112. https://hdl.handle.net/11449/297822 10.1109/LARS/SBR/WRE59448.2023.10333034 2-s2.0-85181114602
url	http://dx.doi.org/10.1109/LARS/SBR/WRE59448.2023.10333034 https://hdl.handle.net/11449/297822
identifier_str_mv	Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023, p. 107-112. 10.1109/LARS/SBR/WRE59448.2023.10333034 2-s2.0-85181114602
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	Proceedings - 2023 Latin American Robotics Symposium, 2023 Brazilian Symposium on Robotics, and 2023 Workshop of Robotics in Education, LARS/SBR/WRE 2023
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	107-112
dc.source.none.fl_str_mv	Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP
instname_str	Universidade Estadual Paulista (UNESP)
instacron_str	UNESP
institution	UNESP
reponame_str	Repositório Institucional da UNESP
collection	Repositório Institucional da UNESP
repository.name.fl_str_mv	Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv	repositoriounesp@unesp.br
_version_	1851766683045199872

Controlling Tiltrotors Unmanned Aerial Vehicles (UAVs) with Deep Reinforcement Learning

Registros relacionados