A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO
| Main Author: | |
|---|---|
| Publication Date: | 2025 |
| Other Authors: | , , |
| Format: | Article |
| Language: | eng |
| Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| Download full: | https://hdl.handle.net/10216/164104 |
Summary: | An autonomous vehicle can sense its environment and operate without human involvement. Its adequate management in an intelligent transportation system could significantly reduce traffic congestion and overall travel time in a network. Adaptive traffic signal controller (ATSC) based on multi-agent systems using state-action-reward-state-action (SARSA (lambda)) are well-known state-of-the-art models to manage autonomous vehicles within urban areas. However, this study found inefficient weights updating mechanisms of the conventional SARSA (lambda) models. Therefore, it proposes a Gaussian function to regulate the eligibility trace vector's decay mechanism effectively. On the other hand, an efficient understanding of the state of the traffic environment is crucial for an agent to take optimal actions. The conventional models feed the state values to the agents through the MinMax normalization technique, which sometimes shows less efficiency and robustness. So, this study suggests the MaxAbs scaled state values instead of MinMax to address the problem. Furthermore, the combination of the A-star routing algorithm and proposed model demonstrated a good increase in performance relatively to the conventional SARSA (lambda)-based routing algorithms. The proposed model and the baselines were implemented in a microscopic traffic simulation environment using the SUMO package over a complex real-world-like 21-intersections network to evaluate their performance. The results showed a reduction of the vehicle's average total waiting time and total stops by a mean value of 59.9% and 17.55% compared to the considered baselines. Also, the A-star combined with the proposed controller outperformed the conventional approaches by increasing the vehicle's average trip speed by 3.4%. |
| id |
RCAP_a2e3b8d763e9c3695cbe5c8846c79d55 |
|---|---|
| oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/164104 |
| network_acronym_str |
RCAP |
| network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository_id_str |
https://opendoar.ac.uk/repository/7160 |
| spelling |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMOCiências Tecnológicas, Ciências da engenharia e tecnologiasTechnological sciences, Engineering and technologyAn autonomous vehicle can sense its environment and operate without human involvement. Its adequate management in an intelligent transportation system could significantly reduce traffic congestion and overall travel time in a network. Adaptive traffic signal controller (ATSC) based on multi-agent systems using state-action-reward-state-action (SARSA (lambda)) are well-known state-of-the-art models to manage autonomous vehicles within urban areas. However, this study found inefficient weights updating mechanisms of the conventional SARSA (lambda) models. Therefore, it proposes a Gaussian function to regulate the eligibility trace vector's decay mechanism effectively. On the other hand, an efficient understanding of the state of the traffic environment is crucial for an agent to take optimal actions. The conventional models feed the state values to the agents through the MinMax normalization technique, which sometimes shows less efficiency and robustness. So, this study suggests the MaxAbs scaled state values instead of MinMax to address the problem. Furthermore, the combination of the A-star routing algorithm and proposed model demonstrated a good increase in performance relatively to the conventional SARSA (lambda)-based routing algorithms. The proposed model and the baselines were implemented in a microscopic traffic simulation environment using the SUMO package over a complex real-world-like 21-intersections network to evaluate their performance. The results showed a reduction of the vehicle's average total waiting time and total stops by a mean value of 59.9% and 17.55% compared to the considered baselines. Also, the A-star combined with the proposed controller outperformed the conventional approaches by increasing the vehicle's average trip speed by 3.4%.20252025-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://hdl.handle.net/10216/164104eng0266-472010.1111/exsy.13301Selim RezaMarta Campos FerreiraJ. J. M. MachadoJoão Manuel R. S. Tavaresinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-28T01:22:03Zoai:repositorio-aberto.up.pt:10216/164104Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T22:49:46.937030Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
| dc.title.none.fl_str_mv |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| title |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| spellingShingle |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO Selim Reza Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
| title_short |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| title_full |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| title_fullStr |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| title_full_unstemmed |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| title_sort |
A citywide TD-learning based intelligent traffic signal control for autonomous vehicles: Performance evaluation using SUMO |
| author |
Selim Reza |
| author_facet |
Selim Reza Marta Campos Ferreira J. J. M. Machado João Manuel R. S. Tavares |
| author_role |
author |
| author2 |
Marta Campos Ferreira J. J. M. Machado João Manuel R. S. Tavares |
| author2_role |
author author author |
| dc.contributor.author.fl_str_mv |
Selim Reza Marta Campos Ferreira J. J. M. Machado João Manuel R. S. Tavares |
| dc.subject.por.fl_str_mv |
Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
| topic |
Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
| description |
An autonomous vehicle can sense its environment and operate without human involvement. Its adequate management in an intelligent transportation system could significantly reduce traffic congestion and overall travel time in a network. Adaptive traffic signal controller (ATSC) based on multi-agent systems using state-action-reward-state-action (SARSA (lambda)) are well-known state-of-the-art models to manage autonomous vehicles within urban areas. However, this study found inefficient weights updating mechanisms of the conventional SARSA (lambda) models. Therefore, it proposes a Gaussian function to regulate the eligibility trace vector's decay mechanism effectively. On the other hand, an efficient understanding of the state of the traffic environment is crucial for an agent to take optimal actions. The conventional models feed the state values to the agents through the MinMax normalization technique, which sometimes shows less efficiency and robustness. So, this study suggests the MaxAbs scaled state values instead of MinMax to address the problem. Furthermore, the combination of the A-star routing algorithm and proposed model demonstrated a good increase in performance relatively to the conventional SARSA (lambda)-based routing algorithms. The proposed model and the baselines were implemented in a microscopic traffic simulation environment using the SUMO package over a complex real-world-like 21-intersections network to evaluate their performance. The results showed a reduction of the vehicle's average total waiting time and total stops by a mean value of 59.9% and 17.55% compared to the considered baselines. Also, the A-star combined with the proposed controller outperformed the conventional approaches by increasing the vehicle's average trip speed by 3.4%. |
| publishDate |
2025 |
| dc.date.none.fl_str_mv |
2025 2025-01-01T00:00:00Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
| format |
article |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/164104 |
| url |
https://hdl.handle.net/10216/164104 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
0266-4720 10.1111/exsy.13301 |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf |
| dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
| instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| instacron_str |
RCAAP |
| institution |
RCAAP |
| reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| repository.mail.fl_str_mv |
info@rcaap.pt |
| _version_ |
1833599886187036672 |