Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction

Bibliographic Details
Main Author: Santos, Daniel
Publication Date: 2021
Other Authors: Saias, José, Quaresma, Paulo, Nogueira, Vitor
Format: Article
Language: por
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10174/32115
https://doi.org/10.3390/computers10120157
Summary: Traffic accidents are one of the most important concerns of the world, since they result in numerous casualties, injuries, and fatalities each year, as well as significant economic losses. There are many factors that are responsible for causing road accidents. If these factors can be better understood and predicted, it might be possible to take measures to mitigate the damages and its severity. The purpose of this work is to identify these factors using accident data from 2016 to 2019 from the district of Setúbal, Portugal. This work aims at developing models that can select a set of influential factors that may be used to classify the severity of an accident, supporting an analysis on the accident data. In addition, this study also proposes a predictive model for future road accidents based on past data. Various machine learning approaches are used to create these models. Supervised machine learning methods such as decision trees (DT), random forests (RF), logistic regression (LR), and naive Bayes (NB) are used, as well as unsupervised machine learning techniques including DBSCAN and hierarchical clustering. Results show that a rule-based model using the C5.0 algorithm is capable of accurately detecting the most relevant factors describing a road accident severity. Further, the results of the predictive model suggests the RF model could be a useful tool for forecasting accident hotspots.
id RCAP_cc293c6e73ba912ebbe0bcab19ce05eb
oai_identifier_str oai:dspace.uevora.pt:10174/32115
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Machine Learning Approaches to Traffic Accident Analysis and Hotspot Predictionmachine learningdata analysisoad accident dataclusteringdecision treesrandom forestsTraffic accidents are one of the most important concerns of the world, since they result in numerous casualties, injuries, and fatalities each year, as well as significant economic losses. There are many factors that are responsible for causing road accidents. If these factors can be better understood and predicted, it might be possible to take measures to mitigate the damages and its severity. The purpose of this work is to identify these factors using accident data from 2016 to 2019 from the district of Setúbal, Portugal. This work aims at developing models that can select a set of influential factors that may be used to classify the severity of an accident, supporting an analysis on the accident data. In addition, this study also proposes a predictive model for future road accidents based on past data. Various machine learning approaches are used to create these models. Supervised machine learning methods such as decision trees (DT), random forests (RF), logistic regression (LR), and naive Bayes (NB) are used, as well as unsupervised machine learning techniques including DBSCAN and hierarchical clustering. Results show that a rule-based model using the C5.0 algorithm is capable of accurately detecting the most relevant factors describing a road accident severity. Further, the results of the predictive model suggests the RF model could be a useful tool for forecasting accident hotspots.FCT Fundação para a Ciência e a Tecnologia, under the project with reference FCT DSAIPA/DS/0090/2018, “MOPREVIS—Modelação e Predição de Acidentes de Viação no Distrito de Setúbal”.MDPI2022-05-30T11:00:32Z2022-05-302021-11-24T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/32115http://hdl.handle.net/10174/32115https://doi.org/10.3390/computers10120157porSantos, D.; Saias, J.; Quaresma, P.; Nogueira, V.B. Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction. Computers 2021, 10, 157.https://www.mdpi.com/2073-431X/10/12/157/htmdfsantos@uevora.ptjsaias@uevora.ptpq@uevora.ptvbn@uevora.pt283Santos, DanielSaias, JoséQuaresma, PauloNogueira, Vitorinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-01-03T19:32:30Zoai:dspace.uevora.pt:10174/32115Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T12:27:11.836219Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
title Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
spellingShingle Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
Santos, Daniel
machine learning
data analysis
oad accident data
clustering
decision trees
random forests
title_short Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
title_full Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
title_fullStr Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
title_full_unstemmed Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
title_sort Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction
author Santos, Daniel
author_facet Santos, Daniel
Saias, José
Quaresma, Paulo
Nogueira, Vitor
author_role author
author2 Saias, José
Quaresma, Paulo
Nogueira, Vitor
author2_role author
author
author
dc.contributor.author.fl_str_mv Santos, Daniel
Saias, José
Quaresma, Paulo
Nogueira, Vitor
dc.subject.por.fl_str_mv machine learning
data analysis
oad accident data
clustering
decision trees
random forests
topic machine learning
data analysis
oad accident data
clustering
decision trees
random forests
description Traffic accidents are one of the most important concerns of the world, since they result in numerous casualties, injuries, and fatalities each year, as well as significant economic losses. There are many factors that are responsible for causing road accidents. If these factors can be better understood and predicted, it might be possible to take measures to mitigate the damages and its severity. The purpose of this work is to identify these factors using accident data from 2016 to 2019 from the district of Setúbal, Portugal. This work aims at developing models that can select a set of influential factors that may be used to classify the severity of an accident, supporting an analysis on the accident data. In addition, this study also proposes a predictive model for future road accidents based on past data. Various machine learning approaches are used to create these models. Supervised machine learning methods such as decision trees (DT), random forests (RF), logistic regression (LR), and naive Bayes (NB) are used, as well as unsupervised machine learning techniques including DBSCAN and hierarchical clustering. Results show that a rule-based model using the C5.0 algorithm is capable of accurately detecting the most relevant factors describing a road accident severity. Further, the results of the predictive model suggests the RF model could be a useful tool for forecasting accident hotspots.
publishDate 2021
dc.date.none.fl_str_mv 2021-11-24T00:00:00Z
2022-05-30T11:00:32Z
2022-05-30
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/32115
http://hdl.handle.net/10174/32115
https://doi.org/10.3390/computers10120157
url http://hdl.handle.net/10174/32115
https://doi.org/10.3390/computers10120157
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv Santos, D.; Saias, J.; Quaresma, P.; Nogueira, V.B. Machine Learning Approaches to Traffic Accident Analysis and Hotspot Prediction. Computers 2021, 10, 157.
https://www.mdpi.com/2073-431X/10/12/157/htm
dfsantos@uevora.pt
jsaias@uevora.pt
pq@uevora.pt
vbn@uevora.pt
283
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv MDPI
publisher.none.fl_str_mv MDPI
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833592828882583552