Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming

Detalhes bibliográficos
Autor(a) principal: Carneiro, Murillo Guimarães
Data de Publicação: 2022
Outros Autores: Dutra, Bruna Luiza, Paiva, José Gustavo S., Gabriel, Paulo Henrique Ribeiro, Araújo, Rafael Dias
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Revista Brasileira de Informática na Educação
Texto Completo: https://journals-sol.sbc.org.br/index.php/rbie/article/view/2518
Resumo: Several works in the literature emphasized data mining as efficient tools to identify factors related to retention and dropout in higher education. However, most of these works do not discuss if (or how) such factors may effectively contribute to decrease such rates. This article presents a data mining approach conceived to identify students at retention risk in a course of Intro to Computer Programming as well as guide preventive interventions to help such students to overcome this situation. Our results indicated an averaged predictive performance superior to 80% in both accuracy and F1 when identifying factors related to the retention. Moreover, during the two years of the project execution, the annual success rates in the course were the highest in comparison to the last five years.
id SBC-6_ca42db1004e9a438b84621d2f34aadd2
oai_identifier_str oai:journals-sol.sbc.org.br:article/2518
network_acronym_str SBC-6
network_name_str Revista Brasileira de Informática na Educação
repository_id_str
spelling Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programmingEducational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programmingEducational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programmingEducational data miningHigher educationRetention preventionDropout preventionAcademic analyticsMachine learningData classificationEducational data miningHigher educationRetention preventionDropout preventionAcademic analyticsMachine learningData classificationEducational data miningHigher educationRetention preventionDropout preventionAcademic analyticsMachine learningData classificationSeveral works in the literature emphasized data mining as efficient tools to identify factors related to retention and dropout in higher education. However, most of these works do not discuss if (or how) such factors may effectively contribute to decrease such rates. This article presents a data mining approach conceived to identify students at retention risk in a course of Intro to Computer Programming as well as guide preventive interventions to help such students to overcome this situation. Our results indicated an averaged predictive performance superior to 80% in both accuracy and F1 when identifying factors related to the retention. Moreover, during the two years of the project execution, the annual success rates in the course were the highest in comparison to the last five years.Several works in the literature emphasized data mining as efficient tools to identify factors related to retention and dropout in higher education. However, most of these works do not discuss if (or how) such factors may effectively contribute to decrease such rates. This article presents a data mining approach conceived to identify students at retention risk in a course of Intro to Computer Programming as well as guide preventive interventions to help such students to overcome this situation. Our results indicated an averaged predictive performance superior to 80% in both accuracy and F1 when identifying factors related to the retention. Moreover, during the two years of the project execution, the annual success rates in the course were the highest in comparison to the last five years.Several works in the literature emphasized data mining as efficient tools to identify factors related to retention and dropout in higher education. However, most of these works do not discuss if (or how) such factors may effectively contribute to decrease such rates. This article presents a data mining approach conceived to identify students at retention risk in a course of Intro to Computer Programming as well as support preventive interventions to help such students to overcome this situation. Our results indicated an averaged predictive performance superior to 80% in both accuracy and F1 when identifying factors related to the retention. Moreover, during the two years of the project execution, the annual success rates in the course were the highest in comparison to the last five years.Sociedade Brasileira de Computação2022-09-27info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionPeer-reviewed articleArtículo revisado por paresArtigo avaliado pelos paresapplication/pdfhttps://journals-sol.sbc.org.br/index.php/rbie/article/view/251810.5753/rbie.2022.2518Revista Brasileña de Informática en la Educación; Vol. 30 (2022); 379-395Revista Brasileira de Informática na Educação; Vol. 30 (2022); 379-395Brazilian Journal of Computers in Education; Vol. 30 (2022); 379-3952317-61211414-5685reponame:Revista Brasileira de Informática na Educaçãoinstname:Sociedade Brasileira de Computação (SBC)instacron:SBCenghttps://journals-sol.sbc.org.br/index.php/rbie/article/view/2518/2040Copyright (c) 2022 Murillo Guimarães Carneiro, Bruna Luiza Dutra, José Gustavo S. Paiva, Paulo Henrique Ribeiro Gabriel, Rafael Dias Araújohttps://creativecommons.org/licenses/by-nc-nd/4.0info:eu-repo/semantics/openAccessCarneiro, Murillo GuimarãesDutra, Bruna LuizaPaiva, José Gustavo S.Gabriel, Paulo Henrique RibeiroAraújo, Rafael Dias2022-07-30T14:03:34Zoai:journals-sol.sbc.org.br:article/2518Revistahttps://journals-sol.sbc.org.br/index.php/rbieONGhttps://journals-sol.sbc.org.br/index.php/rbie/oaipublicacoes@sbc.org.br2317-61211414-5685opendoar:2022-07-30T14:03:34Revista Brasileira de Informática na Educação - Sociedade Brasileira de Computação (SBC)false
dc.title.none.fl_str_mv Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
title Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
spellingShingle Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
Carneiro, Murillo Guimarães
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
title_short Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
title_full Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
title_fullStr Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
title_full_unstemmed Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
title_sort Educational data mining to support identification and prevention of academic retention and dropout: a case study in introductory programming
author Carneiro, Murillo Guimarães
author_facet Carneiro, Murillo Guimarães
Dutra, Bruna Luiza
Paiva, José Gustavo S.
Gabriel, Paulo Henrique Ribeiro
Araújo, Rafael Dias
author_role author
author2 Dutra, Bruna Luiza
Paiva, José Gustavo S.
Gabriel, Paulo Henrique Ribeiro
Araújo, Rafael Dias
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Carneiro, Murillo Guimarães
Dutra, Bruna Luiza
Paiva, José Gustavo S.
Gabriel, Paulo Henrique Ribeiro
Araújo, Rafael Dias
dc.subject.por.fl_str_mv Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
topic Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
Educational data mining
Higher education
Retention prevention
Dropout prevention
Academic analytics
Machine learning
Data classification
description Several works in the literature emphasized data mining as efficient tools to identify factors related to retention and dropout in higher education. However, most of these works do not discuss if (or how) such factors may effectively contribute to decrease such rates. This article presents a data mining approach conceived to identify students at retention risk in a course of Intro to Computer Programming as well as guide preventive interventions to help such students to overcome this situation. Our results indicated an averaged predictive performance superior to 80% in both accuracy and F1 when identifying factors related to the retention. Moreover, during the two years of the project execution, the annual success rates in the course were the highest in comparison to the last five years.
publishDate 2022
dc.date.none.fl_str_mv 2022-09-27
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Peer-reviewed article
Artículo revisado por pares
Artigo avaliado pelos pares
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://journals-sol.sbc.org.br/index.php/rbie/article/view/2518
10.5753/rbie.2022.2518
url https://journals-sol.sbc.org.br/index.php/rbie/article/view/2518
identifier_str_mv 10.5753/rbie.2022.2518
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://journals-sol.sbc.org.br/index.php/rbie/article/view/2518/2040
dc.rights.driver.fl_str_mv https://creativecommons.org/licenses/by-nc-nd/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv https://creativecommons.org/licenses/by-nc-nd/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Sociedade Brasileira de Computação
publisher.none.fl_str_mv Sociedade Brasileira de Computação
dc.source.none.fl_str_mv Revista Brasileña de Informática en la Educación; Vol. 30 (2022); 379-395
Revista Brasileira de Informática na Educação; Vol. 30 (2022); 379-395
Brazilian Journal of Computers in Education; Vol. 30 (2022); 379-395
2317-6121
1414-5685
reponame:Revista Brasileira de Informática na Educação
instname:Sociedade Brasileira de Computação (SBC)
instacron:SBC
instname_str Sociedade Brasileira de Computação (SBC)
instacron_str SBC
institution SBC
reponame_str Revista Brasileira de Informática na Educação
collection Revista Brasileira de Informática na Educação
repository.name.fl_str_mv Revista Brasileira de Informática na Educação - Sociedade Brasileira de Computação (SBC)
repository.mail.fl_str_mv publicacoes@sbc.org.br
_version_ 1832111042353692672