Question Classification with Constrained Resources: A Study with Coding Exercises

Bibliographic Details
Main Author: Rodrigues L.
Publication Date: 2023
Other Authors: Pereira F., Santos J., Oliveira E., Mello R., Marques L., Dermeval D., Gasparini, Isabela, Bittencourt I.I., Isotani S.
Format: Conference object
Language: eng
Source: Repositório Institucional da Udesc
Download full: https://repositorio.udesc.br/handle/UDESC/2610
Summary: © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.Evidence-based learning strategies, such as the testing effect, might help address the achievement gap. However, exploiting the testing effect depends on having a set of instructional activities with fine-grained tagging. While instructors might find questions in textbooks, they often lack fine-grained tagging, and data labeling is laborious. Despite much research on text classification, to our best knowledge, state-of-the-art question classifiers are mostly based on extensive models (i.e., BERT) and English text. Respectively, those are incompatible with the resource-constrained devices (e.g., mobile) and languages (e.g., Portuguese) of many underprivileged countries in the global south. Therefore, we developed a question classifier on top of DistilBERT, a version of BERT compatible with resource-constrained applications, using grid search and hold-out. Based on a corpus of 1045 coding questions written in Brazilian Portuguese, we found a model that achieved a near-perfect performance on unseen data, similar to last-generation results using BERT for English text. Thus, we present a step towards equitable education by i) providing underprivileged Portuguese-speaking countries with the support that enables opportunities already available for first-world countries and ii) demonstrating the feasibility of creating resource-constrained applications compatible with state-of-the-art AIED systems.
id UDESC-2_da630b49328f886c76d8d1b1313b501c
oai_identifier_str oai:repositorio.udesc.br:UDESC/2610
network_acronym_str UDESC-2
network_name_str Repositório Institucional da Udesc
repository_id_str 6391
spelling Question Classification with Constrained Resources: A Study with Coding Exercises© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.Evidence-based learning strategies, such as the testing effect, might help address the achievement gap. However, exploiting the testing effect depends on having a set of instructional activities with fine-grained tagging. While instructors might find questions in textbooks, they often lack fine-grained tagging, and data labeling is laborious. Despite much research on text classification, to our best knowledge, state-of-the-art question classifiers are mostly based on extensive models (i.e., BERT) and English text. Respectively, those are incompatible with the resource-constrained devices (e.g., mobile) and languages (e.g., Portuguese) of many underprivileged countries in the global south. Therefore, we developed a question classifier on top of DistilBERT, a version of BERT compatible with resource-constrained applications, using grid search and hold-out. Based on a corpus of 1045 coding questions written in Brazilian Portuguese, we found a model that achieved a near-perfect performance on unseen data, similar to last-generation results using BERT for English text. Thus, we present a step towards equitable education by i) providing underprivileged Portuguese-speaking countries with the support that enables opportunities already available for first-world countries and ii) demonstrating the feasibility of creating resource-constrained applications compatible with state-of-the-art AIED systems.2024-12-05T16:27:57Z2023info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjectp. 734 - 7401865-093710.1007/978-3-031-36336-8_113https://repositorio.udesc.br/handle/UDESC/2610Communications in Computer and Information Science1831 CCISRodrigues L.Pereira F.Santos J.Oliveira E.Mello R.Marques L.Dermeval D.Gasparini, IsabelaBittencourt I.I.Isotani S.engreponame:Repositório Institucional da Udescinstname:Universidade do Estado de Santa Catarina (UDESC)instacron:UDESCinfo:eu-repo/semantics/openAccess2024-12-07T20:39:16Zoai:repositorio.udesc.br:UDESC/2610Biblioteca Digital de Teses e Dissertaçõeshttps://pergamumweb.udesc.br/biblioteca/index.phpPRIhttps://repositorio-api.udesc.br/server/oai/requestri@udesc.bropendoar:63912024-12-07T20:39:16Repositório Institucional da Udesc - Universidade do Estado de Santa Catarina (UDESC)false
dc.title.none.fl_str_mv Question Classification with Constrained Resources: A Study with Coding Exercises
title Question Classification with Constrained Resources: A Study with Coding Exercises
spellingShingle Question Classification with Constrained Resources: A Study with Coding Exercises
Rodrigues L.
title_short Question Classification with Constrained Resources: A Study with Coding Exercises
title_full Question Classification with Constrained Resources: A Study with Coding Exercises
title_fullStr Question Classification with Constrained Resources: A Study with Coding Exercises
title_full_unstemmed Question Classification with Constrained Resources: A Study with Coding Exercises
title_sort Question Classification with Constrained Resources: A Study with Coding Exercises
author Rodrigues L.
author_facet Rodrigues L.
Pereira F.
Santos J.
Oliveira E.
Mello R.
Marques L.
Dermeval D.
Gasparini, Isabela
Bittencourt I.I.
Isotani S.
author_role author
author2 Pereira F.
Santos J.
Oliveira E.
Mello R.
Marques L.
Dermeval D.
Gasparini, Isabela
Bittencourt I.I.
Isotani S.
author2_role author
author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Rodrigues L.
Pereira F.
Santos J.
Oliveira E.
Mello R.
Marques L.
Dermeval D.
Gasparini, Isabela
Bittencourt I.I.
Isotani S.
description © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.Evidence-based learning strategies, such as the testing effect, might help address the achievement gap. However, exploiting the testing effect depends on having a set of instructional activities with fine-grained tagging. While instructors might find questions in textbooks, they often lack fine-grained tagging, and data labeling is laborious. Despite much research on text classification, to our best knowledge, state-of-the-art question classifiers are mostly based on extensive models (i.e., BERT) and English text. Respectively, those are incompatible with the resource-constrained devices (e.g., mobile) and languages (e.g., Portuguese) of many underprivileged countries in the global south. Therefore, we developed a question classifier on top of DistilBERT, a version of BERT compatible with resource-constrained applications, using grid search and hold-out. Based on a corpus of 1045 coding questions written in Brazilian Portuguese, we found a model that achieved a near-perfect performance on unseen data, similar to last-generation results using BERT for English text. Thus, we present a step towards equitable education by i) providing underprivileged Portuguese-speaking countries with the support that enables opportunities already available for first-world countries and ii) demonstrating the feasibility of creating resource-constrained applications compatible with state-of-the-art AIED systems.
publishDate 2023
dc.date.none.fl_str_mv 2023
2024-12-05T16:27:57Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv 1865-0937
10.1007/978-3-031-36336-8_113
https://repositorio.udesc.br/handle/UDESC/2610
identifier_str_mv 1865-0937
10.1007/978-3-031-36336-8_113
url https://repositorio.udesc.br/handle/UDESC/2610
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Communications in Computer and Information Science
1831 CCIS
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv p. 734 - 740
dc.source.none.fl_str_mv reponame:Repositório Institucional da Udesc
instname:Universidade do Estado de Santa Catarina (UDESC)
instacron:UDESC
instname_str Universidade do Estado de Santa Catarina (UDESC)
instacron_str UDESC
institution UDESC
reponame_str Repositório Institucional da Udesc
collection Repositório Institucional da Udesc
repository.name.fl_str_mv Repositório Institucional da Udesc - Universidade do Estado de Santa Catarina (UDESC)
repository.mail.fl_str_mv ri@udesc.br
_version_ 1848168395904647168