Question Classification with Constrained Resources: A Study with Coding Exercises
| Main Author: | |
|---|---|
| Publication Date: | 2023 |
| Other Authors: | , , , , , , , , |
| Format: | Conference object |
| Language: | eng |
| Source: | Repositório Institucional da Udesc |
| Download full: | https://repositorio.udesc.br/handle/UDESC/2610 |
Summary: | © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.Evidence-based learning strategies, such as the testing effect, might help address the achievement gap. However, exploiting the testing effect depends on having a set of instructional activities with fine-grained tagging. While instructors might find questions in textbooks, they often lack fine-grained tagging, and data labeling is laborious. Despite much research on text classification, to our best knowledge, state-of-the-art question classifiers are mostly based on extensive models (i.e., BERT) and English text. Respectively, those are incompatible with the resource-constrained devices (e.g., mobile) and languages (e.g., Portuguese) of many underprivileged countries in the global south. Therefore, we developed a question classifier on top of DistilBERT, a version of BERT compatible with resource-constrained applications, using grid search and hold-out. Based on a corpus of 1045 coding questions written in Brazilian Portuguese, we found a model that achieved a near-perfect performance on unseen data, similar to last-generation results using BERT for English text. Thus, we present a step towards equitable education by i) providing underprivileged Portuguese-speaking countries with the support that enables opportunities already available for first-world countries and ii) demonstrating the feasibility of creating resource-constrained applications compatible with state-of-the-art AIED systems. |
| id |
UDESC-2_da630b49328f886c76d8d1b1313b501c |
|---|---|
| oai_identifier_str |
oai:repositorio.udesc.br:UDESC/2610 |
| network_acronym_str |
UDESC-2 |
| network_name_str |
Repositório Institucional da Udesc |
| repository_id_str |
6391 |
| spelling |
Question Classification with Constrained Resources: A Study with Coding Exercises© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.Evidence-based learning strategies, such as the testing effect, might help address the achievement gap. However, exploiting the testing effect depends on having a set of instructional activities with fine-grained tagging. While instructors might find questions in textbooks, they often lack fine-grained tagging, and data labeling is laborious. Despite much research on text classification, to our best knowledge, state-of-the-art question classifiers are mostly based on extensive models (i.e., BERT) and English text. Respectively, those are incompatible with the resource-constrained devices (e.g., mobile) and languages (e.g., Portuguese) of many underprivileged countries in the global south. Therefore, we developed a question classifier on top of DistilBERT, a version of BERT compatible with resource-constrained applications, using grid search and hold-out. Based on a corpus of 1045 coding questions written in Brazilian Portuguese, we found a model that achieved a near-perfect performance on unseen data, similar to last-generation results using BERT for English text. Thus, we present a step towards equitable education by i) providing underprivileged Portuguese-speaking countries with the support that enables opportunities already available for first-world countries and ii) demonstrating the feasibility of creating resource-constrained applications compatible with state-of-the-art AIED systems.2024-12-05T16:27:57Z2023info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjectp. 734 - 7401865-093710.1007/978-3-031-36336-8_113https://repositorio.udesc.br/handle/UDESC/2610Communications in Computer and Information Science1831 CCISRodrigues L.Pereira F.Santos J.Oliveira E.Mello R.Marques L.Dermeval D.Gasparini, IsabelaBittencourt I.I.Isotani S.engreponame:Repositório Institucional da Udescinstname:Universidade do Estado de Santa Catarina (UDESC)instacron:UDESCinfo:eu-repo/semantics/openAccess2024-12-07T20:39:16Zoai:repositorio.udesc.br:UDESC/2610Biblioteca Digital de Teses e Dissertaçõeshttps://pergamumweb.udesc.br/biblioteca/index.phpPRIhttps://repositorio-api.udesc.br/server/oai/requestri@udesc.bropendoar:63912024-12-07T20:39:16Repositório Institucional da Udesc - Universidade do Estado de Santa Catarina (UDESC)false |
| dc.title.none.fl_str_mv |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| title |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| spellingShingle |
Question Classification with Constrained Resources: A Study with Coding Exercises Rodrigues L. |
| title_short |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| title_full |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| title_fullStr |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| title_full_unstemmed |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| title_sort |
Question Classification with Constrained Resources: A Study with Coding Exercises |
| author |
Rodrigues L. |
| author_facet |
Rodrigues L. Pereira F. Santos J. Oliveira E. Mello R. Marques L. Dermeval D. Gasparini, Isabela Bittencourt I.I. Isotani S. |
| author_role |
author |
| author2 |
Pereira F. Santos J. Oliveira E. Mello R. Marques L. Dermeval D. Gasparini, Isabela Bittencourt I.I. Isotani S. |
| author2_role |
author author author author author author author author author |
| dc.contributor.author.fl_str_mv |
Rodrigues L. Pereira F. Santos J. Oliveira E. Mello R. Marques L. Dermeval D. Gasparini, Isabela Bittencourt I.I. Isotani S. |
| description |
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.Evidence-based learning strategies, such as the testing effect, might help address the achievement gap. However, exploiting the testing effect depends on having a set of instructional activities with fine-grained tagging. While instructors might find questions in textbooks, they often lack fine-grained tagging, and data labeling is laborious. Despite much research on text classification, to our best knowledge, state-of-the-art question classifiers are mostly based on extensive models (i.e., BERT) and English text. Respectively, those are incompatible with the resource-constrained devices (e.g., mobile) and languages (e.g., Portuguese) of many underprivileged countries in the global south. Therefore, we developed a question classifier on top of DistilBERT, a version of BERT compatible with resource-constrained applications, using grid search and hold-out. Based on a corpus of 1045 coding questions written in Brazilian Portuguese, we found a model that achieved a near-perfect performance on unseen data, similar to last-generation results using BERT for English text. Thus, we present a step towards equitable education by i) providing underprivileged Portuguese-speaking countries with the support that enables opportunities already available for first-world countries and ii) demonstrating the feasibility of creating resource-constrained applications compatible with state-of-the-art AIED systems. |
| publishDate |
2023 |
| dc.date.none.fl_str_mv |
2023 2024-12-05T16:27:57Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
| format |
conferenceObject |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
1865-0937 10.1007/978-3-031-36336-8_113 https://repositorio.udesc.br/handle/UDESC/2610 |
| identifier_str_mv |
1865-0937 10.1007/978-3-031-36336-8_113 |
| url |
https://repositorio.udesc.br/handle/UDESC/2610 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
Communications in Computer and Information Science 1831 CCIS |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
p. 734 - 740 |
| dc.source.none.fl_str_mv |
reponame:Repositório Institucional da Udesc instname:Universidade do Estado de Santa Catarina (UDESC) instacron:UDESC |
| instname_str |
Universidade do Estado de Santa Catarina (UDESC) |
| instacron_str |
UDESC |
| institution |
UDESC |
| reponame_str |
Repositório Institucional da Udesc |
| collection |
Repositório Institucional da Udesc |
| repository.name.fl_str_mv |
Repositório Institucional da Udesc - Universidade do Estado de Santa Catarina (UDESC) |
| repository.mail.fl_str_mv |
ri@udesc.br |
| _version_ |
1848168395904647168 |