Annotation of Named Entities in the Gaming domain
Main Author: | |
---|---|
Publication Date: | 2022 |
Other Authors: | , |
Format: | Article |
Language: | por |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | https://doi.org/10.26334/2183-9077/rapln9ano2022a15 |
Summary: | This paper aims to analyse the effects of including gaming entities in the performance of the NER system, for the English language and in a machine translation industrial context of customer support content. To identify and classify gaming entities (by the Named Entity Recognition (NER) model), three new categories were created and added to the already used annotation typology: GAME NAME, GAME FEATURE and GAME CURRENCY. A set of reference annotations (gold standard) was also developed, allowing not only the training of the NER system but also the evaluation of its performance and accuracy in a more objective way, namely by counting the number of entities that the system identifies and categorises correctly. In the scope of this work, 6618 sentences from 7 gaming clients were manually annotated, constituting the gold standard which was then used to train and evaluate the NER system. The objective of the experiments was to assess whether the existing NER system improved its performance when trained with the gold standard created specifically for the gaming domain and if it could handle the new gaming categories added to the typology by identifying and categorizing them correctly. The results of both experiments were auspicious and positive, demonstrating the relevance of greater investment in domain-specific entity recognition, namely in the context of customer service text processing. |
id |
RCAP_c3cc1e9cfe0724ed36895dd5e69b5486 |
---|---|
oai_identifier_str |
oai:ojs3.ojs.apl.pt:article/149 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Annotation of Named Entities in the Gaming domainAnotação de Entidades Mencionadas na área do GamingEntidades MencionadasReconhecimento de Entidades MencionadasAnotaçãoNamed EntitiesNamed Entity RecognitionAnnotationGamingThis paper aims to analyse the effects of including gaming entities in the performance of the NER system, for the English language and in a machine translation industrial context of customer support content. To identify and classify gaming entities (by the Named Entity Recognition (NER) model), three new categories were created and added to the already used annotation typology: GAME NAME, GAME FEATURE and GAME CURRENCY. A set of reference annotations (gold standard) was also developed, allowing not only the training of the NER system but also the evaluation of its performance and accuracy in a more objective way, namely by counting the number of entities that the system identifies and categorises correctly. In the scope of this work, 6618 sentences from 7 gaming clients were manually annotated, constituting the gold standard which was then used to train and evaluate the NER system. The objective of the experiments was to assess whether the existing NER system improved its performance when trained with the gold standard created specifically for the gaming domain and if it could handle the new gaming categories added to the typology by identifying and categorizing them correctly. The results of both experiments were auspicious and positive, demonstrating the relevance of greater investment in domain-specific entity recognition, namely in the context of customer service text processing.Associação Portuguesa de Linguística2022-10-25info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttps://doi.org/10.26334/2183-9077/rapln9ano2022a15https://doi.org/10.26334/2183-9077/rapln9ano2022a15Revista da Associação Portuguesa de Linguística; No. 9 (2022): Journal of the Portuguese Linguistics Association; 223-235Revista da Associação Portuguesa de Linguística; N.º 9 (2022): Revista da Associação Portuguesa de Linguística; 223-2352183-907710.26334/2183-9077/rapln9ano2022reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAPporhttps://ojs.apl.pt/index.php/rapl/article/view/149https://ojs.apl.pt/index.php/rapl/article/view/149/142Direitos de Autor (c) 2022 Rita Silva, Vera Cabarrão, Sara Mendesinfo:eu-repo/semantics/openAccessSilva, RitaCabarrão, VeraMendes, Sara2023-12-02T10:18:01Zoai:ojs3.ojs.apl.pt:article/149Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T11:13:19.408379Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Annotation of Named Entities in the Gaming domain Anotação de Entidades Mencionadas na área do Gaming |
title |
Annotation of Named Entities in the Gaming domain |
spellingShingle |
Annotation of Named Entities in the Gaming domain Silva, Rita Entidades Mencionadas Reconhecimento de Entidades Mencionadas Anotação Named Entities Named Entity Recognition Annotation Gaming |
title_short |
Annotation of Named Entities in the Gaming domain |
title_full |
Annotation of Named Entities in the Gaming domain |
title_fullStr |
Annotation of Named Entities in the Gaming domain |
title_full_unstemmed |
Annotation of Named Entities in the Gaming domain |
title_sort |
Annotation of Named Entities in the Gaming domain |
author |
Silva, Rita |
author_facet |
Silva, Rita Cabarrão, Vera Mendes, Sara |
author_role |
author |
author2 |
Cabarrão, Vera Mendes, Sara |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Silva, Rita Cabarrão, Vera Mendes, Sara |
dc.subject.por.fl_str_mv |
Entidades Mencionadas Reconhecimento de Entidades Mencionadas Anotação Named Entities Named Entity Recognition Annotation Gaming |
topic |
Entidades Mencionadas Reconhecimento de Entidades Mencionadas Anotação Named Entities Named Entity Recognition Annotation Gaming |
description |
This paper aims to analyse the effects of including gaming entities in the performance of the NER system, for the English language and in a machine translation industrial context of customer support content. To identify and classify gaming entities (by the Named Entity Recognition (NER) model), three new categories were created and added to the already used annotation typology: GAME NAME, GAME FEATURE and GAME CURRENCY. A set of reference annotations (gold standard) was also developed, allowing not only the training of the NER system but also the evaluation of its performance and accuracy in a more objective way, namely by counting the number of entities that the system identifies and categorises correctly. In the scope of this work, 6618 sentences from 7 gaming clients were manually annotated, constituting the gold standard which was then used to train and evaluate the NER system. The objective of the experiments was to assess whether the existing NER system improved its performance when trained with the gold standard created specifically for the gaming domain and if it could handle the new gaming categories added to the typology by identifying and categorizing them correctly. The results of both experiments were auspicious and positive, demonstrating the relevance of greater investment in domain-specific entity recognition, namely in the context of customer service text processing. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-10-25 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://doi.org/10.26334/2183-9077/rapln9ano2022a15 https://doi.org/10.26334/2183-9077/rapln9ano2022a15 |
url |
https://doi.org/10.26334/2183-9077/rapln9ano2022a15 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.none.fl_str_mv |
https://ojs.apl.pt/index.php/rapl/article/view/149 https://ojs.apl.pt/index.php/rapl/article/view/149/142 |
dc.rights.driver.fl_str_mv |
Direitos de Autor (c) 2022 Rita Silva, Vera Cabarrão, Sara Mendes info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Direitos de Autor (c) 2022 Rita Silva, Vera Cabarrão, Sara Mendes |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Associação Portuguesa de Linguística |
publisher.none.fl_str_mv |
Associação Portuguesa de Linguística |
dc.source.none.fl_str_mv |
Revista da Associação Portuguesa de Linguística; No. 9 (2022): Journal of the Portuguese Linguistics Association; 223-235 Revista da Associação Portuguesa de Linguística; N.º 9 (2022): Revista da Associação Portuguesa de Linguística; 223-235 2183-9077 10.26334/2183-9077/rapln9ano2022 reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833591697465933824 |