Named entity annotation of an 18th-century transcribed corpus: problems and challenges
Main Author: | |
---|---|
Publication Date: | 2022 |
Other Authors: | , , |
Format: | Article |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/10174/32163 |
Summary: | This paper reviews a stage of the process of annotating named entities in 18th-century texts to enrich historical research sources and link them to other bases. The categories in question are person, location and organisation, valid categories for historian analysis. We discuss the difficulties observed in the process and point eventual solutions. |
id |
RCAP_596a6c77111e02e900c0bde6ad9fd4a7 |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/32163 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Named entity annotation of an 18th-century transcribed corpus: problems and challengesDigital HumanitiesNamed Entity RecognitionThis paper reviews a stage of the process of annotating named entities in 18th-century texts to enrich historical research sources and link them to other bases. The categories in question are person, location and organisation, valid categories for historian analysis. We discuss the difficulties observed in the process and point eventual solutions.Partially supported by the Portuguese Foundation FCT, under the projects CEECIND/01997/2017 and UIDB/00057/2020.CEUR2022-06-02T10:56:03Z2022-06-022022-04-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/32163http://hdl.handle.net/10174/32163engCameron, H.F., Olival, F., Vieira, R., Neto, J.F.S.: Named entity annotation of an 18th century transcribed corpus: problems, challenges. In: Proceedings of the Second Workshop on Digital Humanities and Natural Language Processing (2nd DHandNLP 2022), co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2022), Virtual Event, Fortaleza, Brazil, 21st March, 2022. CEUR Workshop Proceedings, vol. 3128, pp. 18–25. http://ceur-ws.org/Vol-3128/paper8.pdhttp://ceur-ws.org/Vol-3128/paper8.pdfhelenafc@uevora.ptmfo@uevora.ptrenatav@uevora.ptnd299Cameron, HelenaOlival, FernandaVieira, RenataSantos, Joaquiminfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-01-03T19:32:27Zoai:dspace.uevora.pt:10174/32163Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T12:27:10.299882Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
spellingShingle |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges Cameron, Helena Digital Humanities Named Entity Recognition |
title_short |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_full |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_fullStr |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_full_unstemmed |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_sort |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
author |
Cameron, Helena |
author_facet |
Cameron, Helena Olival, Fernanda Vieira, Renata Santos, Joaquim |
author_role |
author |
author2 |
Olival, Fernanda Vieira, Renata Santos, Joaquim |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Cameron, Helena Olival, Fernanda Vieira, Renata Santos, Joaquim |
dc.subject.por.fl_str_mv |
Digital Humanities Named Entity Recognition |
topic |
Digital Humanities Named Entity Recognition |
description |
This paper reviews a stage of the process of annotating named entities in 18th-century texts to enrich historical research sources and link them to other bases. The categories in question are person, location and organisation, valid categories for historian analysis. We discuss the difficulties observed in the process and point eventual solutions. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-06-02T10:56:03Z 2022-06-02 2022-04-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/32163 http://hdl.handle.net/10174/32163 |
url |
http://hdl.handle.net/10174/32163 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Cameron, H.F., Olival, F., Vieira, R., Neto, J.F.S.: Named entity annotation of an 18th century transcribed corpus: problems, challenges. In: Proceedings of the Second Workshop on Digital Humanities and Natural Language Processing (2nd DHandNLP 2022), co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2022), Virtual Event, Fortaleza, Brazil, 21st March, 2022. CEUR Workshop Proceedings, vol. 3128, pp. 18–25. http://ceur-ws.org/Vol-3128/paper8.pd http://ceur-ws.org/Vol-3128/paper8.pdf helenafc@uevora.pt mfo@uevora.pt renatav@uevora.pt nd 299 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
CEUR |
publisher.none.fl_str_mv |
CEUR |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833592828839591936 |