Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA

Bibliographic Details
Main Author: Camilla Oliveira da Silveira
Publication Date: 2022
Format: Master thesis
Language: por
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://hdl.handle.net/10216/143034
Summary: The Portuguese National Archives (Arquivo Nacional da Torre do Tombo) is an example of an institution that maintains a collection of great relevance to the Portuguese national memory. It holds original documents from the 9th century to the present day, many already digitized and available to be searched by researchers and the general public. The archive has been adapting to embrace an open perspective in which archival data is linked to global information sources, following a worldwide trend of making information from cultural heritage repositories more accessible. This work analyzed a sample of 25 records from the Portuguese National Archives, chosen by archival specialists as representative of different fonds and description levels, to identify entities and properties and explore relationships with other non-archival resources. The goal is to provide additional information to the archival interfaces. After selecting the entities in the records, Wikidata, DBpedia and Europeana databases were chosen for data enrichment and linking. The analysis of the entities in the records provided hundreds of properties directly related to classes in the corresponding databases. A more significant set of properties was obtained by selecting properties that had more data entered into the databases for entities of the corresponding classes. These entities and properties were then subjected to tests using Wikidata Query Service, DBpedia SPARQL Explorer and the Europeana Virtuoso SPARQL Query Editor. The queries performed with the entities in Wikidata Query Service and DBpedia SPARQL Explorer only provided results for certain properties, since not all entities have data in the database. On the other hand, the properties tested proved to enable relevant data about the entities, providing additional information about the entities they are connected with. However, with Europeana the entity searches did not prove productive, and it was not possible to retrieve information about entity properties as performed in the previous databases. Europeana is a great aggregator of metadata, however its metadata is still in the process of being converted into linked open data. Finally, further studies on repositories are needed in order to investigate the feasibility of exploring the links between archival resources and information in large open archives. Keywords: archives, linked open data, semantic web
id RCAP_4fea5f30513d0a555c62d91fd1976b4d
oai_identifier_str oai:repositorio-aberto.up.pt:10216/143034
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISACiências da comunicaçãoMedia and communicationsThe Portuguese National Archives (Arquivo Nacional da Torre do Tombo) is an example of an institution that maintains a collection of great relevance to the Portuguese national memory. It holds original documents from the 9th century to the present day, many already digitized and available to be searched by researchers and the general public. The archive has been adapting to embrace an open perspective in which archival data is linked to global information sources, following a worldwide trend of making information from cultural heritage repositories more accessible. This work analyzed a sample of 25 records from the Portuguese National Archives, chosen by archival specialists as representative of different fonds and description levels, to identify entities and properties and explore relationships with other non-archival resources. The goal is to provide additional information to the archival interfaces. After selecting the entities in the records, Wikidata, DBpedia and Europeana databases were chosen for data enrichment and linking. The analysis of the entities in the records provided hundreds of properties directly related to classes in the corresponding databases. A more significant set of properties was obtained by selecting properties that had more data entered into the databases for entities of the corresponding classes. These entities and properties were then subjected to tests using Wikidata Query Service, DBpedia SPARQL Explorer and the Europeana Virtuoso SPARQL Query Editor. The queries performed with the entities in Wikidata Query Service and DBpedia SPARQL Explorer only provided results for certain properties, since not all entities have data in the database. On the other hand, the properties tested proved to enable relevant data about the entities, providing additional information about the entities they are connected with. However, with Europeana the entity searches did not prove productive, and it was not possible to retrieve information about entity properties as performed in the previous databases. Europeana is a great aggregator of metadata, however its metadata is still in the process of being converted into linked open data. Finally, further studies on repositories are needed in order to investigate the feasibility of exploring the links between archival resources and information in large open archives. Keywords: archives, linked open data, semantic web2022-07-192022-07-19T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/143034TID:203164105porCamilla Oliveira da Silveirainfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-27T18:16:37Zoai:repositorio-aberto.up.pt:10216/143034Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T22:43:41.702707Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
title Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
spellingShingle Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
Camilla Oliveira da Silveira
Ciências da comunicação
Media and communications
title_short Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
title_full Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
title_fullStr Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
title_full_unstemmed Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
title_sort Entidades Em Documentos De Arquivo E Sua Expansão Com Fontes De Dados Ligados No Projeto EPISA
author Camilla Oliveira da Silveira
author_facet Camilla Oliveira da Silveira
author_role author
dc.contributor.author.fl_str_mv Camilla Oliveira da Silveira
dc.subject.por.fl_str_mv Ciências da comunicação
Media and communications
topic Ciências da comunicação
Media and communications
description The Portuguese National Archives (Arquivo Nacional da Torre do Tombo) is an example of an institution that maintains a collection of great relevance to the Portuguese national memory. It holds original documents from the 9th century to the present day, many already digitized and available to be searched by researchers and the general public. The archive has been adapting to embrace an open perspective in which archival data is linked to global information sources, following a worldwide trend of making information from cultural heritage repositories more accessible. This work analyzed a sample of 25 records from the Portuguese National Archives, chosen by archival specialists as representative of different fonds and description levels, to identify entities and properties and explore relationships with other non-archival resources. The goal is to provide additional information to the archival interfaces. After selecting the entities in the records, Wikidata, DBpedia and Europeana databases were chosen for data enrichment and linking. The analysis of the entities in the records provided hundreds of properties directly related to classes in the corresponding databases. A more significant set of properties was obtained by selecting properties that had more data entered into the databases for entities of the corresponding classes. These entities and properties were then subjected to tests using Wikidata Query Service, DBpedia SPARQL Explorer and the Europeana Virtuoso SPARQL Query Editor. The queries performed with the entities in Wikidata Query Service and DBpedia SPARQL Explorer only provided results for certain properties, since not all entities have data in the database. On the other hand, the properties tested proved to enable relevant data about the entities, providing additional information about the entities they are connected with. However, with Europeana the entity searches did not prove productive, and it was not possible to retrieve information about entity properties as performed in the previous databases. Europeana is a great aggregator of metadata, however its metadata is still in the process of being converted into linked open data. Finally, further studies on repositories are needed in order to investigate the feasibility of exploring the links between archival resources and information in large open archives. Keywords: archives, linked open data, semantic web
publishDate 2022
dc.date.none.fl_str_mv 2022-07-19
2022-07-19T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/143034
TID:203164105
url https://hdl.handle.net/10216/143034
identifier_str_mv TID:203164105
dc.language.iso.fl_str_mv por
language por
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833599832719097857