Lexical semantics annotation for enriched Portuguese corpora
Main Author: | |
---|---|
Publication Date: | 2016 |
Other Authors: | , , |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/10451/33110 |
Summary: | The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese. |
id |
RCAP_390920eecdc0e62e94c0afab34be0fc9 |
---|---|
oai_identifier_str |
oai:repositorio.ulisboa.pt:10451/33110 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Lexical semantics annotation for enriched Portuguese corporaAnnotated corporaLexical semanticsWord sensesNamed entitiesPortugueseThe semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese.SpringerRepositório da Universidade de LisboaNeale, StevenValadas, RitaSilva, JoãoBranco, António2018-05-04T10:35:15Z20162016-01-01T00:00:00Zbook partinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10451/33110engNeale, S., Rita Valadas Pereira, João Silva, & António Branco. "Lexical semantics annotation for enriched Portuguese corpora". In Lecture Notes in Artificial Intelligence, 9727, Berlim: Springer, pp. 296-305.10.1007/978-3-319-41552-9 30info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-17T13:52:59Zoai:repositorio.ulisboa.pt:10451/33110Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T02:56:55.494324Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Lexical semantics annotation for enriched Portuguese corpora |
title |
Lexical semantics annotation for enriched Portuguese corpora |
spellingShingle |
Lexical semantics annotation for enriched Portuguese corpora Neale, Steven Annotated corpora Lexical semantics Word senses Named entities Portuguese |
title_short |
Lexical semantics annotation for enriched Portuguese corpora |
title_full |
Lexical semantics annotation for enriched Portuguese corpora |
title_fullStr |
Lexical semantics annotation for enriched Portuguese corpora |
title_full_unstemmed |
Lexical semantics annotation for enriched Portuguese corpora |
title_sort |
Lexical semantics annotation for enriched Portuguese corpora |
author |
Neale, Steven |
author_facet |
Neale, Steven Valadas, Rita Silva, João Branco, António |
author_role |
author |
author2 |
Valadas, Rita Silva, João Branco, António |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Repositório da Universidade de Lisboa |
dc.contributor.author.fl_str_mv |
Neale, Steven Valadas, Rita Silva, João Branco, António |
dc.subject.por.fl_str_mv |
Annotated corpora Lexical semantics Word senses Named entities Portuguese |
topic |
Annotated corpora Lexical semantics Word senses Named entities Portuguese |
description |
The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016 2016-01-01T00:00:00Z 2018-05-04T10:35:15Z |
dc.type.driver.fl_str_mv |
book part |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10451/33110 |
url |
http://hdl.handle.net/10451/33110 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Neale, S., Rita Valadas Pereira, João Silva, & António Branco. "Lexical semantics annotation for enriched Portuguese corpora". In Lecture Notes in Artificial Intelligence, 9727, Berlim: Springer, pp. 296-305. 10.1007/978-3-319-41552-9 30 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Springer |
publisher.none.fl_str_mv |
Springer |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833601540980473857 |