Lexical semantics annotation for enriched Portuguese corpora

Bibliographic Details
Main Author: Neale, Steven
Publication Date: 2016
Other Authors: Valadas, Rita, Silva, João, Branco, António
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10451/33110
Summary: The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese.
id RCAP_390920eecdc0e62e94c0afab34be0fc9
oai_identifier_str oai:repositorio.ulisboa.pt:10451/33110
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Lexical semantics annotation for enriched Portuguese corporaAnnotated corporaLexical semanticsWord sensesNamed entitiesPortugueseThe semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese.SpringerRepositório da Universidade de LisboaNeale, StevenValadas, RitaSilva, JoãoBranco, António2018-05-04T10:35:15Z20162016-01-01T00:00:00Zbook partinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10451/33110engNeale, S., Rita Valadas Pereira, João Silva, & António Branco. "Lexical semantics annotation for enriched Portuguese corpora". In Lecture Notes in Artificial Intelligence, 9727, Berlim: Springer, pp. 296-305.10.1007/978-3-319-41552-9 30info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-17T13:52:59Zoai:repositorio.ulisboa.pt:10451/33110Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T02:56:55.494324Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Lexical semantics annotation for enriched Portuguese corpora
title Lexical semantics annotation for enriched Portuguese corpora
spellingShingle Lexical semantics annotation for enriched Portuguese corpora
Neale, Steven
Annotated corpora
Lexical semantics
Word senses
Named entities
Portuguese
title_short Lexical semantics annotation for enriched Portuguese corpora
title_full Lexical semantics annotation for enriched Portuguese corpora
title_fullStr Lexical semantics annotation for enriched Portuguese corpora
title_full_unstemmed Lexical semantics annotation for enriched Portuguese corpora
title_sort Lexical semantics annotation for enriched Portuguese corpora
author Neale, Steven
author_facet Neale, Steven
Valadas, Rita
Silva, João
Branco, António
author_role author
author2 Valadas, Rita
Silva, João
Branco, António
author2_role author
author
author
dc.contributor.none.fl_str_mv Repositório da Universidade de Lisboa
dc.contributor.author.fl_str_mv Neale, Steven
Valadas, Rita
Silva, João
Branco, António
dc.subject.por.fl_str_mv Annotated corpora
Lexical semantics
Word senses
Named entities
Portuguese
topic Annotated corpora
Lexical semantics
Word senses
Named entities
Portuguese
description The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese.
publishDate 2016
dc.date.none.fl_str_mv 2016
2016-01-01T00:00:00Z
2018-05-04T10:35:15Z
dc.type.driver.fl_str_mv book part
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10451/33110
url http://hdl.handle.net/10451/33110
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Neale, S., Rita Valadas Pereira, João Silva, & António Branco. "Lexical semantics annotation for enriched Portuguese corpora". In Lecture Notes in Artificial Intelligence, 9727, Berlim: Springer, pp. 296-305.
10.1007/978-3-319-41552-9 30
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Springer
publisher.none.fl_str_mv Springer
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833601540980473857