Probabilistic synSet based concept location

Bibliographic Details
Main Author: Carvalho, Nuno Ramos
Publication Date: 2012
Other Authors: Almeida, José João, Pereira, Maria João, Henriques, Pedro Rangel
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10198/7062
Summary: Concept location is a common task in program comprehension techniques, essential in many approaches used for software care and software evolution. An important goal of this process is to discover a mapping between source code and human oriented concepts. Although programs are written in a strict and formal language, natural language terms and sentences like identifiers (variables or functions names), constant strings or comments, can still be found embedded in programs. Using terminology concepts and natural language processing techniques these terms can be exploited to discover clues about which real world concepts source code is addressing. This work extends symbol tables build by compilers with ontology driven constructs, extends synonym sets defined by linguistics, with automatically created Probabilistic SynSets from software domain parallel corpora. And using a relational algebra, creates semantic bridges between program elements and human oriented concepts, to enhance concept location tasks.
id RCAP_a89e07bab35d89583f780b2bc1158174
oai_identifier_str oai:bibliotecadigital.ipb.pt:10198/7062
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Probabilistic synSet based concept locationProgram comprehensionConcept locationIdentifier analysisProgram and problem domainsConcept location is a common task in program comprehension techniques, essential in many approaches used for software care and software evolution. An important goal of this process is to discover a mapping between source code and human oriented concepts. Although programs are written in a strict and formal language, natural language terms and sentences like identifiers (variables or functions names), constant strings or comments, can still be found embedded in programs. Using terminology concepts and natural language processing techniques these terms can be exploited to discover clues about which real world concepts source code is addressing. This work extends symbol tables build by compilers with ontology driven constructs, extends synonym sets defined by linguistics, with automatically created Probabilistic SynSets from software domain parallel corpora. And using a relational algebra, creates semantic bridges between program elements and human oriented concepts, to enhance concept location tasks.CCTC - Universidade do MinhoAlberto Simões and Ricardo Queirós and Daniela da CruzBiblioteca Digital do IPBCarvalho, Nuno RamosAlmeida, José JoãoPereira, Maria JoãoHenriques, Pedro Rangel2012-06-25T13:15:06Z20122012-01-01T00:00:00Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10198/7062engCarvalho, Nuno; Almeida, José João; Pereira, Maria João; Henriques, Pedro (2012). Probabilistic synSet based concept location. In SLATe'12 Symposium on Languages, Applications and Technologies. Universidade do Minho, Portugal. p. 239-253. ISBN 978-3-939879-40-8978-3-939879-40-8info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-25T11:59:10Zoai:bibliotecadigital.ipb.pt:10198/7062Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T11:22:36.535511Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Probabilistic synSet based concept location
title Probabilistic synSet based concept location
spellingShingle Probabilistic synSet based concept location
Carvalho, Nuno Ramos
Program comprehension
Concept location
Identifier analysis
Program and problem domains
title_short Probabilistic synSet based concept location
title_full Probabilistic synSet based concept location
title_fullStr Probabilistic synSet based concept location
title_full_unstemmed Probabilistic synSet based concept location
title_sort Probabilistic synSet based concept location
author Carvalho, Nuno Ramos
author_facet Carvalho, Nuno Ramos
Almeida, José João
Pereira, Maria João
Henriques, Pedro Rangel
author_role author
author2 Almeida, José João
Pereira, Maria João
Henriques, Pedro Rangel
author2_role author
author
author
dc.contributor.none.fl_str_mv Biblioteca Digital do IPB
dc.contributor.author.fl_str_mv Carvalho, Nuno Ramos
Almeida, José João
Pereira, Maria João
Henriques, Pedro Rangel
dc.subject.por.fl_str_mv Program comprehension
Concept location
Identifier analysis
Program and problem domains
topic Program comprehension
Concept location
Identifier analysis
Program and problem domains
description Concept location is a common task in program comprehension techniques, essential in many approaches used for software care and software evolution. An important goal of this process is to discover a mapping between source code and human oriented concepts. Although programs are written in a strict and formal language, natural language terms and sentences like identifiers (variables or functions names), constant strings or comments, can still be found embedded in programs. Using terminology concepts and natural language processing techniques these terms can be exploited to discover clues about which real world concepts source code is addressing. This work extends symbol tables build by compilers with ontology driven constructs, extends synonym sets defined by linguistics, with automatically created Probabilistic SynSets from software domain parallel corpora. And using a relational algebra, creates semantic bridges between program elements and human oriented concepts, to enhance concept location tasks.
publishDate 2012
dc.date.none.fl_str_mv 2012-06-25T13:15:06Z
2012
2012-01-01T00:00:00Z
dc.type.driver.fl_str_mv conference object
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10198/7062
url http://hdl.handle.net/10198/7062
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Carvalho, Nuno; Almeida, José João; Pereira, Maria João; Henriques, Pedro (2012). Probabilistic synSet based concept location. In SLATe'12 Symposium on Languages, Applications and Technologies. Universidade do Minho, Portugal. p. 239-253. ISBN 978-3-939879-40-8
978-3-939879-40-8
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Alberto Simões and Ricardo Queirós and Daniela da Cruz
publisher.none.fl_str_mv Alberto Simões and Ricardo Queirós and Daniela da Cruz
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833591853993164800