Termos de (onco)mastologia: uma abordagem mediada por corpus

Detalhes bibliográficos
Ano de defesa: 2011
Autor(a) principal: Teixeira, Rosana de Barros Silva e lattes
Orientador(a): Sardinha, Antonio Paulo Berber
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Pontifícia Universidade Católica de São Paulo
Programa de Pós-Graduação: Programa de Estudos Pós-Graduados em Linguística Aplicada e Estudos da Linguagem
Departamento: Lingüística
País: BR
Palavras-chave em Português:
Palavras-chave em Inglês:
Área do conhecimento CNPq:
Link de acesso: https://tede2.pucsp.br/handle/handle/13496
Resumo: Limited to the research field of Applied Linguistics, articulating area of multiple domains of knowledge, this research, by adding the theoretical and methodological basis of Terminology-communicational language (Communicative Theory of Terminology CTT) and Corpus Linguistics, has the purpose of achieving two goals. The first objective aims to organize a monolingual glossary (same title of the research) designed to scientific journalists. The glossary s purpose is to help these professionals make the scientific terminology understood by non-scientific ones. This initiative is based on the fact that breast cancer causes the most deaths among women in Brazil each year, about 22% of new cases are diagnosed according to Health Institute. In order to get language in use, Corpus Linguistics has been chosen to go to that specialty language by observing empirical data, i.e., in vivo perspective, from a corpus of 563,482 words, according to WordSmith Tools 3.0. To do so, taking into consideration computer softwares available to corpus text, I have decided as a second objective to check the achievement accurancy of four tools (Corpógrafo 4.0, WordSmith Tools 3.0, e-Terms and ZExtractor) in relation to index of positive-candidates (terms). As pointed data, Corpógrafo 4.0 leads this ranking, with 27.56% of accurancy, followed respectively by ZExtractor (26.05%), WordSmith Tools 3.0 (21.77%) and e-Terms (14.44%). In order to make it feasible, it was developed a methodology based on the usage of Microsoft Office Excel 2007 to filter the common candidates extracted among all tools and exclusive ones of each. This data cutting, besides offering support to results achievement, provided the recognition of this methodology as a possible resource in terms of optimizing the extraction of terminology groups, starting from processed lists by two or more programs, since all of them are limited. In this way, 237 terms obtained by unigrams were listed, among which 104 were elected to head the entries that are more relevant in terms of conception