INTERFACES ENTRE LEXICOGRAFIA EDIALETOLOGIA: POR UM PROTÓTIPO DEVOCABULÁRIO DIALETAL ELETRÔNICO DAREGIÃO NORTE DO BRASIL

Detalhes bibliográficos
Ano de defesa: 2023
Autor(a) principal: Jorge Luiz Nunes dos Santos Junior
Orientador(a): Aparecida Negri Isquerdo
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Fundação Universidade Federal de Mato Grosso do Sul
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Brasil
Palavras-chave em Português:
Link de acesso: https://repositorio.ufms.br/handle/123456789/5845
Resumo: This Thesis is located at the intersection of Lexicography with Dialectology and Computational Linguistics and arises from the need to contribute to filling the gap related to the lexicographical and electronic treatment of the dialectal data from the Atlas Linguístico do Brasil (AliB) Project, documented in 18 locations which compose the network of the ALiB Project points, at the interior from Brazil Northern region and, by extension, collaborate with the Brazilian Dialectal Dictionary (MACHADO FILHO, 2010) which aims to produce a dialect dictionary from ALiB data. The main study objective is to produce and to present a digital lexicographical prototype developed from ALiB Project dialectal corpus. Thus, the final product of this Thesis is characterized by a set of electronic tools which compose the prototype of the Dialectal Vocabulary from the Brazil Northern region (VoDiNorte), available at the link <http://vodinorte.bombadil.fr/>. These tools are capable of retrieving structured dialectal information from an XML (Extensible Markup Language) database and present it to the user in a lexicographic organization. These tools are classified into two groups: i) prototypical tools - designed for the common and intermediate user and resembling conventional electronic lexicographic products, and ii) an advanced search engine in the research corpus, aimed at the advanced user, which is innovative from the perspective of Electronic Lexicography. Furthermore, the Thesis presents the methodology employed to encourage linguistics researchers to rethink the outsourcing of services related to low-complexity software programming, opening up space to the exploration of a new horizon in which the scholar can invest in specific knowledge to develop their own computational tools, that is, the creation of computerized solutions that meet specific objectives of a particular scientific research. In summary, this Thesis demonstrates not only the functionalities of the VoDiNorte prototype as a product but also extends an invitation to researchers in Linguistics related areas to consider the possibility of creating their own research corpora in XML, experimenting with ways of manipulating data and retrieving information in an automated and personalized manner, similar to the VoDiNorte prototype, which can be seen, from a technical standpoint, as a web application that functions from an XML database. Further the database of this Thesis can serve as a basis to another products and/or integrate another projects such as the Dicionário Dialetal Brasileiro (MACHADO FILHO, 2010), given that XML is a compatible and easy to use language alongside other computerized systems. We highlight that this research is based on Lexicography (TARP, 2019; 2018; 2011; 2008); (FUERTES-OLIVERA; BERGENHOLTZ, 2015); (FUERTES-OLIVERA; TARP, 2014); (LEROYER, 2011); (PORTO DAPENA, 2004), on Dialectology (CARDOSO, 2010); (RADTKE; THUN, 1996); (CHAMBERS; TRUDGILL, 1994); on Corpus Linguistics (McENERY; HARDIE, 2012); (O'KEEFFE; MCCARTHY, 2010); (BERBER SARDINHA, 2004); (SINCLAIR, 1991) and on Computational Linguistics (KEDIA; RASU, 2020); (SRINIVASA-DESIKAN, 2018); (HARERT, 2004; 2005). Finally, the research demonstrated that creating XML databases for lexical data provides researchers with subsidies to the development of computational tools that can be useful to linguistic studies, in addition to enabling the creation of digital products from data of a scientific research. Keywords: ALiB Project; Lexicography; Dialectology; XML database; computational tools.