Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual

Gondim, Romberg de Sá

Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual

Detalhes bibliográficos
Autor(a) principal:	Gondim, Romberg de Sá
Data de Publicação:	2023
Tipo de documento:	Trabalho de conclusão de curso
Idioma:	por
Título da fonte:	Biblioteca Digital de Teses e Dissertações da UFPB
Texto Completo:	https://repositorio.ufpb.br/jspui/handle/123456789/27186
Resumo:	This article comes from a Scientific Initiation research Project at UFPB, with the objective of creating programming scripts to collect, clean, organize and analyze the textual corpus of the research. The project stems from understanding the importance of changing political discourse as an indicator of the domestic salience of international norms, in this case, the global development agenda expressed by the Sustainable Development Goals (SDGs). Initially, the research started with the effort to build an extensive database, containing all the speeches of the Presidents of the Republic of Brazil since 1985, a milestone of the country's redemocratization. As a conceptual apparatus, salience is understood as the variation in the legitimacy of international norms in a domestic context, measured through institutional, political and discourse or rhetoric changes. Dealing with such a long period, there was a change in the way of storing the collection of the Presidency of the Republic, which resulted in the need to create specific scripts to automate the collection and processing of data in an operational way for future analysis. As a result, this work describes the technologies developed, in the form of scripts in the Python language, for the construction of an extensive textual corpus, with more than six thousand speeches, and adaptable for future collection. In addition to overcoming the lack of standardization and systematization resulting from changes in government and technologies, the result shows a fruitful approximation between International Relations and other social sciences with elements of computing. Initially thought for the specific project, the merit of this research can be used for several other analyses, in an operational way, and easily adapted to modern software. Once the database was operationalized, the pre-reading of the material led to changes in the analysis intentions, which is why analysis scripts were not continued.

Metadados do item

id	UFPB_ccadd28e5902ff4245a05d8dc288ab3c
oai_identifier_str	oai:repositorio.ufpb.br:123456789/27186
network_acronym_str	UFPB
network_name_str	Biblioteca Digital de Teses e Dissertações da UFPB
repository_id_str
spelling	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textualPesquisa qualitativaColeta de dadosTratamento de dadosAnálise de conteúdoDiscurso PolíticoRotinas de programaçãoCNPQ::OUTROS::RELACOES INTERNACIONAISThis article comes from a Scientific Initiation research Project at UFPB, with the objective of creating programming scripts to collect, clean, organize and analyze the textual corpus of the research. The project stems from understanding the importance of changing political discourse as an indicator of the domestic salience of international norms, in this case, the global development agenda expressed by the Sustainable Development Goals (SDGs). Initially, the research started with the effort to build an extensive database, containing all the speeches of the Presidents of the Republic of Brazil since 1985, a milestone of the country's redemocratization. As a conceptual apparatus, salience is understood as the variation in the legitimacy of international norms in a domestic context, measured through institutional, political and discourse or rhetoric changes. Dealing with such a long period, there was a change in the way of storing the collection of the Presidency of the Republic, which resulted in the need to create specific scripts to automate the collection and processing of data in an operational way for future analysis. As a result, this work describes the technologies developed, in the form of scripts in the Python language, for the construction of an extensive textual corpus, with more than six thousand speeches, and adaptable for future collection. In addition to overcoming the lack of standardization and systematization resulting from changes in government and technologies, the result shows a fruitful approximation between International Relations and other social sciences with elements of computing. Initially thought for the specific project, the merit of this research can be used for several other analyses, in an operational way, and easily adapted to modern software. Once the database was operationalized, the pre-reading of the material led to changes in the analysis intentions, which is why analysis scripts were not continued.O presente artigo provém de um Projeto de Iniciação Científica da UFPB, com o objetivo de criar rotinas de programação para coletar, tratar, organizar e analisar o corpus textual da pesquisa. O projeto se origina na compreensão da importância da mudança do discurso político como indicador da saliência doméstica de normas internacionais, neste caso, a agenda de desenvolvimento global expressa pelos Objetivos do Desenvolvimento Sustentável (ODS). De início, a pesquisa partiu no esforço de construir uma base de dados extensiva, contendo a totalidade dos discursos dos Presidentes da República do Brasil desde 1985, marco da redemocratização do País. Como aparato conceitual, compreende-se saliência como a variação da legitimidade das normas internacionais em um contexto doméstico, medido por meio de mudanças institucionais, políticas e de discurso, ou retóricas. Tratando sobre um período tão longo, observou-se a mudança na forma de armazenamento do acervo da Presidência da República, das quais decorreu a necessidade de criação de scripts específicos para automatizar a coleta e o tratamento dos dados de forma operacional para análises futuras. Como resultado, este trabalho descreve as tecnologias desenvolvidas, na forma de scripts na linguagem Python, para a construção de um corpus textual extenso, de mais de seis mil discursos, e adaptável para a coleta futura. Além de ultrapassar a falta de padronização e sistematização decorrentes de mudanças de governo e tecnologias, o resultado evidencia uma aproximação frutífera entre as Relações Internacionais e demais ciências sociais com elementos da computação. Inicialmente pensado para o projeto específico, o mérito desta pesquisa pode ser utilizado para diversas outras análises, de forma operacionalizável, e facilmente adequada a softwares modernos. Operacionalizada a base de dados, a pré-leitura do material motivou mudanças nas pretensões de análise, razão pela qual scripts para análise não foram continuados.Universidade Federal da ParaíbaBrasilCiências Sociais AplicadasUFPBGonçalves, Pascoal Teófilo CarvalhoGondim, Romberg de Sá2023-06-22T17:23:05Z2023-06-22T17:23:05Z2023-03-09info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesishttps://repositorio.ufpb.br/jspui/handle/123456789/27186porAttribution-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nd/3.0/br/info:eu-repo/semantics/openAccessreponame:Biblioteca Digital de Teses e Dissertações da UFPBinstname:Universidade Federal da Paraíba (UFPB)instacron:UFPB2023-06-23T06:03:47Zoai:repositorio.ufpb.br:123456789/27186Biblioteca Digital de Teses e Dissertaçõeshttps://repositorio.ufpb.br/PUBhttp://tede.biblioteca.ufpb.br:8080/oai/requestdiretoria@ufpb.br\|\| bdtd@biblioteca.ufpb.bropendoar:2023-06-23T06:03:47Biblioteca Digital de Teses e Dissertações da UFPB - Universidade Federal da Paraíba (UFPB)false
dc.title.none.fl_str_mv	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
title	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
spellingShingle	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual Gondim, Romberg de Sá Pesquisa qualitativa Coleta de dados Tratamento de dados Análise de conteúdo Discurso Político Rotinas de programação CNPQ::OUTROS::RELACOES INTERNACIONAIS
title_short	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
title_full	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
title_fullStr	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
title_full_unstemmed	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
title_sort	Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual
author	Gondim, Romberg de Sá
author_facet	Gondim, Romberg de Sá
author_role	author
dc.contributor.none.fl_str_mv	Gonçalves, Pascoal Teófilo Carvalho
dc.contributor.author.fl_str_mv	Gondim, Romberg de Sá
dc.subject.por.fl_str_mv	Pesquisa qualitativa Coleta de dados Tratamento de dados Análise de conteúdo Discurso Político Rotinas de programação CNPQ::OUTROS::RELACOES INTERNACIONAIS
topic	Pesquisa qualitativa Coleta de dados Tratamento de dados Análise de conteúdo Discurso Político Rotinas de programação CNPQ::OUTROS::RELACOES INTERNACIONAIS
description	This article comes from a Scientific Initiation research Project at UFPB, with the objective of creating programming scripts to collect, clean, organize and analyze the textual corpus of the research. The project stems from understanding the importance of changing political discourse as an indicator of the domestic salience of international norms, in this case, the global development agenda expressed by the Sustainable Development Goals (SDGs). Initially, the research started with the effort to build an extensive database, containing all the speeches of the Presidents of the Republic of Brazil since 1985, a milestone of the country's redemocratization. As a conceptual apparatus, salience is understood as the variation in the legitimacy of international norms in a domestic context, measured through institutional, political and discourse or rhetoric changes. Dealing with such a long period, there was a change in the way of storing the collection of the Presidency of the Republic, which resulted in the need to create specific scripts to automate the collection and processing of data in an operational way for future analysis. As a result, this work describes the technologies developed, in the form of scripts in the Python language, for the construction of an extensive textual corpus, with more than six thousand speeches, and adaptable for future collection. In addition to overcoming the lack of standardization and systematization resulting from changes in government and technologies, the result shows a fruitful approximation between International Relations and other social sciences with elements of computing. Initially thought for the specific project, the merit of this research can be used for several other analyses, in an operational way, and easily adapted to modern software. Once the database was operationalized, the pre-reading of the material led to changes in the analysis intentions, which is why analysis scripts were not continued.
publishDate	2023
dc.date.none.fl_str_mv	2023-06-22T17:23:05Z 2023-06-22T17:23:05Z 2023-03-09
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/bachelorThesis
format	bachelorThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	https://repositorio.ufpb.br/jspui/handle/123456789/27186
url	https://repositorio.ufpb.br/jspui/handle/123456789/27186
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/
eu_rights_str_mv	openAccess
dc.publisher.none.fl_str_mv	Universidade Federal da Paraíba Brasil Ciências Sociais Aplicadas UFPB
publisher.none.fl_str_mv	Universidade Federal da Paraíba Brasil Ciências Sociais Aplicadas UFPB
dc.source.none.fl_str_mv	reponame:Biblioteca Digital de Teses e Dissertações da UFPB instname:Universidade Federal da Paraíba (UFPB) instacron:UFPB
instname_str	Universidade Federal da Paraíba (UFPB)
instacron_str	UFPB
institution	UFPB
reponame_str	Biblioteca Digital de Teses e Dissertações da UFPB
collection	Biblioteca Digital de Teses e Dissertações da UFPB
repository.name.fl_str_mv	Biblioteca Digital de Teses e Dissertações da UFPB - Universidade Federal da Paraíba (UFPB)
repository.mail.fl_str_mv	diretoria@ufpb.br\|\| bdtd@biblioteca.ufpb.br
_version_	1831313887997198336

Soluções para coleta e tratamento de dados qualitativos sobre difusão e apropriação da Agenda 2030: a linguagem Python aplicada à construção de um corpus textual

Registros relacionados