Pentágono da privacidade no big data privacy analytics: proposta de modelo multifacetado de garantia da privacidade e do valor analítico

Detalhes bibliográficos
Ano de defesa: 2024
Autor(a) principal: Brenner Lopes
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Minas Gerais
Brasil
ECI - ESCOLA DE CIENCIA DA INFORMAÇÃO
Programa de Pós-Graduação em Gestão e Organização do Conhecimento
UFMG
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: http://hdl.handle.net/1843/74112
Resumo: The continuous increase in the amount of data generated and available has created an environment characterized by a gigantic mass of data, which grows not only in terms of volume and quantity, but also in terms of variety, being created and moving at high speed. If before, the focus and availability of data had as a priority interface the so-called structured data, today they are in much smaller quantity and importance, and adjustments and improvements in technologies and analyzes were partly carried out to adapt to this new reality which is conventionally called big data. The value created by big data analysis has brought many positive factors to different fields, but, on the other hand, it has also brought some problems. One of the issues of great relevance in this context that has raised great concern in society are the threats to privacy, brought about by advanced analyzes of large volumes of data. The question raised as a result of several researches is that the procedures, techniques and technologies currently available cannot fully guarantee privacy. As well as legislation that regulates data protection and privacy, such as in the Brazilian case the General Data Protection Law (LGPD). On the other hand, it is possible to obtain a very high level of privacy, but at the cost of nullifying the possibilities of extracting value from big data. Given this complex scenario, the focus of this research is to propose a multifaceted model (because it needs to be composed of multiple approaches and constructs) within the scope of big data analytics, which guarantees privacy, at the same time, in which it does not make it impossible to extract value. The methodology proposed for this work will be based on a quantitative approach, using text mining and unsupervised machine learning with a focus on creating clusters. After the quantitative phase, the privacy policies of 28 companies were analyzed using a qualitative approach, with two main objectives: finding the best number of clusters; and obtain an understanding of their distinctive characteristics, in order to improve the analysis carried out. The privacy policies of the thousand largest companies in Brazil were selected, classified as such by the Valor1000 ranking (2021). The results pointed to the consistency of the Privacy Pentagon model in the proposed Big Data Analytics; as well as what was called “opaque privacy” in this work.