Detalhes bibliográficos
Ano de defesa: |
2023 |
Autor(a) principal: |
Brito, Felipe Timbó |
Orientador(a): |
Não Informado pela instituição |
Banca de defesa: |
Não Informado pela instituição |
Tipo de documento: |
Tese
|
Tipo de acesso: |
Acesso aberto |
Idioma: |
eng |
Instituição de defesa: |
Não Informado pela instituição
|
Programa de Pós-Graduação: |
Não Informado pela instituição
|
Departamento: |
Não Informado pela instituição
|
País: |
Não Informado pela instituição
|
Palavras-chave em Português: |
|
Link de acesso: |
http://repositorio.ufc.br/handle/riufc/75056
|
Resumo: |
Many complex systems are commonly modeled as count-weighted graphs, where nodes represent entities, edges model relationships between them and edge weights define some counting statistics associated with each relationship. As graph data usually contain sensitive information, preserving privacy when releasing this type of data becomes an important issue. In this context, differential privacy (DP) has become the de facto standard for data release under strong mathematical guarantees. When dealing with DP for weighted graphs, most state-of-the-art works assume that the graph topology is known. However, in several real-world applications, the privacy of the graph topology also needs to be ensured. In this dissertation, we aim to bridge the gap between DP and count-weighted graph data release, considering both graph structure and edge weights as private information. We first adapt the weighted graph DP definition to take into account the privacy of the graph structure. We then introduce a scalable technique to randomly add noise to the edge weights and to the graph topology. We also leverage the post-processing property of DP to improve the data utility, considering graph domain constraints. Finally, these combined contributions are used as the foundation for the development of two novel approaches to privately releasing count-weighted graphs under the notions of global and local DP. Experiments using real-world graph data demonstrate the superiority of our approaches in terms of utility over existing techniques, enabling subsequent computation of a variety of statistics on the released graph with high utility, in some cases comparable to the non-private results. |