Articles producció científica> Enginyeria Informàtica i Matemàtiques

Conciliating Privacy and Utility in Data Releases via Individual Differential Privacy and Microaggregation

  • Datos identificativos

    Identificador: imarina:9411537
    Autores:
    Soria-Comas JSánchez DDomingo-Ferrer JMartínez SDel Vasto-Terrientes L
    Resumen:
    ϵ-Differential privacy (DP) is a well-known privacy model that offers strong privacy guar-antees. However, when applied to data releases, DP significantly deteriorates the analytical utility of the protected outcomes. To keep data utility at reasonable levels, practical applications of DP to data releases have used weak privacy parameters (large ϵ), which dilute the privacy guarantees of DP. In this work, we tackle this issue by using an alternative formulation of the DP privacy guarantees, named ϵ-individual differential privacy (iDP), which causes less data distortion while providing the same protection as DP to subjects. We enforce iDP in data releases by relying on attribute masking plus a pre-processing step based on data microaggregation. The goal of this step is to reduce the sensitivity to record changes, which determines the amount of noise required to enforce iDP (and DP). Specifically, we propose data microaggregation strategies designed for iDP whose sensitivities are significantly lower than those used in DP. As a result, we obtain iDP-protected data with significantly better utility than with DP. We report on experiments that show how our approach can provide strong privacy (small ϵ) while yielding protected data that do not significantly degrade the accuracy of secondary data analysis.
  • Otros:

    Autor según el artículo: Soria-Comas J; Sánchez D; Domingo-Ferrer J; Martínez S; Del Vasto-Terrientes L
    Departamento: Enginyeria Informàtica i Matemàtiques
    Autor/es de la URV: DEL VASTO TERRIENTES, LUIS MIGUEL / Domingo Ferrer, Josep / Martinez Lluis, Sergio / Sánchez Ruenes, David / SORIA COMAS, JORGE
    Palabras clave: Data microaggregation Data releases Individual differential privacy Machine learning
    Resumen: ϵ-Differential privacy (DP) is a well-known privacy model that offers strong privacy guar-antees. However, when applied to data releases, DP significantly deteriorates the analytical utility of the protected outcomes. To keep data utility at reasonable levels, practical applications of DP to data releases have used weak privacy parameters (large ϵ), which dilute the privacy guarantees of DP. In this work, we tackle this issue by using an alternative formulation of the DP privacy guarantees, named ϵ-individual differential privacy (iDP), which causes less data distortion while providing the same protection as DP to subjects. We enforce iDP in data releases by relying on attribute masking plus a pre-processing step based on data microaggregation. The goal of this step is to reduce the sensitivity to record changes, which determines the amount of noise required to enforce iDP (and DP). Specifically, we propose data microaggregation strategies designed for iDP whose sensitivities are significantly lower than those used in DP. As a result, we obtain iDP-protected data with significantly better utility than with DP. We report on experiments that show how our approach can provide strong privacy (small ϵ) while yielding protected data that do not significantly degrade the accuracy of secondary data analysis.
    Áreas temáticas: Ciência da computação Computer science, theory & methods Software Statistics and probability
    Acceso a la licencia de uso: https://creativecommons.org/licenses/by/3.0/es/
    Direcció de correo del autor: josep.domingo@urv.cat sergio.martinezl@urv.cat david.sanchez@urv.cat
    Identificador del autor: 0000-0001-7213-4962 0000-0002-3941-5348 0000-0001-7275-7887
    Fecha de alta del registro: 2025-02-18
    Versión del articulo depositado: info:eu-repo/semantics/publishedVersion
    Referencia al articulo segun fuente origial: Transactions On Data Privacy. 18 (1): 29-50
    Referencia de l'ítem segons les normes APA: Soria-Comas J; Sánchez D; Domingo-Ferrer J; Martínez S; Del Vasto-Terrientes L (2025). Conciliating Privacy and Utility in Data Releases via Individual Differential Privacy and Microaggregation. Transactions On Data Privacy, 18(1), 29-50
    URL Documento de licencia: https://repositori.urv.cat/ca/proteccio-de-dades/
    Entidad: Universitat Rovira i Virgili
    Año de publicación de la revista: 2025
    Tipo de publicación: Journal Publications
  • Palabras clave:

    Computer Science, Theory & Methods,Software,Statistics and Probability
    Data microaggregation
    Data releases
    Individual differential privacy
    Machine learning
    Ciência da computação
    Computer science, theory & methods
    Software
    Statistics and probability
  • Documentos:

  • Cerca a google

    Search to google scholar