Articles producció científica> Enginyeria Informàtica i Matemàtiques

Differentially private publication of database streams via hybrid video coding

  • Dades identificatives

    Identificador: imarina:9262256
    Autors:
    Parra-Arnau, JavierStrufe, ThorstenDomingo-Ferrer, Josep
    Resum:
    While most anonymization technology available today is designed for static and small data, the current picture is of massive volumes of dynamic data arriving at unprecedented velocities. From the standpoint of anonymization, the most challenging type of dynamic data is data streams. However, while the majority of proposals deal with publishing either count-based or aggregated statistics about the underlying stream, little attention has been paid to the problem of continuously publishing the stream itself with differential privacy guarantees. In this work, we propose an anonymization method that can publish multiple numerical-attribute, finite microdata streams with high protection as well as high utility, the latter aspect measured as data distortion, delay and record reordering. Our method, which relies on the well-known differential pulse-code modulation scheme, adapts techniques originally intended for hybrid video encoding, to favor and leverage dependencies among the blocks of the original stream and thereby reduce data distortion. The proposed solution is assessed experimentally on two of the largest data sets in the scientific community working in data anonymization. Our extensive empirical evaluation shows the trade-off among privacy protection, data distortion, delay and record reordering, and demonstrates the suitability of adapting video-compression techniques to anonymize database streams.
  • Altres:

    Autor segons l'article: Parra-Arnau, Javier; Strufe, Thorsten; Domingo-Ferrer, Josep
    Departament: Enginyeria Informàtica i Matemàtiques
    Autor/s de la URV: Domingo Ferrer, Josep / PARRA ARNAU, JAVIER
    Paraules clau: Video encoding Privacy Database anonymization Data streams
    Resum: While most anonymization technology available today is designed for static and small data, the current picture is of massive volumes of dynamic data arriving at unprecedented velocities. From the standpoint of anonymization, the most challenging type of dynamic data is data streams. However, while the majority of proposals deal with publishing either count-based or aggregated statistics about the underlying stream, little attention has been paid to the problem of continuously publishing the stream itself with differential privacy guarantees. In this work, we propose an anonymization method that can publish multiple numerical-attribute, finite microdata streams with high protection as well as high utility, the latter aspect measured as data distortion, delay and record reordering. Our method, which relies on the well-known differential pulse-code modulation scheme, adapts techniques originally intended for hybrid video encoding, to favor and leverage dependencies among the blocks of the original stream and thereby reduce data distortion. The proposed solution is assessed experimentally on two of the largest data sets in the scientific community working in data anonymization. Our extensive empirical evaluation shows the trade-off among privacy protection, data distortion, delay and record reordering, and demonstrates the suitability of adapting video-compression techniques to anonymize database streams.
    Àrees temàtiques: Software Matemática / probabilidade e estatística Management information systems Interdisciplinar Information systems and management Información y documentación Engenharias iv Engenharias iii Economia Computer science, artificial intelligence Ciencias sociales Ciências biológicas i Ciência da computação Astronomia / física Artificial intelligence Administração pública e de empresas, ciências contábeis e turismo
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Adreça de correu electrònic de l'autor: josep.domingo@urv.cat
    Identificador de l'autor: 0000-0001-7213-4962
    Data d'alta del registre: 2024-10-12
    Versió de l'article dipositat: info:eu-repo/semantics/publishedVersion
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    Referència a l'article segons font original: Knowledge-Based Systems. 247 108778-
    Referència de l'ítem segons les normes APA: Parra-Arnau, Javier; Strufe, Thorsten; Domingo-Ferrer, Josep (2022). Differentially private publication of database streams via hybrid video coding. Knowledge-Based Systems, 247(), 108778-. DOI: 10.1016/j.knosys.2022.108778
    Entitat: Universitat Rovira i Virgili
    Any de publicació de la revista: 2022
    Tipus de publicació: Journal Publications
  • Paraules clau:

    Artificial Intelligence,Computer Science, Artificial Intelligence,Information Systems and Management,Management Information Systems,Software
    Video encoding
    Privacy
    Database anonymization
    Data streams
    Software
    Matemática / probabilidade e estatística
    Management information systems
    Interdisciplinar
    Information systems and management
    Información y documentación
    Engenharias iv
    Engenharias iii
    Economia
    Computer science, artificial intelligence
    Ciencias sociales
    Ciências biológicas i
    Ciência da computação
    Astronomia / física
    Artificial intelligence
    Administração pública e de empresas, ciências contábeis e turismo
  • Documents:

  • Cerca a google

    Search to google scholar