A semantic-preserving differentially private method for releasing query logs

  • Identifying data

    Identifier:  imarina:3934495
    Authors:  Sánchez, D; Batet, M; Viejo, A; Rodríguez-García, M; Castellà-Roca, J
    Abstract:
    © 2018 Elsevier Inc. Query logs are of great interest for data analysis. They allow characterizing user profiles, user behaviors and search habits. However, since query logs usually contain personal information, data controllers should implement appropriate data protection mechanisms before releasing them for secondary use. In the past, the anonymization of query logs was tackled from the perspective of statistical disclosure control and by relying on privacy models such as k-anonymity, which do not scale well with the high dimensionality and dynamicity of query logs. To offer better privacy protection, some authors have recently embraced the robust privacy guarantees of ε-differential privacy. However, this comes at the cost of limiting the number and types of analyses that can be made on the protected queries. To tackle this issue, in this paper we propose a privacy protection method for query logs that joins the flexibility and convenience of privacy-preserving data releases with the strong privacy guarantees of ε-differential privacy. Moreover, to retain the analytical utility of the protected queries, we have taken special care to capture, manage and preserve the semantics of the queries during the protection process. The empirical experiments we report show that our method produces differentially private query logs that are more useful for analysis than those produced by related works.
  • Other:

    Link to the original source: https://www.sciencedirect.com/science/article/abs/pii/S002002551830416X?via%3Dihub
    Item reference in APA style: Sánchez, D; Batet, M; Viejo, A; Rodríguez-García, M; Castellà-Roca, J (2018). A semantic-preserving differentially private method for releasing query logs. Information Sciences, 460-461, 223-237. DOI: 10.1016/j.ins.2018.05.046
    Article reference according to the original source: Information Sciences, 460-461, 223-237
    Article DOI: 10.1016/j.ins.2018.05.046
    Journal publication date: 2018-09-01
    Institution: Universitat Rovira i Virgili
    Deposited article version: info:eu-repo/semantics/acceptedVersion
    Record creation date: 2026-05-09
    URV author(s): Batet Sanromà, Montserrat / Castellà Roca, Jordi / Sánchez Ruenes, David / Viejo Galicia, Luis Alexandre
    Department: Enginyeria Informàtica i Matemàtiques
    License document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication type: Journal Publications
    ISSN: 0020-0255
    Author as listed in the article: Sánchez, D; Batet, M; Viejo, A; Rodríguez-García, M; Castellà-Roca, J
    Access to the use license: https://creativecommons.org/licenses/by/3.0/es/
    Subject areas: Theoretical computer science, Software, Information systems and management, Control and systems engineering, Computer science, information systems, Computer science applications, Social sciences, Computer science, Astronomy / physics, Artificial intelligence
    Author email addresses: montserrat.batet@urv.cat, david.sanchez@urv.cat, jordi.castella@urv.cat, alexandre.viejo@urv.cat
  • Keywords:

    User profiling
    Query logs
    Differential privacy
    Data utility
    Artificial Intelligence
    Computer Science Applications
    Computer Science
    Information Systems
    Control and Systems Engineering
    Information Systems and Management
    Software
    Theoretical Computer Science
    Social sciences
    Astronomy / physics
  • Documents:
