Utility preserving query log anonymization via semantic microaggregation

  • Identifying data

    Identifier:  imarina:9285292
    Authors:  Batet, M; Erola, A; Sánchez, D; Castellà-Roca, J
    Abstract:
    Query logs are of great interest for scientists and companies for research, statistical and commercial purposes. However, the availability of query logs for secondary uses raises privacy issues since they allow the identification and/or revelation of sensitive information about individual users. Hence, query anonymization is crucial to avoid identity disclosure. To enable the publication of privacy-preserved - but still useful - query logs, in this paper, we present an anonymization method based on semantic microaggregation. Our proposal aims at minimizing the disclosure risk of anonymized query logs while retaining their semantics as much as possible. First, a method to map queries to their formal semantics extracted from the structured categories of the Open Directory Project is presented. Then, a microaggregation method is adapted to perform a semantically-grounded anonymization of query logs. To do so, appropriate semantic similarity and semantic aggregation functions are proposed. Experiments performed using real AOL query logs show that our proposal better retains the utility of anonymized query logs than other related works, while also minimizing the disclosure risk. © 2013 Elsevier Inc. All rights reserved.
  • Other:

    Link to the original source: https://www.sciencedirect.com/science/article/abs/pii/S0020025513003174
    Item reference in APA style: Batet, M; Erola, A; Sánchez, D; Castellà-Roca, J (2013). Utility preserving query log anonymization via semantic microaggregation. Information Sciences, 242, 49-63. DOI: 10.1016/j.ins.2013.04.020
    Article reference according to the original source: Information Sciences. 242 49-63
    Article DOI: 10.1016/j.ins.2013.04.020
    Journal publication date: 2013-09-01
    Entity: Universitat Rovira i Virgili
    Deposited article version: info:eu-repo/semantics/acceptedVersion
    Record creation date: 2026-05-09
    URV author(s): Batet Sanromà, Montserrat / Castellà Roca, Jordi / Erola Cañellas, Arnau / Sánchez Ruenes, David
    Department: Enginyeria Informàtica i Matemàtiques
    License document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication type: Journal Publications
    Author as listed in the article: Batet, M; Erola, A; Sánchez, D; Castellà-Roca, J
    Link to the license of use: https://creativecommons.org/licenses/by/3.0/es/
    Subject areas: Theoretical computer science, Software, Information systems and management, Control and systems engineering, Computer science, information systems, Computer science applications, Social sciences, Computer science, Astronomy / physics, Artificial intelligence
    Author email addresses: montserrat.batet@urv.cat, david.sanchez@urv.cat, jordi.castella@urv.cat
  • Keywords:

    Sensitive information
    Semantics
    Semantic similarity
    Semantic aggregation
    Query logs
    Privacy-preservation
    Privacy preservation
    Open Directory Project
    Microaggregation
    Information retrieval
    Data utility
    Data utilities
    Artificial Intelligence
    Computer Science Applications
    Computer Science
    Information Systems
    Control and Systems Engineering
    Information Systems and Management
    Software
    Theoretical Computer Science
    Social sciences
    Astronomy / physics