Autor según el artículo: Jordi Castellà-Roca; David Pàmies-Estrems; Alexandre Viejo
Departamento: Enginyeria Informàtica i Matemàtiques
Autor/es de la URV: CASTELLÀ ROCA, JORDI; David Pàmies-Estrems; VIEJO GALICIA, LUIS ALEXANDRE
Palabras clave: web search Query logs Privacy
Resumen: The popularity of Web Search Engines (WSEs) enables them to generate a lot of data in form of query logs. These files contain all search queries submitted by users. Economical benefits could be earned by means of selling or releasing those logs to third parties. Nevertheless, this data potentially expose sensitive user information. Removing direct identifiers is not sufficient to preserve the privacy of the users. Some existing privacy-preserving approaches use log batch processing but, as logs are generated and consumed in a real-time environment, a continuous anonymization process would be more convenient. In this way, in this paper we propose: (i) a new method to anonymize query logs, based on k-anonymity; and (ii) some de-anonymization tools to determine possible privacy problems, in case that an attacker gains access to the anonymized query logs. This approach preserves the original user interests, but spreads possible semi-identifier information over many users, preventing linkage attacks. To assess its performance, all the proposed algorithms are implemented and an extensive set of experiments are conducted using real data.
Grupo de investigación: Criptografia i Secret Estadístic
Áreas temáticas: Computer engineering Ingeniería informática Enginyeria informàtica
Acceso a la licencia de uso: https://creativecommons.org/licenses/by/3.0/es/
ISSN: 0957-4174
Identificador del autor: 0000-0002-0037-9888; N/A; 0000-0003-2342-5100
Fecha de alta del registro: 2016-09-21
Página final: 535
Volumen de revista: 64
Versión del articulo depositado: info:eu-repo/semantics/acceptedVersion
URL Documento de licencia: https://repositori.urv.cat/ca/proteccio-de-dades/
Entidad: Universitat Rovira i Virgili
Año de publicación de la revista: 2016
Página inicial: 523
Tipo de publicación: Article Artículo Article