Working at the web search engine side to generate privacy-preserving user profiles

  • Identifying data

    Identifier: PC:1900
    Authors:
    Jordi Castellà-Roca; David Pàmies-Estrems; Alexandre Viejo
    URV affiliation: Yes
  • Other:

    Author as given in the article: Jordi Castellà-Roca; David Pàmies-Estrems; Alexandre Viejo
    Department: Enginyeria Informàtica i Matemàtiques
    URV author(s): CASTELLÀ ROCA, JORDI; David Pàmies-Estrems; VIEJO GALICIA, LUIS ALEXANDRE
    Keywords: web search; Query logs; Privacy
    Abstract: The popularity of Web Search Engines (WSEs) enables them to generate large amounts of data in the form of query logs. These files contain all search queries submitted by users. Economic benefits could be obtained by selling or releasing those logs to third parties. Nevertheless, these data potentially expose sensitive user information. Removing direct identifiers is not sufficient to preserve the privacy of the users. Some existing privacy-preserving approaches use log batch processing but, as logs are generated and consumed in a real-time environment, a continuous anonymization process would be more convenient. In this paper we propose: (i) a new method to anonymize query logs, based on k-anonymity; and (ii) some de-anonymization tools to determine possible privacy problems in case an attacker gains access to the anonymized query logs. This approach preserves the original user interests but spreads possible semi-identifier information over many users, preventing linkage attacks. To assess its performance, all the proposed algorithms are implemented and an extensive set of experiments is conducted using real data.
    Research group: Criptografia i Secret Estadístic
    Subject areas: Computer engineering
    Usage license: https://creativecommons.org/licenses/by/3.0/es/
    ISSN: 0957-4174
    Author identifier: 0000-0002-0037-9888; N/A; 0000-0003-2342-5100
    Record creation date: 2016-09-21
    Last page: 535
    Journal volume: 64
    Version of the deposited article: info:eu-repo/semantics/acceptedVersion
    License document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Institution: Universitat Rovira i Virgili
    Journal publication year: 2016
    First page: 523
    Publication type: Article
  • Keywords:

    Data protection
    Internet search engines
    web search
    Query logs
    Privacy
    Computer engineering
    0957-4174