Articles producció científicaEnginyeria Informàtica i Matemàtiques

Working at the web search engine side to generate privacy-preserving user profiles

  • Identification data

    Identifier:  PC:1900
    Authors:  Jordi Castellà-Roca; David Pàmies-Estrems; Alexandre Viejo
    Abstract:
    The popularity of Web Search Engines (WSEs) enables them to generate a lot of data in form of query logs. These files contain all search queries submitted by users. Economical benefits could be earned by means of selling or releasing those logs to third parties. Nevertheless, this data potentially expose sensitive user information. Removing direct identifiers is not sufficient to preserve the privacy of the users. Some existing privacy-preserving approaches use log batch processing but, as logs are generated and consumed in a real-time environment, a continuous anonymization process would be more convenient. In this way, in this paper we propose: (i) a new method to anonymize query logs, based on k-anonymity; and (ii) some de-anonymization tools to determine possible privacy problems, in case that an attacker gains access to the anonymized query logs. This approach preserves the original user interests, but spreads possible semi-identifier information over many users, preventing linkage attacks. To assess its performance, all the proposed algorithms are implemented and an extensive set of experiments are conducted using real data.
  • Others:

    Link to the original source: https://www.sciencedirect.com/science/article/abs/pii/S0957417416304328?via%3Dihub
    Article's DOI: 10.1016/j.eswa.2016.08.033
    Journal publication year: 2016
    Entity: Universitat Rovira i Virgili
    Paper version: info:eu-repo/semantics/acceptedVersion
    Record's date: 2016-09-21
    First page: 523
    URV's Author/s: CASTELLÀ ROCA, JORDI; David Pàmies-Estrems; VIEJO GALICIA, LUIS ALEXANDRE
    Department: Enginyeria Informàtica i Matemàtiques
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication Type: Article
    Last page: 535
    ISSN: 0957-4174
    Author, as appears in the article.: Jordi Castellà-Roca; David Pàmies-Estrems; Alexandre Viejo
    licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Journal volume: 64
    Research group: Criptografia i Secret Estadístic
    Thematic Areas: Computer engineering