Tesis doctorals> Departament d'Enginyeria Informàtica i Matemàtiques

Simultaneous discrimination prevention and privacy protection in data publishing and mining

  • Dades identificatives

    Identificador: TDX:1221
    Autors:
    Hajian, Sara
    Resum:
    Data mining is an increasingly important technology for extracting useful knowledge hidden in large collections of data. There are, however, negative social perceptions about data mining, among which potential privacy violation and potential discrimination. The former is an unintentional or deliberate disclosure of a user pro le or activity data as part of the output of a data mining algorithm or as a result of data sharing. For this reason, privacy preserving data mining has been introduced to trade o the utility of the resulting data/models for protecting individual privacy. The latter consists of treating people unfairly on the basis of their belonging to a speci c group. Automated data collection and data mining techniques such as classi cation have paved the way to making automated decisions, like loan granting/denial, insurance premium computation, etc. If the training datasets are biased in what regards discriminatory attributes like gender, race, religion, etc., discriminatory decisions may ensue. For this reason, anti-discrimination techniques including discrimination discovery and prevention have been introduced in data mining. Discrimination can be either direct or indirect. Direct discrimination occurs when decisions are made based on discriminatory attributes. Indirect discrimination occurs when decisions are made based on non-discriminatory attributes which are strongly correlated with biased discriminatory ones. In the rst part of this thesis, we tackle discrimination prevention in data mining and propose new techniques applicable for direct or indirect discrimination prevention individually or both at the same time. We discuss how to clean training datasets and outsourced datasets in such a way that direct and/or indirect discriminatory decision rules
  • Altres:

    Data: 2013-06-10
    Departament/Institut: Departament d'Enginyeria Informàtica i Matemàtiques Universitat Rovira i Virgili.
    Idioma: eng
    Identificador: http://hdl.handle.net/10803/119651
    Font: TDX (Tesis Doctorals en Xarxa)
    Autor: Hajian, Sara
    Director: Domingo-Ferrer, Josep, 1965- Pedreschi, Dino
    Format: application/pdf 176 p.
    Editor: Universitat Rovira i Virgili
    Paraula Clau: Simultaneous Discrimination Prevention and Privacy Protection in Data Publishing and Mining
    Títol: Simultaneous discrimination prevention and privacy protection in data publishing and mining
    Matèria: 004 - Informàtica
  • Paraules clau:

    004 - Informàtica
  • Documents:

  • Cerca a google

    Search to google scholar