Articles producció científica> Enginyeria Informàtica i Matemàtiques

EVALUATION OF THE DISCLOSURE RISK OF MASKING METHODS DEALING WITH TEXTUAL ATTRIBUTES

  • Datos identificativos

    Identificador: imarina:9298244
    Autores:
    Martinez, SergioSanchez, DavidValls, Aida
    Resumen:
    Record linkage methods evaluate the disclosure risk of revealing confidential information in anonymized datasets that are publicly distributed. Concretely, they measure the capacity of an intruder to link records in the original dataset with those in the masked one. In the past, masking and record linkage methods have been developed focused on numerical or ordinal data. Recently, motivated by the proliferation of textual information, some authors have proposed masking methods to anonymize textual data. Textual attributes should be interpreted according to their semantics, which makes them more difficult to manage and compare than numerical data. In this paper, we propose a new record linkage method specially tailored to accurately evaluate their disclosure risk. Our method, named Semantic Record Linkage, relies on the theory of semantic similarity and uses widely available ontologies to interpret the semantics of data and propose coherent record linkages. Test performed over a real dataset shows that a semantic record linkage method evaluates better the disclosure risk when compared with a non-semantic approach.
  • Otros:

    Autor según el artículo: Martinez, Sergio; Sanchez, David; Valls, Aida
    Departamento: Enginyeria Informàtica i Matemàtiques
    Autor/es de la URV: Martinez Lluis, Sergio / Sánchez Ruenes, David / Valls Mateu, Aïda
    Palabras clave: Semantic similarity Record linkage Privacy protection Ontologies Disclosure risk
    Resumen: Record linkage methods evaluate the disclosure risk of revealing confidential information in anonymized datasets that are publicly distributed. Concretely, they measure the capacity of an intruder to link records in the original dataset with those in the masked one. In the past, masking and record linkage methods have been developed focused on numerical or ordinal data. Recently, motivated by the proliferation of textual information, some authors have proposed masking methods to anonymize textual data. Textual attributes should be interpreted according to their semantics, which makes them more difficult to manage and compare than numerical data. In this paper, we propose a new record linkage method specially tailored to accurately evaluate their disclosure risk. Our method, named Semantic Record Linkage, relies on the theory of semantic similarity and uses widely available ontologies to interpret the semantics of data and propose coherent record linkages. Test performed over a real dataset shows that a semantic record linkage method evaluates better the disclosure risk when compared with a non-semantic approach.
    Áreas temáticas: Theoretical computer science Software Information systems Engenharias iv Engenharias iii Engenharias i Computer science, artificial intelligence Computational theory and mathematics Ciência da computação Automation & control systems
    Acceso a la licencia de uso: https://creativecommons.org/licenses/by/3.0/es/
    Direcció de correo del autor: david.sanchez@urv.cat aida.valls@urv.cat sergio.martinezl@urv.cat
    Identificador del autor: 0000-0001-7275-7887 0000-0003-3616-7809 0000-0002-3941-5348
    Fecha de alta del registro: 2024-10-12
    Versión del articulo depositado: info:eu-repo/semantics/acceptedVersion
    Enlace a la fuente original: http://www.ijicic.org/contents.htm
    URL Documento de licencia: https://repositori.urv.cat/ca/proteccio-de-dades/
    Referencia al articulo segun fuente origial: International Journal Of Innovative Computing, Information And Control. 8 (7A): 4869-4882
    Referencia de l'ítem segons les normes APA: Martinez, Sergio; Sanchez, David; Valls, Aida (2012). EVALUATION OF THE DISCLOSURE RISK OF MASKING METHODS DEALING WITH TEXTUAL ATTRIBUTES. International Journal Of Innovative Computing, Information And Control, 8(7A), 4869-4882
    Entidad: Universitat Rovira i Virgili
    Año de publicación de la revista: 2012
    Tipo de publicación: Journal Publications
  • Palabras clave:

    Automation & Control Systems,Computational Theory and Mathematics,Computer Science, Artificial Intelligence,Information Systems,Software,Theoretical Computer Science
    Semantic similarity
    Record linkage
    Privacy protection
    Ontologies
    Disclosure risk
    Theoretical computer science
    Software
    Information systems
    Engenharias iv
    Engenharias iii
    Engenharias i
    Computer science, artificial intelligence
    Computational theory and mathematics
    Ciência da computação
    Automation & control systems
  • Documentos:

  • Cerca a google

    Search to google scholar