Articles producció científica> Enginyeria Informàtica i Matemàtiques

C-sanitized: A privacy model for document redaction and sanitization

  • Dades identificatives

    Identificador: imarina:5129871
    Autors:
    Sanchez, DavidBatet, Montserrat
    Resum:
    Vast amounts of information are daily exchanged and/or released. The sensitive nature of much of this information creates a serious privacy threat when documents are uncontrollably made available to untrusted third parties. In such cases, appropriate data protection measures should be undertaken by the responsible organization, especially under the umbrella of current legislation on data privacy. To do so, human experts are usually requested to redact or sanitize document contents. To relieve this burdensome task, this paper presents a privacy model for document redaction/sanitization, which offers several advantages over other models available in the literature. Based on the well-established foundations of data semantics and information theory, our model provides a framework to develop and implement automated and inherently semantic redaction/sanitization tools. Moreover, contrary to ad-hoc redaction methods, our proposal provides a priori privacy guarantees which can be intuitively defined according to current legislations on data privacy. Empirical tests performed within the context of several use cases illustrate the applicability of our model and its ability to mimic the reasoning of human sanitizers.
  • Altres:

    Autor segons l'article: Sanchez, David; Batet, Montserrat
    Departament: Enginyeria Informàtica i Matemàtiques
    Autor/s de la URV: Batet Sanromà, Montserrat / Sánchez Ruenes, David
    Paraules clau: Semantics Privacy Knowledge
    Resum: Vast amounts of information are daily exchanged and/or released. The sensitive nature of much of this information creates a serious privacy threat when documents are uncontrollably made available to untrusted third parties. In such cases, appropriate data protection measures should be undertaken by the responsible organization, especially under the umbrella of current legislation on data privacy. To do so, human experts are usually requested to redact or sanitize document contents. To relieve this burdensome task, this paper presents a privacy model for document redaction/sanitization, which offers several advantages over other models available in the literature. Based on the well-established foundations of data semantics and information theory, our model provides a framework to develop and implement automated and inherently semantic redaction/sanitization tools. Moreover, contrary to ad-hoc redaction methods, our proposal provides a priori privacy guarantees which can be intuitively defined according to current legislations on data privacy. Empirical tests performed within the context of several use cases illustrate the applicability of our model and its ability to mimic the reasoning of human sanitizers.
    Àrees temàtiques: Library and information science Information science & library science Información y documentación Computer science, information systems Ciencias sociales
    Accès a la llicència d'ús: https://creativecommons.org/licenses/by/3.0/es/
    Adreça de correu electrònic de l'autor: montserrat.batet@urv.cat david.sanchez@urv.cat
    Identificador de l'autor: 0000-0001-8174-7592 0000-0001-7275-7887
    Data d'alta del registre: 2024-10-12
    Versió de l'article dipositat: info:eu-repo/semantics/acceptedVersion
    URL Document de llicència: https://repositori.urv.cat/ca/proteccio-de-dades/
    Referència a l'article segons font original: Journal Of The Association For Information Science And Technology. 67 (1): 148-163
    Referència de l'ítem segons les normes APA: Sanchez, David; Batet, Montserrat (2016). C-sanitized: A privacy model for document redaction and sanitization. Journal Of The Association For Information Science And Technology, 67(1), 148-163. DOI: 10.1002/asi.23363
    Entitat: Universitat Rovira i Virgili
    Any de publicació de la revista: 2016
    Tipus de publicació: Journal Publications
  • Paraules clau:

    Computer Science, Information Systems,Information Science & Library Science
    Semantics
    Privacy
    Knowledge
    Library and information science
    Information science & library science
    Información y documentación
    Computer science, information systems
    Ciencias sociales
  • Documents:

  • Cerca a google

    Search to google scholar