Conjunts de dades de producció científicaEnginyeria Informàtica i Matemàtiques

eHealth_hclinic_3402500_clarus.zip

  • Identification data

    Identifier:  PC:2468
    Authors:  Santiago Iriso (Fundació Clínic per la Recerca Biomèdica), David Sánchez and Sergio Martínez (Universitat Rovira i Virgili)
    Abstract:
    This dataset simulates a subset of a Passive Medical Record Database which contains the whole medical history of the patients of a big hospital. These patients have become “passive” due to the lack of encounters over a specific period of time or due to the death or change of residence of the patient, etc. Specifically, The “eHealth_hclinic_3402500_clarus” dataset represents coherent clinical data obtained from the discharge reports over one year (2014). The dataset has been synthetically generated from the real data of the “Hospital Clínic de Barcelona”. All the actual data are artificial and random, even though they preserve some statistical properties of the original data. Since all the data are synthetic, this dataset should not be used in any kind of medical research. Its only purpose is to provide an artificial but realistic medical dataset (with regard to data structure and distribution of variables) that can be used as input for the design and evaluation of privacy-preserving mechanisms.
  • Others:

    Document type: info:eu-repo/semantics/other
    Related publications: Pendent
    Persistent identifier: http://hdl.handle.net/20.500.11797/PC-2468
    Departament: Enginyeria Informàtica i Matemàtiques
    Author: Santiago Iriso (Fundació Clínic per la Recerca Biomèdica), David Sánchez and Sergio Martínez (Universitat Rovira i Virgili)
    Repository ingest date: 2016-01-18
    Funding program action: H2020
    Acronym: CLARUS
    Project code: H2020-ICT-2014-1-644024
    Dataset publication year: 2017
    Remarks: This dataset contains 3402500 records
    Subject matter: eHealth
    Language: en
    Other rights: OpenAccess
    Published by (editorial): Universitat Rovira i Virgili
    Access rights: info:eu-repo/semantics/openAccess
    Data type: SQL Scripts
    Abstract: This dataset simulates a subset of a Passive Medical Record Database which contains the whole medical history of the patients of a big hospital. These patients have become “passive” due to the lack of encounters over a specific period of time or due to the death or change of residence of the patient, etc. Specifically, The “eHealth_hclinic_3402500_clarus” dataset represents coherent clinical data obtained from the discharge reports over one year (2014). The dataset has been synthetically generated from the real data of the “Hospital Clínic de Barcelona”. All the actual data are artificial and random, even though they preserve some statistical properties of the original data. Since all the data are synthetic, this dataset should not be used in any kind of medical research. Its only purpose is to provide an artificial but realistic medical dataset (with regard to data structure and distribution of variables) that can be used as input for the design and evaluation of privacy-preserving mechanisms.
  • Keywords:

    eHealth
    medical records
    synthetic data
    privacy-preserving mechanism
  • Documents:

  • Cerca a google

    Search to google scholar