Remarks: This dataset contains 3402500 records
Subject matter: eHealth
Access rights: info:eu-repo/semantics/openAccess
Persistent identifier: http://hdl.handle.net/20.500.11797/PC-2468
Abstract: This dataset simulates a subset of a Passive Medical Record Database, which contains the whole medical history of the patients of a large hospital. These patients have become “passive” due to a lack of encounters over a specific period of time, or due to the patient's death, change of residence, etc. Specifically, the “eHealth_hclinic_3402500_clarus” dataset represents coherent clinical data obtained from the discharge reports over one year (2014).
The dataset has been synthetically generated from the real data of the “Hospital Clínic de Barcelona”. All the values it contains are artificial and random, although they preserve some statistical properties of the original data.
Since all the data are synthetic, this dataset should not be used in any kind of medical research. Its only purpose is to provide an artificial but realistic medical dataset (with regard to data structure and distribution of variables) that can be used as input for the design and evaluation of privacy-preserving mechanisms.
Other rights: OpenAccess
Data type: SQL Scripts
Department: Enginyeria Informàtica i Matemàtiques
Document type: info:eu-repo/semantics/other
Project code: H2020-ICT-2014-1-644024
Repository ingest date: 2016-01-18
Authors: Santiago Iriso (Fundació Clínic per la Recerca Biomèdica), David Sánchez and Sergio Martínez (Universitat Rovira i Virgili)
Keywords: eHealth, medical records, synthetic data, privacy-preserving mechanism
Funding program action: H2020
Acronym: CLARUS