Preliminary Data Analysis in Healthcare Multicentric Data Mining: a Privacy-preserving Distributed Approach
Andrea Damiani, Carlotta Masciocchi, Luca Boldrini, Roberto Gatta, Nicola Dinapoli, Jacopo Lenkowicz, Giuditta Chiloiro, Maria Gambacorta, Luca Tagliaferri, Rosa Autorino, Monica Pagliara, Maria Blasi, Johan van Soest, Andre Dekker, Vincenzo Valentini
Journal of e-Learning and Knowledge Society Volume 14, Number 1, ISSN 1826-6223 e-ISSN 1826-6223 Publisher: Italian e-Learning Association
The new era of cognitive health care systems offers a large amount of patient data that can be used to develop prediction models and clinical decision support systems. In this frame, the multi-institutional approach is strongly encouraged in order to reach more numerous samples for data mining and more reliable statistics. For these purposes, shared ontologies need to be developed for data management to ensure database semantic coherence in accordance with the various centers’ ethical and legal policies. Therefore, we propose a privacy-preserving distributed approach as a preliminary data analysis tool to identify possible compliance issues and heterogeneity from the agreed multi-institutional research protocol before training a clinical prediction model. This kind of preliminary analysis appeared fast and reliable and its results corresponded to those obtained using the traditional centralized approach. A real time interactive dashboard has also been presented to show analysis results and make the workflow swifter and easier.
Damiani, A., Masciocchi, C., Boldrini, L., Gatta, R., Dinapoli, N., Lenkowicz, J., Chiloiro, G., Gambacorta, M., Tagliaferri, L., Autorino, R., Pagliara, M., Blasi, M., van Soest, J., Dekker, A. & Valentini, V. (2018). Preliminary Data Analysis in Healthcare Multicentric Data Mining: a Privacy-preserving Distributed Approach. Journal of e-Learning and Knowledge Society, 14(1),. Italian e-Learning Association.