Secure Extraction of Personal Information from EHR by Federated Machine Learning - Laboratoire de traitement de l'information médicale
Journal article, Studies in Health Technology and Informatics, Year: 2024

Abstract

Secure extraction of Personally Identifiable Information (PII) from Electronic Health Records (EHRs) presents significant privacy and security challenges. This study explores the application of Federated Learning (FL) to overcome these challenges in the context of French EHRs. Using a multilingual BERT model in an FL simulation involving 20 hospitals, each represented by a unique medical department or pole, we compared the performance of two setups: individual models, where each hospital uses only its own training and validation data without taking part in the FL process, and federated models, where multiple hospitals collaborate to train a global FL model. Our findings show that FL models not only preserve data confidentiality but also outperform the individual models. The global FL model achieved an F1 score of 75.7%, close to the 78.5% obtained by the centralized approach. This research underscores the potential of FL for extracting PII from EHRs and encourages its broader adoption in health data analysis.
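The abstract does not say how the hospitals' models are combined into the global FL model. As a minimal sketch only, assuming a FedAvg-style aggregation (an assumption, not confirmed by the abstract), one aggregation round could weight each hospital's parameters by its number of training samples. The function name `federated_average`, the parameter name `classifier`, and the toy numbers below are all hypothetical.

```python
from typing import Dict, List

# A model is represented here as a mapping from parameter name to a flat list of floats.
Weights = Dict[str, List[float]]


def federated_average(client_weights: List[Weights], client_sizes: List[int]) -> Weights:
    """Return the sample-size-weighted average of per-client parameters.

    Assumption: FedAvg-style aggregation; the abstract does not name the
    aggregation rule actually used in the study.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one sample count per client, and at least one client")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total number of samples must be positive")

    averaged: Weights = {}
    for name in client_weights[0]:
        length = len(client_weights[0][name])
        averaged[name] = [
            # Each client contributes in proportion to its local dataset size.
            sum(w[name][i] * n for w, n in zip(client_weights, client_sizes)) / total
            for i in range(length)
        ]
    return averaged


# Toy round with three hypothetical hospitals (not data from the paper).
hospital_updates = [
    {"classifier": [1.0, 2.0]},
    {"classifier": [3.0, 4.0]},
    {"classifier": [5.0, 6.0]},
]
hospital_sizes = [100, 100, 200]

global_weights = federated_average(hospital_updates, hospital_sizes)
print(global_weights)
```

In this sketch only model parameters and sample counts leave each hospital; the EHR text itself stays local, which is the confidentiality property the abstract attributes to FL.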
Main file
SHTI-316-SHTI240488.pdf (270.31 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-04694435, version 1 (11 September 2024)

Cite

Mohamed El Azzouzi, Reda Bellafqira, Gouenou Coatrieux, Marc Cuggia, Guillaume Bouzillé. Secure Extraction of Personal Information from EHR by Federated Machine Learning. Studies in Health Technology and Informatics, 2024, 316, pp.611-615. ⟨10.3233/shti240488⟩. ⟨hal-04694435⟩