Secure Extraction of Personal Information from EHR by Federated Machine Learning

Secure extraction of Personally Identifiable Information (PII) from Electronic Health Records (EHRs) presents significant privacy and security challenges. This study explores the application of Federated Learning (FL) to overcome these challenges within the context of French EHRs. By utilizing a multilingual BERT model in an FL simulation involving 20 hospitals, each represented by a unique medical department or pole, we compared the performance of two setups: individual models, where each hospital uses only its own training and validation data without engaging in the FL process, and federated models, where multiple hospitals collaborate to train a global FL model. Our findings demonstrate that FL models not only preserve data confidentiality but also outperform the individual models. In fact, the Global FL model achieved an F1 score of 75,7%, slightly comparable to that of the Centralized approach at 78,5%. This research underscores the potential of FL in extracting PIIs from EHRs, encouraging its broader adoption in health data analysis.

Mots clés

EHRs Federated Learning NLP Named Entity Recognition

Domaines

Ingénierie biomédicale

Fichier principal

SHTI-316-SHTI240488.pdf (270.31 Ko)

Origine	Fichiers éditeurs autorisés sur une archive ouverte

Laurent Jonchère : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04694435

Soumis le : mercredi 11 septembre 2024-14:43:54

Dernière modification le : mercredi 18 décembre 2024-11:26:02

Dates et versions

hal-04694435 , version 1 (11-09-2024)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

HAL Id : hal-04694435 , version 1
DOI : 10.3233/shti240488
PUBMED : 39176816

Citer

Mohamed El Azzouzi, Reda Bellafqira, Gouenou Coatrieux, Marc Cuggia, Guillaume Bouzillé. Secure Extraction of Personal Information from EHR by Federated Machine Learning. Studies in Health Technology and Informatics, 2024, 316, pp.611-615. ⟨10.3233/shti240488⟩. ⟨hal-04694435⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSERM UNIV-BREST UNIV-RENNES1 LTSI UR1-MATH-STIC UNIV-RENNES LATIM IBSAM UR1-MATH-NUM UR1-BIO-SA

33 Consultations

9 Téléchargements