Evaluation of Importance Estimators in Deep Learning Classifiers for Computed Tomography

Lennart Brocki; Wistan Marchadour; Jonas Maison; Bogdan Badic; Panagiotis Papadimitroulas; Mathieu Hatt; Franck Vermet; Neo Christopher Chung

doi:10.1007/978-3-031-15565-9_1

Conference proceedings, Lecture Notes in Computer Science, Year: 2022


Lennart Brocki

Role: Author

University of Warsaw

Wistan Marchadour

Role: Author

Laboratoire de Mathématiques de Bretagne Atlantique

Laboratoire de Traitement de l'Information Medicale

Jonas Maison

Role: Author

Laboratoire de Traitement de l'Information Medicale

Aquilab

Bogdan Badic

Role: Author

Laboratoire de Traitement de l'Information Medicale

Panagiotis Papadimitroulas

Role: Author

BIOEMTECH

Mathieu Hatt

Role: Author

Laboratoire de Traitement de l'Information Medicale

Franck Vermet

Role: Author
PersonId : 874245
IdHAL : franck-vermet
ORCID : 0000-0003-3816-5401

Laboratoire de Mathématiques de Bretagne Atlantique

Neo Christopher Chung

Role: Author

University of Warsaw

Abstract

Deep learning has shown superb performance in detecting objects and classifying images, holding great promise for the analysis of medical imaging. Translating the success of deep learning to medical imaging, in which doctors need to understand the underlying process, requires the capability to interpret and explain the predictions of neural networks. Interpretability of deep neural networks often relies on estimating the importance of input features (e.g., pixels) with respect to the outcome (e.g., class probability). However, a number of importance estimators (also known as saliency maps) have been developed, and it is unclear which ones are more relevant for medical imaging applications. In the present work, we investigated the performance of several importance estimators in explaining the classification of computed tomography (CT) images by a convolutional deep network, using three distinct evaluation metrics. Specifically, a ResNet-50 was trained to classify CT scans of lungs acquired with and without contrast agents, in which clinically relevant anatomical areas were manually determined by experts as segmentation masks in the images. Three evaluation metrics were used to quantify different aspects of interpretability. First, the model-centric fidelity measures the decrease in model accuracy when certain inputs are perturbed. Second, concordance between importance scores and the expert-defined segmentation masks is measured at the pixel level by receiver operating characteristic (ROC) curves. Third, we measure the region-wise overlap between an XRAI-based map and the segmentation mask by Dice Similarity Coefficients (DSC). Overall, two versions of SmoothGrad topped the fidelity and ROC rankings, whereas both Integrated Gradients and SmoothGrad excelled in the DSC evaluation. Interestingly, there was a critical discrepancy between model-centric (fidelity) and human-centric (ROC and DSC) evaluation. Expert expectation and intuition embedded in segmentation maps do not necessarily align with how the model arrived at its prediction. Understanding this difference in interpretability would help harness the power of deep learning in medicine.
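The two human-centric metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `dice_coefficient` and `pixel_roc_auc`, the toy six-pixel image, and the 0.5 threshold are all assumptions made for illustration, and a simple threshold stands in for the XRAI-based region map used in the paper. Images are flattened to plain lists to keep the example dependency-free.

```python
from typing import Sequence


def dice_coefficient(pred: Sequence[int], mask: Sequence[int]) -> float:
    """Dice Similarity Coefficient between two flattened binary maps."""
    if len(pred) != len(mask):
        raise ValueError("maps must have the same number of pixels")
    intersection = sum(1 for p, m in zip(pred, mask) if p and m)
    total = sum(1 for p in pred if p) + sum(1 for m in mask if m)
    # Assumed convention: two empty maps count as perfect agreement.
    return 1.0 if total == 0 else 2.0 * intersection / total


def pixel_roc_auc(scores: Sequence[float], mask: Sequence[int]) -> float:
    """Area under the ROC curve for importance scores against a binary mask.

    Uses the Mann-Whitney formulation: the fraction of (inside, outside)
    pixel pairs in which the inside-mask pixel has the higher score,
    with ties counted as one half. Quadratic in the number of pixels,
    so suitable only for small illustrative inputs.
    """
    if len(scores) != len(mask):
        raise ValueError("maps must have the same number of pixels")
    inside = [s for s, m in zip(scores, mask) if m]
    outside = [s for s, m in zip(scores, mask) if not m]
    if not inside or not outside:
        raise ValueError("mask must contain both foreground and background")
    wins = 0.0
    for a in inside:
        for b in outside:
            if a > b:
                wins += 1.0
            elif a == b:
                wins += 0.5
    return wins / (len(inside) * len(outside))


# Hypothetical toy data: six pixels of importance scores and an expert mask.
importance = [0.9, 0.8, 0.1, 0.75, 0.3, 0.2]
expert_mask = [1, 1, 0, 0, 1, 0]

# Assumed stand-in for a region map: keep pixels scoring at least 0.5.
region_map = [1 if s >= 0.5 else 0 for s in importance]

dsc = dice_coefficient(region_map, expert_mask)
auc = pixel_roc_auc(importance, expert_mask)
print(f"DSC = {dsc:.3f}, ROC AUC = {auc:.3f}")
```

Both scores compare the importance map with what experts marked as relevant, which is why the abstract calls them human-centric. Fidelity is not sketched here because it requires a trained classifier whose accuracy can be re-measured after perturbing inputs.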

Domains

Statistics [stat]; Mathematics [math]

Contributor: Franck VERMET

https://hal.univ-brest.fr/hal-04307817

Submitted on: Sunday, November 26, 2023, 16:24:00

Last modified on: Wednesday, December 18, 2024, 11:26:02

Dates and versions

hal-04307817 , version 1 (26-11-2023)

Identifiers

HAL Id : hal-04307817 , version 1
ARXIV : 2209.15398
DOI : 10.1007/978-3-031-15565-9_1

Cite

Lennart Brocki, Wistan Marchadour, Jonas Maison, Bogdan Badic, Panagiotis Papadimitroulas, et al. Evaluation of Importance Estimators in Deep Learning Classifiers for Computed Tomography. 4th International Workshop on EXplainable and TRAnsparent AI and Multi-Agent Systems. International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), Lecture Notes in Computer Science, 13283, Springer International Publishing, pp.3-18, 2022, ⟨10.1007/978-3-031-15565-9_1⟩. ⟨hal-04307817⟩


Collections

UNIV-BREST CNRS INSMI LMBA UBS CHL LATIM IBSAM IBNM ANR
