Built to Last? Reproducibility and Reusability of Deep Learning Algorithms in Computational Pathology

Sophia J Wagner; Christian Matek; Sayedali Shetab Boushehri; Melanie Boxberg; Lorenz Lamm; Ario Sadafi; Dominik J E Winter; Carsten Marr; Tingying Peng

doi:10.1016/j.modpat.2023.100350

Mod Pathol. 2024 Jan;37(1):100350. doi: 10.1016/j.modpat.2023.100350. Epub 2023 Oct 10.

Authors

Sophia J Wagner¹, Christian Matek², Sayedali Shetab Boushehri³, Melanie Boxberg⁴, Lorenz Lamm⁵, Ario Sadafi⁶, Dominik J E Winter⁷, Carsten Marr⁸, Tingying Peng⁹

Affiliations

¹ Helmholtz AI, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
² Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; Institute of Pathology, University Hospital Erlangen, Erlangen, Germany.
³ School of Computation, Information and Technology, Technical University of Munich, Garching, Germany; Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; Data & Analytics (D&A), Roche Pharma Research and Early Development (pRED), Roche Innovation Center Munich, Germany.
⁴ Institute of Pathology, Technical University Munich, Munich, Germany; Institute of Pathology Munich-North, Munich, Germany.
⁵ Helmholtz AI, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; Helmholtz Pioneer Campus, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany.
⁶ School of Computation, Information and Technology, Technical University of Munich, Garching, Germany; Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany.
⁷ Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany; School of Life Sciences, Technical University of Munich, Weihenstephan, Germany.
⁸ Institute of AI for Health, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany.
⁹ Helmholtz AI, Helmholtz Munich-German Research Center for Environmental Health, Neuherberg, Germany.

PMID: 37827448

Abstract

Recent progress in computational pathology has been driven by deep learning. While code and data availability are essential to reproduce findings from preceding publications, ensuring a deep learning model's reusability is more challenging. Reusability requires that the codebase be well documented and easy to integrate into existing workflows, and that models be robust to noise and generalize to data from different sources. Strikingly, only a few computational pathology algorithms have been reused by other researchers so far, let alone employed in a clinical setting. To assess the current state of reproducibility and reusability of computational pathology algorithms, we evaluated peer-reviewed articles available in PubMed, published between January 2019 and March 2021, in 5 use cases: stain normalization; tissue type segmentation; evaluation of cell-level features; genetic alteration prediction; and inference of grading, staging, and prognostic information. We compiled criteria for data and code availability and for statistical result analysis and assessed them in 160 publications. We found that only one-quarter (41 of 160 publications) made code publicly available. Among these 41 studies, three-quarters (30 of 41) analyzed their results statistically, half (20 of 41) released their trained model weights, and approximately a third (16 of 41) used an independent cohort for evaluation. Our review is intended for both pathologists interested in deep learning and researchers applying algorithms to computational pathology challenges. We provide a detailed overview of publications with published code in the field, list reusable data handling tools, and provide criteria for reproducibility and reusability.
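As a quick check on the proportions reported above, the counts can be turned into percentages with a few lines of Python. This is only an illustrative sketch: the counts are taken from the abstract, while the variable and function names (`TOTAL_REVIEWED`, `WITH_CODE`, `percent`) are assumptions introduced here and are not part of the review.

```python
# Counts as stated in the abstract; all names below are illustrative.
TOTAL_REVIEWED = 160   # publications assessed
WITH_CODE = 41         # publications with publicly available code

# Criteria assessed among the 41 publications with public code.
counts_among_code = {
    "statistical analysis": 30,
    "released model weights": 20,
    "independent cohort": 16,
}


def percent(part: int, whole: int) -> float:
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Share of all reviewed publications that released code.
code_share = percent(WITH_CODE, TOTAL_REVIEWED)

# Share of each criterion, relative to the publications with code.
shares = {name: percent(n, WITH_CODE) for name, n in counts_among_code.items()}

print(f"code available: {code_share}% of {TOTAL_REVIEWED}")
for name, share in shares.items():
    print(f"{name}: {share}% of {WITH_CODE}")
```

The printed shares can be compared against the rounded fractions quoted in the abstract. Note that the last three percentages use the 41 publications with code as the denominator, not all 160.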

Keywords: artificial intelligence; computational pathology; deep learning; histology/histopathology; reproducibility; reusability.

Publication types

Review

MeSH terms

Algorithms
Deep Learning*
Humans
Pathologists
Reproducibility of Results