Is this good enough? On expert perception of brain tumor segmentation quality

Katharina Hoebel; Christopher P. Bridge; Sara Ahmed; Oluwatosin Akintola; Caroline Chung M.D.; Raymond Huang; Jason Johnson M.D.; Albert Kim; K. Ina Ly; Ken Chang; Jay Patel; Marco Pinho M.D.; Tracy T. Batchelor M.D.; Bruce Rosen; Elizabeth Gerstner; Jayashree Kalpathy-Cramer

doi:10.1117/12.2611810

4 April 2022

Proceedings Volume 12035, Medical Imaging 2022: Image Perception, Observer Performance, and Technology Assessment; 120350P (2022) https://doi.org/10.1117/12.2611810
Event: SPIE Medical Imaging, 2022, San Diego, California, United States

Abstract

The performance of Deep Learning (DL) segmentation algorithms is routinely determined using quantitative metrics like the Dice score and Hausdorff distance. However, these metrics show a low concordance with humans’ perception of segmentation quality. The successful collaboration of health care professionals with DL segmentation algorithms will require a detailed understanding of experts’ assessment of segmentation quality. Here, we present the results of a study on expert quality perception of brain tumor segmentations of brain MR images generated by a DL segmentation algorithm. Eight expert medical professionals were asked to grade the quality of segmentations on a scale from 1 (worst) to 4 (best). To this end, we collected four ratings for a dataset of 60 cases. We observed a low inter-rater agreement among all raters (Krippendorff’s alpha: 0.34), which potentially is a result of different internal cutoffs for the quality ratings. Several factors, including the volume of the segmentation and model uncertainty, were associated with high disagreement between raters. Furthermore, the correlations between the ratings and commonly used quantitative segmentation quality metrics ranged from no to moderate correlation. We conclude that, similar to the inter-rater variability observed for manual brain tumor segmentation, segmentation quality ratings are prone to variability due to the ambiguity of tumor boundaries and individual perceptual differences. Clearer guidelines for quality evaluation could help to mitigate these differences. Importantly, existing technical metrics do not capture clinical perception of segmentation quality. A better understanding of expert quality perception is expected to support the design of more human-centered DL algorithms for integration into the clinical workflow.
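The two kinds of quantities contrasted in the abstract, an overlap metric such as the Dice score and an agreement coefficient such as Krippendorff's alpha, can be illustrated with a minimal sketch. This is not the authors' code. The function names, the flat 0/1 mask representation, and the squared-difference (interval) distance are assumptions made for illustration; the abstract does not state which distance function was used for alpha. The alpha routine accepts a different number of ratings per case, which matches a design where each case is graded by only some of the raters.

```python
from itertools import permutations


def dice_score(pred, ref):
    """Dice overlap of two binary masks given as flat sequences of 0/1."""
    intersection = sum(1 for p, r in zip(pred, ref) if p and r)
    total = sum(1 for p in pred if p) + sum(1 for r in ref if r)
    # Two empty masks are treated as a perfect match (a convention, assumed here).
    return 1.0 if total == 0 else 2.0 * intersection / total


def krippendorff_alpha(units, delta=lambda a, b: (a - b) ** 2):
    """Krippendorff's alpha for a list of cases.

    units: list of lists, each holding the ratings one case received.
    delta: distance between two ratings; squared difference (interval
           metric) is an assumption, not taken from the paper.
    """
    # Only cases with at least two ratings contribute pairable values.
    units = [u for u in units if len(u) >= 2]
    values = [v for u in units for v in u]
    n = len(values)
    if n < 2:
        return float("nan")
    # Observed disagreement: ordered pairs of ratings within each case.
    d_obs = sum(
        sum(delta(a, b) for a, b in permutations(u, 2)) / (len(u) - 1)
        for u in units
    ) / n
    # Expected disagreement: ordered pairs across all pooled ratings.
    d_exp = sum(delta(a, b) for a, b in permutations(values, 2)) / (n * (n - 1))
    # Alpha is undefined when every rating is identical.
    return float("nan") if d_exp == 0 else 1.0 - d_obs / d_exp
```

An alpha of 1 indicates perfect agreement, values near 0 indicate agreement no better than chance, and negative values indicate systematic disagreement.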

Citation

Katharina Hoebel, Christopher P. Bridge, Sara Ahmed, Oluwatosin Akintola, Caroline Chung M.D., Raymond Huang, Jason Johnson M.D., Albert Kim, K. Ina Ly, Ken Chang, Jay Patel, Marco Pinho M.D., Tracy T. Batchelor M.D., Bruce Rosen, Elizabeth Gerstner, and Jayashree Kalpathy-Cramer "Is this good enough? On expert perception of brain tumor segmentation quality", Proc. SPIE 12035, Medical Imaging 2022: Image Perception, Observer Performance, and Technology Assessment, 120350P (4 April 2022); https://doi.org/10.1117/12.2611810

PROCEEDINGS
11 PAGES

KEYWORDS

Image segmentation

Tumors

Brain

Neuroimaging

Statistical modeling

Medicine

Radiation oncology
