Training certified detectives to track down the intrinsic shortcuts in COVID-19 chest x-ray data sets

Sci Rep. 2023 Aug 4;13(1):12690. doi: 10.1038/s41598-023-39855-3.

Abstract

Deep learning faces a significant challenge wherein the trained models often underperform when used with external test data sets. This issue has been attributed to spurious correlations between irrelevant features in the input data and corresponding labels. This study uses the classification of COVID-19 from chest x-ray radiographs as an example to demonstrate that the image contrast and sharpness, which are characteristics of a chest radiograph dependent on data acquisition systems and imaging parameters, can be intrinsic shortcuts that impair the model's generalizability. The study proposes training certified shortcut detective models that meet a set of qualification criteria which can then identify these intrinsic shortcuts in a curated data set.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, N.I.H., Extramural

MeSH terms

  • COVID-19*
  • Deep Learning*
  • Humans
  • Radiographic Image Interpretation, Computer-Assisted / methods
  • Radiography, Thoracic / methods
  • X-Rays