TY - JOUR
T1 - Limitations of CNNs for approximating the ideal observer despite quantity of training data or depth of network
AU - Omer, Khalid
AU - Caucci, Luca
AU - Kupinski, Meredith
N1 - Publisher Copyright: c Society for Imaging Science and Technology 2020
PY - 2020
Y1 - 2020
N2 - The performance of a convolutional neural network (CNN) on an image texture detection task as a function of linear image processing and the number of training images is investigated. Performance is quantified by the area under (AUC) the receiver operating characteristic (ROC) curve. The Ideal Observer (IO) maximizes AUC but depends on high-dimensional image likelihoods. In many cases, the CNN performance can approximate the IO performance. This work demonstrates counterexamples where a full-rank linear transform degrades the CNN performance below the IO in the limit of large quantities of training data and network layers. A subsequent linear transform changes the images’ correlation structure, improves the AUC, and again demonstrates the CNN dependence on linear processing. Compression strictly decreases or maintains the IO detection performance while compression can increase the CNN performance especially for small quantities of training data. Results indicate an optimal compression ratio for the CNN based on task difficulty, compression method, and number of training images.
AB - The performance of a convolutional neural network (CNN) on an image texture detection task as a function of linear image processing and the number of training images is investigated. Performance is quantified by the area under (AUC) the receiver operating characteristic (ROC) curve. The Ideal Observer (IO) maximizes AUC but depends on high-dimensional image likelihoods. In many cases, the CNN performance can approximate the IO performance. This work demonstrates counterexamples where a full-rank linear transform degrades the CNN performance below the IO in the limit of large quantities of training data and network layers. A subsequent linear transform changes the images’ correlation structure, improves the AUC, and again demonstrates the CNN dependence on linear processing. Compression strictly decreases or maintains the IO detection performance while compression can increase the CNN performance especially for small quantities of training data. Results indicate an optimal compression ratio for the CNN based on task difficulty, compression method, and number of training images.
UR - http://www.scopus.com/inward/record.url?scp=85101143024&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85101143024&partnerID=8YFLogxK
U2 - https://doi.org/10.2352/J.IMAGINGSCI.TECHNOL.2020.64.6.060408
DO - https://doi.org/10.2352/J.IMAGINGSCI.TECHNOL.2020.64.6.060408
M3 - Article
SN - 1062-3701
VL - 64
JO - Journal of Imaging Science and Technology
JF - Journal of Imaging Science and Technology
IS - 6
M1 - 060408
ER -