Gordon Center for Medical Imaging, Massachusetts General Hospital, Harvard Medical School, USA¶
Authors:
Aoxiao Zhong, Quanzheng Li
Abstract:
A fully convolutional ResNet-101 network with atrous convolution and atrous spatial pyramid pooling was trained. Atrous convolution and atrous spatial pyramid pooling enlarge the field-of-view for prediction and allow capturing objects as well as image context at multiple scales. Patches of size 512*512 at level 1 were used as input to the network. For each mini-batch 10 normal and 10 tumor patches were randomly taken to train the network. The model was trained for 40000 iterations on a single P100 GPU. The classification task is based on features extracted from tumor probability map with a random forest classifier as described in the submission by HMS-MIT Method 1. Further postprocessing step was used to remove candidates that are too small for the lesion-detection task.
Results:
The following figure shows the receiver operating characteristic (ROC) curve of the method.
The following figure shows the free-response receiver operating characteristic (FROC) curve of the method.
The table below presents the average sensitivity of the developed system at 6 predefined false positive rates: 1/4, 1/2, 1, 2, 4, and 8 FPs per whole slide image.
FPs/WSI | 1/4 | 1/2 | 1 | 2 | 4 | 8 |
---|---|---|---|---|---|---|
Sensitivity | 0.667 | 0.707 | 0.747 | 0.791 | 0.818 | 0.831 |