https://imatge.upc.edu/web/publications/active-deep-learning-medical-imaging-segmentation
This thesis proposes a novel active learning framework capable of effectively training a convolutional neural network for semantic segmentation of medical images with a limited amount of labeled training data. Our approach applies existing active learning techniques to segmentation, a topic that is gaining importance because of the many problems caused by the lack of large labeled datasets. We explore different strategies to analyze the image information and introduce a previously used cost-effective active learning method based on the selection of high-confidence predictions, which are automatically assigned pseudo-labels with the aim of reducing the number of manual annotations. First, we build a simple application for handwritten digit classification to get familiar with the methodology, and then we test the system on a medical image database for the treatment of melanoma skin cancer. Finally, we compare traditional training methods with our active learning proposals, specifying the conditions and parameters required for them to be optimal.
3. 3 MOTIVATION
GOAL.
Use active learning methodology to train a convolutional neural network
for semantic segmentation of lesion areas in medical images.
Skin Lesion Analysis toward Melanoma Detection: A Challenge at the International Symposium on Biomedical Imaging (ISBI)
2016, hosted by the International Skin Imaging Collaboration (ISIC)
4. 4 MOTIVATION
Manual pixel-wise annotation:
▹ Medical expert
▹ 1 image ~ 30 min
2,000 images × 30 min/image × 25 €/h = 25,000 €/database
5. 5 MOTIVATION
▹ Millions of trainable parameters.
▹ Optimization process during training.
▹ Large amounts of labeled data to prevent convergence in local minima.
Deep Convolutional Neural Network
6. 6 MOTIVATION
Active Learning solution
“Automatic selection of useful instances to be labeled in order to achieve similar performance with less data.”
Computer Vision Laboratory CVLAB. Machine Learning for Biomedical Imaging. Active Learning
7. 7 Toy example: 2D classifier
Red & black classifier: select the best discriminator with as little labeled data as possible.
Unlabeled dataset
▹ Ground truth shown in light colors
▹ Random labeling initialization
Figure legend: accuracy ? %; red 0/29, black 0/29, unlabeled 29/29.
8. 8 Toy example: 2D classifier
Red & black classifier: select the best discriminator with as little labeled data as possible.
Active Learning approach:
▹ 4 new labels per iteration
▹ Most uncertain selection
Figure legend: accuracy 85%; red 4/29, black 4/29, unlabeled 21/29.
10. 10 Toy example: 2D classifier
Active Learning approach: 4 new labels per iteration, most uncertain selection.
Figure legend: accuracy 85%; red 6/29, black 6/29, unlabeled 17/29.
11. 11 Toy example: 2D classifier
Active Learning approach: 4 new labels per iteration, most uncertain selection.
Figure legend: accuracy 90%; red 6/29, black 6/29, unlabeled 17/29.
13. 13 Toy example: 2D classifier
Active Learning approach: 4 new labels per iteration, most uncertain selection.
Figure legend: accuracy 90%; red 8/29, black 9/29, unlabeled 12/29.
14. 14 Toy example: 2D classifier
Active Learning approach: 4 new labels per iteration, most uncertain selection.
Figure legend: accuracy 100%; red 8/29, black 9/29, unlabeled 12/29.
15. 15 Toy example: 2D classifier
Goal completed: the same best accuracy with far less labeled data.
Figure: Active Learning approach vs. Fully Labeled approach.
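The selection rule behind this toy example fits in a few lines. Below is a minimal sketch of one iteration, using scikit-learn's LogisticRegression as a stand-in for the red/black 2D classifier; the classifier choice and the helper name are assumptions for illustration, not the slides' exact setup.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def query_most_uncertain(X_labeled, y_labeled, X_pool, n_new=4):
    """Fit the current classifier on the labeled points and return the indices
    of the n_new pool samples whose predicted probability is closest to 0.5,
    i.e. the points nearest the decision border."""
    clf = LogisticRegression().fit(X_labeled, y_labeled)
    proba = clf.predict_proba(X_pool)[:, 1]
    uncertainty = -np.abs(proba - 0.5)            # largest when proba is near 0.5
    return clf, np.argsort(uncertainty)[-n_new:]  # 4 new labels per iteration
```

The queried indices would then be labeled by the oracle, added to the labeled set, and the classifier retrained, which is exactly the loop shown in the slides above.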
17. 17 STATE OF THE ART
Uncertain: samples near the border between classes, more ambiguous for the classifier. Labeled by a human.
Certain: samples far from the border between classes, easier for the classifier. Labeled by the network itself (pseudo-labeling).
Cost-Effective Active Learning (CEAL) algorithm
Keze Wang, Dongyu Zhang, Ya Li, Ruimao Zhang, and Liang Lin. Cost-effective active learning for deep image classification. CoRR,
abs/1701.03551, 2017.
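In code, the CEAL idea reduces to ranking the unlabeled pool by uncertainty and splitting its two ends. A minimal sketch follows; the counts and the uncertainty scores are placeholders, not the values used in the thesis.

```python
import numpy as np

def ceal_split(uncertainties, n_uncertain=10, n_certain=20):
    """Most uncertain samples (near the class border) go to the human annotator;
    most certain samples (far from the border) receive the network's own
    predictions as pseudo-labels, as in Wang et al. (CEAL)."""
    order = np.argsort(uncertainties)      # ascending: most certain first
    human_idx = order[-n_uncertain:]       # labeled by a human
    pseudo_idx = order[:n_certain]         # pseudo-labeled by the system
    return human_idx, pseudo_idx
```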
18. 18 U-NET MODEL
▹ Convolutional Neural Network for biomedical image segmentation.
Olaf Ronneberger et al., U-Net: Convolutional Networks for Biomedical Image Segmentation. CoRR, abs/1505.04597, 2015.
20. 20 ISIC ARCHIVE DATASET
▹ ISIC 2016 Challenge dataset (modified) for Skin Lesion Analysis
towards melanoma detection.
Training set: 1,600 images
Test set: 400 images
23. 23 INITIALIZATION
Initial datasets definition
▹ The ISIC training dataset is split into an initial labeled set and an initial unlabeled set, followed by an initial evaluation.
▹ The initial labeled size depends only on the real-world application specifications.
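A minimal sketch of this split is shown below; the 600/1,000 sizes are taken from slide 36, and the purely random split is an assumption about how the initial labeled set is chosen.

```python
import numpy as np

def initial_split(n_total=1600, n_labeled=600, seed=0):
    """Randomly partition the ISIC training set indices into an initial labeled
    set and an initial unlabeled pool; the labeled size is the free parameter
    fixed by the application's annotation budget."""
    idx = np.random.default_rng(seed).permutation(n_total)
    return idx[:n_labeled], idx[n_labeled:]

labeled_ids, unlabeled_ids = initial_split()   # 600 labeled + 1,000 unlabeled
```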
24. 24 INITIALIZATION
Data augmentation
▹ Random transformations to generate new instances from the initial data (see the sketch below).
▹ Increases the variability of the initial data to prevent overfitting.
▹ Helps to reach a good initial performance quickly.
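A minimal sketch of such augmentation with Keras' ImageDataGenerator; the particular transformations and ranges below are illustrative assumptions, not the thesis' exact settings.

```python
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# The same random geometric transformations must be applied to the image
# and to its segmentation mask so the ground truth stays aligned.
augment_args = dict(rotation_range=20,
                    width_shift_range=0.1,
                    height_shift_range=0.1,
                    zoom_range=0.1,
                    horizontal_flip=True,
                    fill_mode='nearest')

image_gen = ImageDataGenerator(**augment_args)
mask_gen = ImageDataGenerator(**augment_args)

# Using the same seed keeps image/mask pairs synchronized, e.g.:
# image_flow = image_gen.flow(x_train, batch_size=32, seed=1)
# mask_flow  = mask_gen.flow(y_train, batch_size=32, seed=1)
```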
26. 26 UNCERTAINTY COMPUTATION
Monte Carlo Dropout methodology
▹ The prediction uncertainty is the variance of T step predictions obtained by applying dropout at test time, which randomly changes the weight configuration on each pass (see the sketch below).
Figure: original image, the T step predictions, and the resulting pixel-wise (PW) uncertainty map.
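A minimal sketch of this step for a Keras model that contains dropout layers; calling the model with training=True keeps dropout active at prediction time. T = 10 and the model itself are placeholders, not the thesis' exact configuration.

```python
import numpy as np

def mc_dropout_uncertainty(model, image, T=10):
    """Run T stochastic forward passes with dropout enabled and return the mean
    prediction together with the pixel-wise variance (the uncertainty map)."""
    batch = image[np.newaxis, ...]                  # shape (1, H, W, C)
    preds = np.stack([model(batch, training=True).numpy()[0] for _ in range(T)])
    return preds.mean(axis=0), preds.var(axis=0)    # prediction, PW uncertainty
```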
27. 27 UNCERTAINTY COMPUTATION
Monte Carlo Dropout methodology
▹ We need a single numerical index to rank all the input data.
▹ The overall uncertainty is computed as the pixel-wise sum of the uncertainty map.
▹ A size normalization is needed to decorrelate the uncertainty component concentrated around the cell contour (a sketch follows below).
Figure: two example predictions with overall uncertainty U = 162.22 and U = 235.21.
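One way to turn the pixel-wise map into a single ranking index is sketched below; normalizing by the predicted region area is an assumption about how the size normalization works, not necessarily the thesis' exact formula.

```python
import numpy as np

def overall_uncertainty(pw_uncertainty, prediction, threshold=0.5):
    """Sum the pixel-wise uncertainty and normalize by the predicted region size,
    so larger lesions do not rank higher merely because the uncertain band along
    their contour is longer."""
    area = max(int(np.sum(prediction > threshold)), 1)   # avoid division by zero
    return float(np.sum(pw_uncertainty)) / area
```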
28. 28 DIAGRAM DATA ANALYSIS
Complementary Data Selection
▹ Unlabeled set predictions + uncertainty maps.
Figure: uncertainty vs. Dice coefficient scatter plot of the unlabeled data.
29. 29 DIAGRAM DATA ANALYSIS
Complementary Data Selection
Human manual labeling:
▹ Most uncertain samples.
▹ No-detections.
Figure: uncertainty vs. Dice coefficient scatter plot of the unlabeled data, with the no-detections highlighted.
30. 30 DIAGRAM DATA ANALYSIS
Complementary Data Selection
System automatic labeling:
▹ Most accurate predictions.
▹ Most certain samples.
Figure: uncertainty vs. Dice coefficient scatter plot of the unlabeled data.
31. 31 DIAGRAM DATA ANALYSIS
Complementary Data Selection
Figure: uncertainty vs. Dice coefficient plot of the unlabeled data, projected onto the uncertainty axis.
32. 32 DIAGRAM DATA ANALYSIS
Complementary Data Selection
▹ Candidates for pseudo-labeling.
Figure: uncertainty-axis projection with the pseudo-labeling candidates marked.
33. 33 DIAGRAM DATA ANALYSIS
Complementary Data Selection
▹ Interfering samples for the pseudo-labeling selection.
Figure: uncertainty-axis projection with the interfering samples marked.
34. 34 DIAGRAM DATA ANALYSIS
Complementary Data Selection
▹ Interfering samples for the pseudo-labeling selection.
Solution: random selection.
▹ Select K random samples in that region to be manually annotated (a sketch of the whole selection follows below).
Figure: uncertainty vs. Dice coefficient scatter plot of the unlabeled data.
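Putting slides 28-34 together, here is a minimal sketch of the complementary data selection. The no-detection test, the sample counts and the way the random picks are drawn are illustrative assumptions; the actual per-iteration counts appear on slide 36.

```python
import numpy as np

def complementary_selection(uncertainty, predictions,
                            n_uncertain=10, n_random=15, n_pseudo=20, seed=0):
    """Return (human, pseudo) index lists: no-detections, the most uncertain
    samples and K random low-uncertainty samples go to the human annotator;
    the most certain of the remaining samples become pseudo-label candidates."""
    rng = np.random.default_rng(seed)
    order = list(np.argsort(uncertainty))                 # ascending uncertainty
    no_detect = {i for i, p in enumerate(predictions) if p.max() < 0.5}  # empty masks
    most_uncertain = [i for i in reversed(order) if i not in no_detect][:n_uncertain]
    rest = [i for i in order if i not in no_detect and i not in most_uncertain]
    random_pick = list(rng.choice(rest, size=min(n_random, len(rest)), replace=False))
    pseudo = [i for i in rest if i not in random_pick][:n_pseudo]   # most certain
    human = sorted(no_detect) + most_uncertain + random_pick
    return human, pseudo
```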
36. 36 CEAL APPROACH
Initialization
▹ Initial labeled set: 600 samples
▹ Initial unlabeled set: 1,000 samples
▹ Test set: 400 samples
Active Learning loop (sample selection per iteration)
Human labeling:
▹ No-detections: 10 samples
▹ Most uncertain: 10 samples
▹ Random: 15 samples
Pseudo-labeling:
▹ Most certain: 20 + 20 × it samples (for iterations > 5)
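A sketch of how these pieces chain into the active learning loop is shown below. The helpers train, predict_with_uncertainty, complementary_selection and oracle are hypothetical (the selection and MC dropout sketches above), and the pseudo-labeling schedule reflects my reading of the table on this slide.

```python
def ceal_loop(model, labeled, unlabeled, oracle, n_iters=9):
    """labeled: list of (image, mask) pairs; unlabeled: list of images.
    Each iteration fine-tunes the U-Net, scores the pool, queries the human
    oracle for a few samples and refreshes the pseudo-labeled set."""
    pseudo_labeled = []
    for it in range(n_iters):
        train(model, labeled + pseudo_labeled)                   # fine-tune U-Net
        preds, unc = predict_with_uncertainty(model, unlabeled)  # MC dropout
        n_pseudo = 20 + 20 * it if it > 5 else 0                 # schedule from this slide
        human, pseudo = complementary_selection(unc, preds, n_uncertain=10,
                                                n_random=15, n_pseudo=n_pseudo)
        labeled += [(unlabeled[i], oracle(unlabeled[i])) for i in human]
        pseudo_labeled = [(unlabeled[i], preds[i] > 0.5) for i in pseudo]
        unlabeled = [x for j, x in enumerate(unlabeled) if j not in set(human)]
    return model
```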
43. 43 CONCLUSIONS
Active deep learning for semantic segmentation is still rarely discussed today due to the network complexity.
Tested: the Cost-Effective Active Learning methodology is suitable for segmentation.
Satisfactory qualitative results: imperfect segmentations, but good enough for many real-world applications.
44. 44 CONCLUSIONS
▹ Saves time and money in the labeling process if the application does not require perfect contours.
                          Labeled data     Time cost    Money cost
Fully Labeled model       2,000 samples    1,000 h      25,000 €
Active Learning model       900 samples      450 h      11,250 €
Savings                                      550 h      13,750 €
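(Using the rates from slide 4, 30 min per image at 25 €/h, the active learning row follows as 900 images × 0.5 h = 450 h and 450 h × 25 €/h = 11,250 €, which is where the 550 h and 13,750 € of savings over the fully labeled model come from.)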
45. 45 FUTURE WORK
▹ Improve the complementary sample selection in order to take more advantage of the pseudo-labeling process.
48. 48 U-NET MODEL
Training parameters
▹ ConvNet weights randomly initialized.
▹ Loss function: Dice coefficient.
▹ Adam optimizer (stochastic gradient-based optimization)
  ▸ Learning rate: 10e-5
▹ Batch size: 32 samples
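A minimal Keras sketch of these settings follows; build_unet() is a placeholder for the network of slide 18, and the soft-Dice smoothing constant is an assumption.

```python
from tensorflow.keras import backend as K
from tensorflow.keras.optimizers import Adam

def dice_coef(y_true, y_pred, smooth=1.0):
    """Soft Dice coefficient between the ground-truth and predicted masks."""
    y_true_f, y_pred_f = K.flatten(y_true), K.flatten(y_pred)
    intersection = K.sum(y_true_f * y_pred_f)
    return (2.0 * intersection + smooth) / (K.sum(y_true_f) + K.sum(y_pred_f) + smooth)

def dice_coef_loss(y_true, y_pred):
    return -dice_coef(y_true, y_pred)   # maximize Dice by minimizing its negative

# model = build_unet()                  # randomly initialized weights
# model.compile(optimizer=Adam(learning_rate=10e-5),   # learning rate quoted above
#               loss=dice_coef_loss, metrics=[dice_coef])
# model.fit(x_train, y_train, batch_size=32)
```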
49. 49 UNCERTAINTY COMPUTATION
Cell size correlation problem
▹ Correlation between the cell size and the overall uncertainty value.
▹ A cell size normalization is needed.
Figure: two example predictions with overall uncertainty 163.22 and 235.21.