Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep...

15
Motivation Related work Proposed approach Empirical results Conclusion Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Devansh Bisla, Anna Choromanska, Russell S. Berman, Jennifer A. Stein, David Polsky New York University, New York, NY, USA Code: https://bit.ly/2KFRp5e Paper: https://bit.ly/2FBgOZP

Transcript of Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep...

Page 1: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Towards Automated Melanoma Detectionwith Deep Learning:

Data Purification and Augmentation

Devansh Bisla, Anna Choromanska, Russell S. Berman,Jennifer A. Stein, David Polsky

New York University, New York, NY, USA

Code: https://bit.ly/2KFRp5e Paper: https://bit.ly/2FBgOZP

Page 2: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Motivation

Figure 1: (Left) Data Imbalancedness (Right) Data Impurities

Page 3: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Existing computational techniques

Traditional machine learningHand-crafted extraction of features from the data such as

Lesion Symmetry/Asymmetry.Irregular borders.Non-Uniform pigmentation.Lesion size.

Problem: not scalable to large data sets.

Deep Learning

Automatically extract features from large sized data.Problem: Needs large, balanced, and unbiased data.

Page 4: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Traditional training

Visualization results for the conventionally-trained model (Top): Originalimage. (Bottom): Visualization mask overlaid on the original image.

The model overfits to image occlusions such as hairs, rulersand ink marks.

Page 5: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Proposed approach

Data Impurities:

Removal of unwanted objects such as hair, rulers etc.

Data Imbalancedness

Synthetic data generation.Data augmentation.

Page 6: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Data purification

Thresholding in the LUV color space combined withmorphological operations. Note that this may also removedark regions belonging to the lesion itself.[PhilippeSchmid-Saugeon et al]

Overlay the processed image with the segmented lesionobtained from our segmentation algorithm.

Page 7: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Data purification - results

(a) (b) (c) (d) (e)

Figure 2: Top: Original images. Bottom: Images obtained after a,b)scales, c) hairs and scales, and d,e) hairs removal.

Page 8: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Data generation

Figure 3: Architecture of Generative Adversarial Network

Main idea:

Train a generator network to generate images which havesimilar distribution to the one followed by the training data,but do not appear in the training data set.

The discriminator provides a feedback on similarity betweenthe two distributions.

We generated 350 images of melanoma and 750 images ofseborrheic keratosis.

Page 9: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Data generation - results

0.02 0.04 0.06 0.08 0.10Mean Squared Error

0

10

20

30

40

50

60

Frequency

Histogram for Seborrheic Keratosis

0.0 0.1 0.2 0.3Mean Squared Error

0

10

20

30

40

50

60

70

Freq

uenc

y

Histogram for Melanoma

Figure 4: Histograms of the MSE values for (left) seborrheic keratosisand (right) melanoma.

Page 10: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Data generation - results

Seborrheic Keratosis

0.02 0.04 0.059

Melanoma

0.02 0.09 0.18

Page 11: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Classification results: confusion matrix

M N SK

Predicted label

M

N

SK

Tru

e label

80 19 18

89 269 35

12 6 72

Confusion matrix

M N SK

Predicted label

M

N

SK

Tru

e label

83 23 11

38 338 17

6 15 69

Confusion matrix

Figure 5: Confusion matrix obtained by traditional baseline (left) andproposed model (right).

Page 12: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Classification results: ROC-AUC

Mean Value ROC-AUC

Our Approach 0.915Kazuhisa Matsunaga[K. Matsunaga et al.] 0.911

RECOD Titans[A. Menegola et al.] 0.908

Table 1: Leader-board for melanoma and seborrheic keratosis combined.

Method 82% 89% 95%

Top AVG[K. Matsunaga et al.] 0.729 0.588 0.366

Top SK [I. Gonzalez Diaz et al.] 0.727 0.555 0.404

Top M [A. Menegola et al.] 0.747 0.590 0.395

Our Approach 0.697 0.648 0.492

Table 2: Specificity values at sensitivity levels of 82%/89%/95% formelanoma classification. Top AVG, Top SK, and Top M denote thewinning approaches of the ISIC 2017 challenge.

Page 13: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Classification results visualized

TP FP FN TN

Figure 6: Visualization results for seborrheic keratosis. Top: Originalimage. Bottom: Visualization result.

Page 14: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Classification results visualized

TP FP FN TN

Figure 7: Visualization results for Nevus. Top: Original image. Bottom:Visualization result.

Page 15: Towards Automated Melanoma Detection with Deep …...Towards Automated Melanoma Detection with Deep Learning: Data Purification and Augmentation Author Devansh Bisla, Anna Choromanska,

Motivation Related work Proposed approach Empirical results Conclusion

Conclusion

Deep learning based methods are the most accurate andscalable, but they require large, pure and balanced trainingdata sets.

We presented solutions to improve effectiveness ofclassification systems by data purification (removal ofunwanted objects) and data augmentation (synthetic datageneration).