Anonymising pathology data using generative adversarial networks

David Morrison*, David Cameron Christopher Harris-Birtill

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

35 Downloads (Pure)

Abstract

Anonymising medical data for use in machine learning is important to preserve patient privacy and, in many circumstances, is a requirement before data can be made available. One approach to anonymising image data is to train a generative model to produce data that is statistically similar to the input data and then use the output of the model for downstream tasks, such as image classification, instead of the original sensitive data. In digital pathology, it's not yet well understood how using generative models to anonymise histology slide data impacts the performance of downstream tasks. To begin addressing this, we present an evaluation of a histology image classifier trained using patches extracted from the Camelyon 16 dataset and compare it to a classifier trained on the same number of synthetic images generated with a Deep Convolutional Generative Adversarial Network (DCGAN), from the same data. When predicting the class of an image patch as either cancer or normal it's shown that the accuracy reduces from 0.78 for original alone to 0.59 for synthetic alone, and the recall is significantly reduced from 0.70 to 0.44 when training exclusively on the same amount of synthetic data. If retaining a similar accuracy is required for the downstream task, then either the original data must be used or an improved anonymisation strategy must be devised. We conclude that using this DCGAN to anonymise the dataset, degrades the accuracy of the classifier which implies that it has failed to capture the required variation in the original data to generalise and act as a sufficient anonymisation strategy.
Original languageEnglish
Title of host publicationMedical imaging 2022
Subtitle of host publicationdigital and computational pathology
EditorsJohn E. Tomaszewski, Aaron D. Ward, Richard M.
Place of PublicationBellingham, WA
PublisherSPIE
Number of pages6
ISBN (Electronic)9781510649545
ISBN (Print)9781510649538
DOIs
Publication statusPublished - 4 Apr 2022
EventSPIE Medical Imaging 2022 - Town & Country Resort Convention Center, San Diego, United States
Duration: 20 Feb 202224 Feb 2022

Publication series

NameProceedings of SPIE
Volume12039
ISSN (Print)0277-786X
ISSN (Electronic)1996-756X

Conference

ConferenceSPIE Medical Imaging 2022
Country/TerritoryUnited States
CitySan Diego
Period20/02/2224/02/22

Keywords

  • GANs
  • Generative adversarial networks
  • Anonymisation
  • Histopathology
  • Digital pathology
  • Medical anonymisation

Fingerprint

Dive into the research topics of 'Anonymising pathology data using generative adversarial networks'. Together they form a unique fingerprint.

Cite this