A good feature extractor is all you need for weakly supervised pathology slide classification

Georg Wölflein*, Dyke Ferber, Asier Rabasco Meneghetti, Omar S.M. El Nahhas, Daniel Truhn, Zunamys I. Carrero, David J. Harrison, Ognjen Arandjelović, Jakob Nikolas Kather

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Stain normalisation is thought to be a crucial preprocessing step in computational pathology pipelines. We question this belief in the context of weakly supervised whole slide image classification, motivated by the emergence of powerful feature extractors trained using self-supervised learning on diverse pathology datasets. To this end, we performed the most comprehensive evaluation of publicly available pathology feature extractors to date, involving more than 8,000 training runs across nine tasks, five datasets, three downstream architectures, and various preprocessing setups. Notably, we find that omitting stain normalisation and image augmentations does not compromise downstream slide-level classification performance, while incurring substantial savings in memory and compute. Using a new evaluation metric that facilitates relative downstream performance comparison, we identify the best publicly available extractors, and show that their latent spaces are remarkably robust to variations in stain and augmentations like rotation. Contrary to previous patch-level benchmarking studies, our approach emphasises clinical relevance by focusing on slide-level biomarker prediction tasks in a weakly supervised setting with external validation cohorts. Our findings stand to streamline digital pathology workflows by minimising preprocessing needs and informing the selection of feature extractors. Code and data are available at https://georg.woelflein.eu/good-features.

Original languageEnglish
Title of host publicationComputer vision – ECCV 2024 workshops, proceedings
Subtitle of host publicationMilan, Italy, September 29–October 4, 2024, proceedings, part XVI
EditorsAlessio Del Bue, Cristian Canton, Jordi Pont-Tuset, Tatiana Tommasi
Place of PublicationCham
PublisherSpringer
Pages68-87
Number of pages20
ISBN (Electronic)9783031917219
ISBN (Print)9783031917202
DOIs
Publication statusPublished - 30 May 2025
EventWorkshops that were held in conjunction with the 18th European Conference on Computer Vision, ECCV 2024 - Milan, Italy
Duration: 29 Sept 20244 Oct 2024
https://eccv.ecva.net/Conferences/2024

Publication series

NameLecture notes in computer science
Volume15638 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Workshop

WorkshopWorkshops that were held in conjunction with the 18th European Conference on Computer Vision, ECCV 2024
Country/TerritoryItaly
CityMilan
Period29/09/244/10/24
Internet address

Keywords

  • Pathology
  • Stain normalisation
  • Weakly supervised learning

Fingerprint

Dive into the research topics of 'A good feature extractor is all you need for weakly supervised pathology slide classification'. Together they form a unique fingerprint.

Cite this