Projects per year
Abstract
Colorectal cancer is the second leading cause of cancer death worldwide. Its high incidence and mortality rate highlight the critical role of advanced diagnostics and early detection methods. Advancements in computational pathology can significantly enhance diagnostic precision and treatment personalisation, ultimately improving patient outcomes. Hospitals and labs globally are transitioning toward routine whole slide image (WSI) digitisation. This digitisation process generates large volumes of data, offering an opportunity to enhance diagnostic capabilities through the use of machine learning techniques such as weakly supervised learning and self supervised learning (SSL). This study evaluates the performance of state-of-the-art self-supervised learning (SSL) feature extractor foundation models—CTransPath, Phikon, and UNI—against a pretrained ResNet-50, which serves as a benchmark. Our Transformer network analyses these feature vectors, focusing on their efficacy in predicting key colorectal cancer biomarkers within a small dataset containing 423 WSIs with only 8% of cases exhibiting mismatch repair (MMR) deficiency. The CTransPath model achieved the highest validation AUROC of 0.9466 for MMR classification but exhibited a test AUROC of 0.6880, demonstrating significant variability. In contrast, the UNI model demonstrated greater consistency and robustness, achieving a test AUROC of 0.7136, which additionally represents a 6.3% improvement over ResNet-50’s test AUROC of 0.6709. The results highlight the feasibility of using advanced machine learning models with smaller, sparsely annotated datasets, though the variability noted in some models underscores the challenges at the edge of data scarcity. Code and experimental framework available at https://github.com/CraigMyles/SurGen-CRC-Arena.
Original language | English |
---|---|
Title of host publication | Medical image understanding and analysis |
Subtitle of host publication | 28th annual conference, MIUA 2024, Manchester, UK, July 24–26, 2024, proceedings, part I |
Editors | Moi Hoon Yap, Connah Kendrick, Ardhendu Behera, Timothy Cootes, Reyer Zwiggelaar |
Place of Publication | Cham |
Publisher | Springer |
Pages | 329-343 |
ISBN (Electronic) | 9783031669552 |
ISBN (Print) | 9783031669545 |
DOIs | |
Publication status | Published - 24 Jul 2024 |
Event | Medical Image Understanding and Analysis: 28th Annual Event: 28th Annual Conference, MIUA 2024, Manchester, UK, July 24–26, 2024 - Manchester Metropolitan University, Manchester, United Kingdom Duration: 24 Jul 2024 → 26 Jul 2024 Conference number: 28 https://miua2024.github.io/ |
Publication series
Name | Lecture notes in computer science |
---|---|
Publisher | Springer Nature |
Volume | 14859 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | Medical Image Understanding and Analysis: 28th Annual Event |
---|---|
Abbreviated title | MIUA |
Country/Territory | United Kingdom |
City | Manchester |
Period | 24/07/24 → 26/07/24 |
Internet address |
Keywords
- Digital pathology
- Machine learning
- Transformer
- Deep learning
- Slide-level classification
- Mismatch repair (MMR)
- BRAF mutation
- RAS mutation
- Survival prediction
Fingerprint
Dive into the research topics of 'Leveraging foundation models for enhanced detection of colorectal cancer biomarkers in small datasets'. Together they form a unique fingerprint.Projects
- 1 Finished
-
ICAIRD: I-CAIRD: Industrial Centre for AI Research in Digital Diagnostics
Harrison, D. J. (PI)
1/02/19 → 31/01/22
Project: Standard
Datasets
-
SurGen: 1020 H&E-stained Whole Slide Images With Survival and Genetic Markers
Myles, C. G. G. (Creator), Um, I. H. (Creator), Marshall, C. (Creator), Harris-Birtill, D. C. C. (Creator) & Harrison, D. J. (Creator), EMBL-EBI, 24 Jul 2024
DOI: 10.6019/S-BIAD1285, https://www.ebi.ac.uk/biostudies/bioimages/studies/S-BIAD1285
Dataset