August 25, 2025
Modern Pathology

Deep Learning on Histologic Slides Accurately Predicts Consensus Molecular Subtypes and Spatial Heterogeneity in Colon Cancer

AI
Biology
Cancer
Research
Abstract

Colon cancer (CC) is the third most prevalent cancer type. It is highly heterogeneous, particularly in terms of molecular profiles, which have both prognostic and predictive impacts on the treatment efficacy. However, CC treatment in adjuvant situations is currently guided solely by T and N staging. In this context, consensus molecular subtypes (CMSs) were introduced to stratify patients with CC based on molecular profiles. Recent studies have shown that CMS can be heterogeneous in CC, leading to a worse prognosis.

This study focused on predicting CMS and its heterogeneity in CC using deep learning on digitized hematoxylin and eosin ± saffron–stained whole-slide images. Data and whole-slide images of 1996 patients from the PETACC-8, The Cancer Genome Atlas-COAD, and PRODIGE-13 cohorts were used. The model is trained to predict a 4-dimensional CMS vector, reflecting intratumor heterogeneity (ITH). It comprises a self-supervised model for embedding image patches into vectors and a weakly supervised model predicting CMS calls. Ground-truth CMS scores are obtained with the CMSclassifier package. Interpretability analyses are performed at the slide and patch levels.

For homogeneous tumors, the model trained on PETACC-8 achieves 93.0% (±1.4%) macroaverage area under the curve in internal cross-validation and 94.4% macroaverage area under the curve in external validation over PRODIGE-13, whereas the The Cancer Genome Atlas-COAD model reaches 85.4% (±3.0%) in cross-validation and 92.4% over PRODIGE-13. The trained models also provide spatial distributions of CMS across tumor slides and associate specific histologic features with each CMS. Finally, the models are able to predict ITH.

The results show that a deep learning model trained on routine histology slides is capable of providing an efficient and robust method for predicting CMS and characterizing a patient’s ITH, paving the way for the routine consideration of CMS/ITH in clinical decision making in the adjuvant setting.

Authors
Jean-Eudes Le Douget
Paul Jacob
Côme Lepage
Claire Gallois
Marine Sroussi
Aurélien De Reynies
Antoine Cazelles
Daniel Gonzalez
Charles Maussion
Olivier Bouche
Charles-Briac Levache
Marine Jary
Laurent Mineur
Simon Jegou
Charlie Saillard
Mehdi Morel
Julien Taïeb
Frédéric Bibeau
Jean Francois Emile
Pierre Laurent-Puig