October 11, 2025
BioRxiv

Histology and spatial transcriptomic integration revealed infiltration zone with specific cell composition as a prognostic hotspot in glioblastoma

AI
Biology
Cancer
ML
Research
Spatial Omics
Abstract
Background

Glioblastoma (GBM), the most aggressive primary brain tumor, has a median survival of approximately 15 months. Twenty percent of patients survive beyond three years, but known clinical factors like age, performance status, resection extent, and MGMT promoter methylation status do not fully explain the observed outcomes.

Objective

Our objective was to identify novel histology derived biomarkers associated with end-of-spectrum overall survival (OS) to provide novel biological insight with a translational potential.

Methods

We analyzed a total of 748 GBM patients from 3 different cohorts, uniquely enriched in long survivors (n=98 with overall survival (OS) > 5y including n=196 with OS≥3y), with clinical data and H&E slides obtained from the primary tumor at baseline. We propose an interpretable machine learning (ML) methodology for the discovery of histological biomarkers. Our method learned to segment each H&E slide into three distinct regions associated with long-term survival, short-term survival, and non-informative tissue. We characterized these regions by integrating unsupervised learning, nuclei segmentation, blood vessels detection, pathologist annotations, and multimodal data including spatial transcriptomics from n=31 patients of the GBM MOSAIC dataset to discover fully interpretable biomarkers.

Results

Our OS prediction model using histology and clinical data as input achieved an area under the curve (AUC) of 0.85 for the classification of patients between OS<2 and OS≥3y in external cohort validation, outperforming significantly models trained on clinical data or on histology alone (AUC of 0.76; 0.73, respectively).

Two novel biomarkers were predicting poor survival: the presence of regions of lowly infiltrated white matter enriched in malignant cells with a mesenchymal-like phenotype, and lower levels of angiogenesis associated with higher hypoxia response in the main tumor regions. We also found that a subtype of immunosuppressive tumor macrophages - defined by high PLIN2 expression and lipid accumulation-is consistently enriched in histological areas predictive of poor prognosis.

Conclusion

Our interpretable ML methodology identified a novel prognostic impact of biological processes and cell types according to distinct tumor regions of GBM. These results pave the way for spatially-informed biomarkers to improve risk stratification and for personalized spatially-targeted therapeutic strategies.

Authors
Lucas Fidon
Mélanie Lubrano, PhD
Caroline Hoffmann
Esther Baena
Céline Thiriez
Luis Cano Ayestas
Alex Cornish
Elo Madissoon
Valérie Ducret
Christian Esposito
Eric Durand, PhD
Benoit Schmauch
Elodie Pronier
Alberto Romagnoni
Almudena Espin Perez
MOSAIC Consortium
Dr. Markus Eckstein
Raphael Gottardo
Theoni Maragkou
Mohamed Shelan
Almoatazbellah Youssef
 Spencer S Watson
Marc Sanson
Franck Bielle