Medical College of Wisconsin
CTSIResearch InformaticsREDCap

Machine learning identifies prognostic subtypes of the tumor microenvironment of NSCLC. Sci Rep 2024 Jul 01;14(1):15004

Date

07/02/2024

Pubmed ID

38951567

Pubmed Central ID

PMC11217297

DOI

10.1038/s41598-024-64977-7

Scopus ID

2-s2.0-85197302000 (requires institutional sign-in at Scopus site)   2 Citations

Abstract

The tumor microenvironment (TME) plays a fundamental role in tumorigenesis, tumor progression, and anti-cancer immunity potential of emerging cancer therapeutics. Understanding inter-patient TME heterogeneity, however, remains a challenge to efficient drug development. This article applies recent advances in machine learning (ML) for survival analysis to a retrospective study of NSCLC patients who received definitive surgical resection and immune pathology following surgery. ML methods are compared for their effectiveness in identifying prognostic subtypes. Six survival models, including Cox regression and five survival machine learning methods, were calibrated and applied to predict survival for NSCLC patients based on PD-L1 expression, CD3 expression, and ten baseline patient characteristics. Prognostic subregions of the biomarker space are delineated for each method using synthetic patient data augmentation and compared between models for overall survival concordance. A total of 423 NSCLC patients (46% female; median age [inter quantile range]: 67 [60-73]) treated with definite surgical resection were included in the study. And 219 (52%) patients experienced events during the observation period consisting of a maximum follow-up of 10 years and median follow up 78 months. The random survival forest (RSF) achieved the highest predictive accuracy, with a C-index of 0.84. The resultant biomarker subtypes demonstrate that patients with high PD-L1 expression combined with low CD3 counts experience higher risk of death within five-years of surgical resection.

Author List

Yu D, Kane MJ, Koay EJ, Wistuba II, Hobbs BP

Author

Duo Yu PhD Assistant Professor in the Data Science Institute department at Medical College of Wisconsin




MESH terms used to index this publication - Major topics in bold

Aged
B7-H1 Antigen
Biomarkers, Tumor
Carcinoma, Non-Small-Cell Lung
Female
Humans
Lung Neoplasms
Machine Learning
Male
Middle Aged
Prognosis
Retrospective Studies
Survival Analysis
Tumor Microenvironment