In advanced-stage lung cancer management, the brain emerges as the predominant metastatic site. Accurately forecasting both the likelihood of intracranial metastasis and patient survival duration has become a critical clinical priority, though existing predictive systems demonstrate insufficient precision.
Materials and methodThis study conducted a retrospective cohort study utilizing SEER database records of lung cancer cases spanning 2010‒2015, incorporating demographic and clinical data. Multivariate logistic regression was initially employed for variable selection, followed by Cox proportional hazards modeling to identify risk factors associated with brain metastasis development and prognostic indicators. Significant predictors were subsequently integrated into six machine learning algorithms to construct risk stratification frameworks. Model performance underwent comprehensive evaluation through area under the ROC curve metrics, classification error matrices, and calibration curve assessments.
ResultsThis investigation analyzed a cohort of 25,072 lung cancer patients. Multiple clinical parameters including demographic characteristics (age, gender, ethnicity), tumor features (primary location, histological classification, differentiation grade), disease progression markers (T/N staging), therapeutic interventions (surgical procedures, radiotherapy, chemotherapy), and social factors (marital status) emerged as significant predictors of prognosis. Metastasis risk analysis identified age, tumor grade, histopathological type, and treatment modalities as key determinants. Comparative evaluation of predictive models revealed that the Gradient Boosting Machine (GBM) algorithm demonstrated superior performance in both cross-validation and internal testing phases.
ConclusionThe developed GBM-based predictive system offers clinicians a user-friendly web application for estimating cerebral metastasis probability in lung cancer cases, enhancing personalized treatment planning through accessible risk stratification.
Lung carcinoma remains the primary contributor to cancer-related mortality worldwide, with cerebral metastases emerging as a frequent complication that substantially diminishes survival outcomes. Current prognostic tools for predicting brain metastasis development exhibit restricted clinical utility due to methodological constraints. These models frequently depend on traditional statistical methodologies, employ insufficient sample cohorts, and incorporate potentially biased variable selection processes. This investigation aims to construct an innovative predictive framework integrating machine learning-enhanced nomographic analysis with cloud-based computational tools. By leveraging advanced pattern recognition capabilities, this approach seeks to improve prognostic precision while offering clinicians an accessible decision-support system for personalized therapeutic planning and patient counseling.
Lung cancer remains a global health burden due to its substantial disease prevalence and fatality rates.1 Statistical analyses reveal an annual incidence of roughly 53.6 cases per 100,000 population, with corresponding mortality figures reaching 45.6 per 100,000 individuals.2 Histologically, this malignancy is classified into two main pathological subtypes: Non-Small Cell Lung Carcinoma (NSCLC) and Small Cell Lung Carcinoma (SCLC). NSCLC represents the predominant form, accounting for 85 %‒90 % of diagnoses, with further subdivisions including adenocarcinoma, squamous cell carcinoma, and large cell carcinoma variants.3 The diagnostic challenge arises from the absence of distinct clinical manifestations in initial disease stages, frequently leading to delayed detection until advanced progression with distant metastatic spread. Metastatic patterns typically emerge through three distinct mechanisms: lymphatic system involvement, bloodstream dissemination, and contiguous tissue infiltration. Common metastatic sites include the brain, skeletal system, hepatic tissue, and adrenal glands, with these secondary growths significantly influencing treatment outcomes and survival probabilities.
The central nervous system has been recognized as a predominant metastatic target in pulmonary carcinoma cases.4–6 Clinical data indicate that nearly half of all lung cancer patients eventually experience intracranial spread (Brain Metastasis, BM). The emergence of neurological dissemination events correlates with diminished survival prospects, demonstrating median life expectancy ranging from 3‒6 months and posing substantial therapeutic difficulties.7,8 Furthermore, cerebral metastases in pulmonary malignancies markedly influence clinical outcomes, reflecting terminal disease progression and indicating unfavorable survival trajectories. These pathological developments may precipitate neurovascular complications accompanied by clinical manifestations including chronic discomfort, cachexia, and potential emergence of Multi-Organ Failure Syndromes (MODS) in terminal stages. This investigation utilized cutting-edge machine learning algorithms to perform systematic analysis on patients with pulmonary carcinoma-related cerebral metastases. Artificial intelligence-driven analytical approaches demonstrate enhanced capacity for integrating multi-layered data patterns when processing extensive clinical datasets, thereby substantially improving predictive model precision.
Machine learning demonstrates superior predictive capabilities when contrasted with conventional logistic regression approaches. By employing sophisticated computational frameworks and large-scale data analytics, this methodology enhances the accuracy of predictive modeling and analytical outcomes.9 The application of machine learning spans multiple sectors, from self-driving vehicle systems to intricate decision-making scenarios in complex systems, highlighting its exceptional computational efficiency and adaptive nature.10 Within biomedical research, the exponential growth of healthcare data repositories has created novel opportunities for advancing disease understanding and health optimization. These advanced algorithms possess the dual capacity to systematically process vast medical datasets while identifying clinically relevant patterns, thereby accelerating progress in tailored medical interventions and individualized care strategies.11,12 This technological advancement has demonstrated practical utility across various medical applications, including diagnostic procedures, therapeutic interventions, and patient monitoring systems.
The advancements in this field have facilitated earlier diagnosis rates, more personalized therapeutic strategies, and improved overall disease management for oncology patients.13 Given the differences in clinicopathological characteristics among individuals with pulmonary malignancies, diverse treatment approaches and survival expectations exist, resulting in considerable variability in clinical outcomes. However, current studies focusing on cerebral metastases from primary lung tumors remain scarce, creating clinical challenges for medical practitioners when formulating intervention protocols. This investigation, therefore, aims to develop and validate advanced machine learning frameworks combined with prognostic nomogram systems ‒ specifically those demonstrating superior predictive accuracy ‒ for assessing cerebral metastasis risks and predicting survival timelines in lung cancer cases.
Materials and methodsPatient cohortThe Surveillance, Epidemiology, and End Results (SEER) Program [www.seer.cancer.gov] provided data through its SEER*Stat tool (version 8.4.1), accessing the Incidence-SEER Research Dataset covering 17 registries. This population-based registry contained time-dependent county attributes from 1990‒2022, including socioeconomic and geographic indicators, with case information spanning 2000‒2021. The National Cancer Institute's Division of Cancer Control and Population Sciences (DCCPS) released the analyzed dataset in April 2024, utilizing November 2023 submissions. The study cohort comprised individuals diagnosed with primary lung malignancies (encompassing small cell carcinoma, squamous cell carcinoma, large cell carcinoma, and adenocarcinoma) during 2010‒2015. Case selection criteria incorporated multiple parameters: Anatomic site classification using ICD-O-3/WHO 2008 guidelines (“Lung and Bronchus”), malignant behavior designation (ICD-O-3 code/3), and restriction to first-primary tumors only.
Histology classifications (ICD-O-3) included small cell lung carcinoma (8041/3), squamous cell lung carcinoma (8070/3), large cell lung carcinoma (8012/3), and pulmonary adenocarcinoma (8140/3). All histological categorizations aligned with SEER's “Site-Histology Recode for Lung and Bronchus” (2010–2015 edition) to maintain database compatibility. Exclusion parameters comprised: 1) Omission of patients lacking survival duration data 2) Removal of individuals with undocumented follow-up status; 3) Elimination of subjects without brain metastasis documentation; 4) Exclusion of cases missing AJCC T/N stage specifications; 5) Disqualification of records lacking racial background, histological verification, or tumor differentiation details; 6) Exclusion of patients with undetermined primary tumor location or laterality information; 7) Removal of cases without recorded therapeutic interventions.
Cases lacking documented marital status, age, or gender information were omitted from the analysis. The case selection procedure is visually represented in Fig. 1, with comprehensive predictor details available in Supplementary Table 2. Given the SEER database's constraint of offering cross-sectional patient data (lacking temporal treatment records such as therapeutic interventions and BM detection timelines), the precise sequence between therapeutic modalities (surgical procedures, radiotherapy, chemotherapeutic regimens) and BM development remains undetermined - a recognized limitation in SEER-related investigations. Following established research protocols, these therapeutic variables were incorporated as predictive factors, with 'brain metastasis' in the OS prognostic model characterized as the initial presentation status during lung cancer identification. Regarding principal predictors (surgical intervention, radiotherapy), these parameters were operationally defined as primary therapeutic strategies determined post-baseline staging, aligning with clinical protocols and SEER research methodologies.
Data screeningThis investigation incorporated fourteen distinct parameters associated with clinicopathological profiles and demographic attributes for comprehensive evaluation. The demographic subset encompassed four key factors: chronological age, biological sex, marital status categorization, and ethnic background classification. The clinicopathological parameters analyzed encompassed primary tumor location, laterality, histological classification, tumor grade, T-category, N category, surgical intervention, radiotherapy administration, chemotherapy regimen, and presence of brain metastases. Lung cancer histotypes were categorized using ICD-O-3 codes into adenocarcinoma, squamous cell carcinoma, large cell carcinoma, and small cell carcinoma subtypes. All malignancies were initially classified according to AJCC 7th edition criteria (aligning with SEER's 2010‒2015 documentation standards) and later reclassified following AJCC 9th edition guidelines through official conversion protocols and SEER staging parameters. Optimal age thresholds were established through X-tile software (v3.6.1) analysis,14 with statistical comparisons performed using chi-square tests. Fourteen prognostic factors were evaluated across three domains: Demographic factors (age at diagnosis, gender, ethnicity, marital status), tumor characteristics (primary site, laterality, histopathological type, differentiation grade, T-stage, N-stage), along therapeutic interventions (surgical resection, radiation therapy, systemic chemotherapy). Outcome-related predictors (including baseline cerebral metastases status) were established prior to analytical procedures, with comprehensive criteria outlined in Supplementary Table 2. A complete-case analytical approach was utilized to eliminate subjects containing undetermined values for these predictive factors, achieving a 0 % missing data rate across all parameters within the finalized study population of 25,072 participants.
Data preprocessingStatistical analyses in this investigation were executed through Python (3.11.3), SPSS 27, and R software (4.4.1). The research utilized SPSS for performing both logistic and Cox regression analyses, serving dual purposes of identifying predictive variables for machine learning model construction and prognostic factor evaluation in survival analysis. Preliminary univariate logistic regression identified potential predictors of cerebral metastasis in lung cancer cases (statistically significant at p < 0.05), which subsequently underwent multivariate analysis. Variables demonstrating persistent significance (p < 0.05) were selected for integration into advanced machine learning frameworks. Parallel analytical procedures using Cox proportional hazards models revealed prognostic determinants (p < 0.05), subsequently visualized through a prognostic nomogram. Inter-variable relationships were systematically examined through comprehensive correlation assessments. To address class imbalance, this approach prioritizes augmenting the sample quantity of underrepresented categories to enhance predictive performance. Feature significance was assessed through the Permutation Importance framework integrated within the algorithmic framework.15,16 The analytical process involved importing 'dplyr' and 'caret' libraries in the R environment, followed by randomized allocation of SEER database records into training and internal validation cohorts with a 70:30 distribution ratio. Discrepancy detection between datasets was conducted through chi-square testing.
ML model establishment and evaluationSix distinct classification methodologies were implemented, combining three ensemble approaches (RF, GBM, XGB) with three fundamental classifiers (LR, DT, NBC).17 Model development was implemented in a Python programming environment, incorporating rigorous training protocols for each algorithm. The SEER dataset underwent division into ten cross-validation folds within the training cohort.18 To address data imbalance, researchers implemented the Synthetic Minority Oversampling Technique (SMOTE) for preprocessing, with this augmentation strategy exclusively applied to the internal validation subset.19 All six computational models (RF, XGB, GBM, LR, DT, NB) maintained standardized parameters (random_state fixed at 123 for experimental consistency). The assessment framework expanded beyond conventional metrics to include Precision-Recall AUC (PR-AUC), Positive Predictive Value (PPV), and Negative Predictive Value (NPV) for comprehensive model evaluation. The present study’s research team developed calibration curves for the six classifiers using the internal testing cohort. During internal validation, raw test data underwent direct processing through the established predictive models. Algorithm performance analysis incorporated multiple quantitative measures, including receiver operating characteristic curves and associated statistical parameters. The model's performance was evaluated using key metrics, including the area under the receiver operating characteristic curve (AUC), along with sensitivity, specificity, classification accuracy, and F-measure scores. Clinical relevance was examined through probability density visualization and clinical utility curve analysis. Building upon the optimal model configuration, the authors designed an interactive web application for clinical prediction. Detailed algorithmic configurations and hyperparameter settings are comprehensively documented in Supplementary Table 3, with established default values ensuring methodological consistency and reproducibility ‒ these baseline configurations align with published performance benchmarks and represent standard initialization practices in SEER-related pulmonary oncology research. The complete analytical framework and SEER*Stat query specifications represent collaborative intellectual assets protected under institutional knowledge-sharing agreements. For replication purposes, researchers may contact the corresponding investigator (Zhe Zhang; E-mail: zz9412182021@163.com) to request access to these computational resources under appropriate data-sharing protocols.
Development and construction of predictive modelThe statistical analysis employs R software (version 4.4.1) with integrated 'rms', 'foreign', 'survival', and 'survivalROC' packages for nomogram creation. Additionally, the authors evaluate predictive performance using multiple validation metrics, including concordance index, receiver operating characteristic analysis, calibration curves, decision curve evaluation, and discrimination enhancement scores through IDI assessment.
ResultsDemographic and clinical profiles of study participantsThe study cohort comprised 25,072 lung cancer cases retrieved from the SEER database, stratified into training (70 %) and internal validation (30 %) cohorts through random allocation. Demographic and clinical characteristics evaluated included age at diagnosis, gender, ethnicity, tumor location, lateralization, differentiation status, histological subtype, TNM classification components (T and N stages), therapeutic interventions (surgical resection, radiotherapy, chemotherapy), marital status, and presence of cerebral metastases. Intergroup variations were assessed through chi-square testing, revealing comparable distributions across all parameters (p > 0.05; see Supplementary Table 1). Cerebral metastases prevalence showed minimal variation between cohorts ‒ 795 cases (4.5 %) in the training group versus 361 cases (4.8 %) in the validation group (detailed in Table 1).
Clinical and pathological characteristics of the train set and internal test set.
This study utilized univariate and multivariate logistic regression methods to identify eight clinical predictors associated with cerebral metastasis development. Key variables encompassed patient age, tumor grade, histological type, tumor staging (T and N classifications), along with treatment modalities including surgical intervention, radiotherapy, and chemotherapy (statistical significance p < 0.05, see Table 2). Building upon these predictors, six predictive models were developed through machine learning techniques.
Univariate analysis and multivariate logistic regression analysis of variables.
To enhance the prognostic nomogram's accuracy, both univariate and multivariate Cox regression analyses were implemented. This analytical approach revealed thirteen clinically significant variables impacting overall survival outcomes. The prognostic indicators comprised demographic factors (age, gender, ethnicity, marital status), tumor characteristics (primary location, histopathology, differentiation grade, TNM staging, brain metastasis), therapeutic interventions (surgical management, radiation therapy, chemotherapeutic treatment), and metastatic status (p < 0.05 as detailed in Table 3).
Univariate analysis and multivariate Cox regression analysis of variables.
Statistical evaluation of variable relationships was conducted using Spearman's correlation method to examine interdependencies within the dataset. The visualized correlation matrix (Fig. 2A) demonstrated moderate-strength associations among the thirteen examined parameters, with coefficients ranging between ±0.5. This pattern suggests minimal interrelatedness among the analyzed variables. Feature importance assessments across multiple analytical approaches, as shown in Fig. 2B, revealed that parameters identified through both single-variable and multi-factor regression approaches showed significant predictive performance in all six computational frameworks. Notably, surgical procedures maintained consistent prominence as the most influential parameter across predictive models, underscoring their critical association with cerebral metastasis development in pulmonary carcinoma cases. The majority of algorithmic approaches confirmed this pattern, Surgical intervention emerges as the primary determinant within the modeling framework, with radiotherapy demonstrating substantial secondary impact. Other variables exhibit differential significance across alternative analytical approaches.
Model performanceThe performance metrics of six predictive algorithms are visually represented in Fig. 3A‒D and Table 4. Analysis of internal 10-fold cross-validation results (Fig. 3A) revealed the Gradient Boosting Machine (GBM) algorithm demonstrated superior predictive capability compared to other models, attaining a mean AUC of 0.894, while the XGBoost (XGB) model showed competitive performance with an AUC of 0.876. Subsequent validation testing outcomes detailed in Table 4 and Fig. 3B‒D confirm the sustained advantage of the GBM model, achieving peak AUC performance (0.899), complemented by balanced metrics including 0.852 accuracy, 0.729 sensitivity, and 0.858 specificity. The model exhibited a PR-AUC of 0.274 alongside predictive values of 0.137 PPV and 0.996 NPV (Table 4). In clinical applications for detecting lung cancer-related Brain Metastases (BM) ‒ particularly crucial given the low observed BM prevalence (4 %‒5 %) ‒ the framework prioritizes diagnostic reliability by emphasizing the critical need to minimize false-negative classifications, thereby ensuring timely therapeutic interventions for high-risk cohorts.
(A) Ten-fold cross-validation results of different machine learning models. (B) The ROC (Receiver Operating Characteristic) curves of different machine learning models in internal test set. (C) The PR-AUC (Precision-Recall Area Under the Curve) of different machine learning models in internal test set. (D) The calibration charts of different machine learning models in internal test set. (E) The confusion matrix of the GBM (Gradient Boosting Machine) model in the train set and the internal test set. TP, True Positive; TN, True Negative; FP, False Positive; FN, False Negative. (F) Probability density plot of gradient boosting machine model. (G) The clinical impact curve of the GBM (Gradient Boosting Machine) model.
Prediction performance of different models.
Clinically pivotal ‒ the remarkably elevated NPV (0.996) demonstrates the model's exceptional precision in excluding BM among individuals categorized as low-risk: over 99 % of model-designated “BM-negative” cases truly lack metastases, substantially mitigating unwarranted patient concerns while curbing excessive utilization of subsequent neuroimaging procedures. This dual benefit simultaneously decreases healthcare expenditures and patient stress. The PR-AUC value of 0.274, while moderate, illustrates the algorithm's discriminative capability within this severely skewed dataset ‒ a characteristic hurdle in predicting rare pathological conditions. Complementing this, the clinically relevant PPV (0.137) ‒ corresponding with the disease's low prevalence and aiding in detecting a high-risk cohort meriting comprehensive diagnostic assessment ‒ equips medical practitioners with data-driven insights for optimizing surveillance protocols. These combined metrics substantiate the model's practical implementation value within clinical BM detection pathways, particularly for strategically allocating limited diagnostic resources while maintaining screening accuracy.
The classification matrix (Fig. 3E) demonstrates model efficacy through detailed performance metrics across training and evaluation datasets. During training, the matrix reveals substantial true negative classifications (12,729) alongside 4026 false positive instances, while maintaining minimal false negatives (77) against 718 accurately identified positive cases. This pattern persists in test data with 5427 true negatives, 1734 false positives, 33 false negatives, and 328 true positives, reflecting comparable classification proficiency between learned and novel data - confirming reliability across both phases. The probability distribution visualization (Fig. 3F) highlights optimal predictive performance occurring near a threshold value of 0.49, where positive and negative case densities exhibit distinct demarcation. This critical inflection point in predictive scoring corresponds to maximum discriminative capacity between diagnostic categories, facilitating precise clinical decision-making thresholds.
This analysis serves as a pivotal consideration for clinical decision-making, as it weighs accurate identification of confirmed cases against erroneous detection. The clinical utility plot (Fig. 3G) provides visual decision support through dual trajectory visualization: one curve illustrates the capture rate of true Brain Metastasis (BM) instances across varying probability cutoffs, while the complementary curve demonstrates exclusion efficacy for non-BM cases. Medical practitioners strategically select diagnostic thresholds through this framework ‒ for instance, a lower cutoff prioritizes BM case inclusion to prevent oversight, whereas a higher threshold minimizes unwarranted diagnostic procedures. This adaptive approach enhances precision in patient care protocols while optimizing resource allocation.
Web predictorBuilding upon the established research framework, a prognostic algorithm for cerebral metastases in pulmonary carcinoma patients was formulated through gradient boosting methodology. This computational tool demonstrated exceptional accuracy in anticipating neurological spread patterns among oncology cases. The digital assessment platform facilitates evidence-based clinical judgments by quantifying personalized metastasis risks, enabling practitioners to input demographic parameters (age, tumor grade, histopathological classification), therapeutic interventions (surgical history, radiation exposure, chemotherapeutic regimens), and TNM staging data. Upon submitting these variables to the analytical engine via the interactive portal (https://lung-cancer-brain-m-bm7dssal6atqu6qdm.streamlit.app/), healthcare professionals can select the “calculate risk” option to generate instantaneous probability estimates. The operational workflow and interface specifications are comprehensively illustrated in Fig. 4A, providing visual guidance for optimal utilization of this predictive system.
Developing and validating of prognostic nomogramA predictive model incorporating the previously identified 13 independent variables was developed from the training cohort to estimate lung cancer patients' Overall Survival (OS). This clinical scoring system demonstrated that surgical treatment emerged as the strongest determinant of survival outcomes, with patient age, tumor T-classification, chemotherapy administration, and cerebral metastases showing subsequent predictive importance. Survival probability projections at 12-, 24-, and 36-month intervals are visually presented in Fig. 4B For clinical application of this prognostic instrument, practitioners should begin by identifying the patient's specific clinical parameters aligned with each predictive variable (including age category, cerebral metastasis status, surgical history, and T classification) as detailed in Supplementary Table 4. Proceed by applying the standardized scoring protocol: align each clinical characteristic with its designated point value in Supplementary Table 4 (e.g., 100 points allocated for surgical intervention, 50.6 points assigned for confirmed brain metastases, 36 points corresponding to specific age brackets).
To calculate the survival prognosis score, clinicians first assign corresponding points to each prognostic factor (e.g., 34.5 points for age ≥ 79), then aggregate each component's score to obtain the total prognostic value ‒ with elevated scores indicating diminished survival prospects ‒ and subsequently apply this composite score to the nomogram for survival time estimation. The model's prognostic efficacy was assessed through three validation metrics: Concordance index (C-index), Receiver Operating Characteristic (ROC) analysis, and calibration curves. The training cohort demonstrated a C-index of 0.801 (95 % Confidence Interval [95 % CI: 0.798–0.804]), while the internal validation cohort showed comparable discriminative ability with a C-index of 0.805 (95 % CI: 0.800–0.810). Bootstrap-corrected validation in both training and internal cohorts revealed robust predictive capacity across multiple timepoints: training cohorts yielded C-indexes of 0.811 (1-year OS), 0.788 (2-year OS), and 0.774 (3-year OS), confirming the model's temporal reliability in overall survival prediction.
In these analyses, the 95 % Confidence Intervals for adjusted C-index values predicting 1-year, 2-year and 3-year overall survival spanned 0.803‒0.818, 0.781‒0.794 and 0.767 - 0.780, respectively. During internal validation, bootstrap-adjusted C-index measurements for 1-, 2-, and 3-year survival outcomes reached 0.812, 0.791, and 0.780, with confidence bounds of 0.801–0.823, 0.781–0.801, and 0.770–0.790. All C-index measurements approached 0.8, demonstrating the model's robust predictive capacity for lung cancer survival outcomes across validation cohorts. ROC curve analysis revealed AUC scores of 0.9, 0.812, and 0.756 for 1-, 2-, and 3-year predictions in the training cohort (Fig. 5A), while internal testing showed comparable values of 0.899, 0.806, and 0.758 (Fig. 5C). Calibration curves exhibited favorable alignment between model predictions and observed survival rates across all time points in both training and validation datasets (Fig. 5C and 5D).
validation of the predicting nomogram. (A) 1-, 2-, and 3-year ROC (Receiver Operating Characteristic) curves of the train set. (B) 1-, 2-, and 3-year ROC (Receiver Operating Characteristic) curves of the internal test set. (C) 1-, 2-, and 3-year calibration charts of the train set. (D) 1-, 2-, and 3-year calibration charts of the internal test set.
To evaluate the efficacy of the prognostic framework, the authors utilized Brier score metrics derived from probability calibration charts (Fig. 5B and 5D). These metrics measure how precisely predicted probabilities correspond with observed results, where diminished values reflect superior calibration performance. Both developmental and validation cohorts demonstrated concordant Brier scores for 1-, 2-, and 3-year Overall Survival (OS) forecasts, consistent with the optimal alignment visible in calibration graphics. The combined assessment ‒ incorporating both graphical calibration concordance and quantitative validation through Brier metrics ‒ confirms the present model's temporal survival estimates maintain strong correspondence with empirical outcomes. Crucially, all calibration trajectories exhibited minimal deviation from the reference 45° bisecting line. For clinical applicability verification, the authors conducted Decision Curve Analysis (DCA) contrasting the predictive algorithm against the AJCC TNM staging protocol across 1-, 2-, and 3-year. The Decision Curve Analysis (DCA) results for both training and internal validation cohorts across 1-, 2-, and 3-year intervals are presented in Fig. 6A and 6B Temporal analysis of the DCA plots revealed the innovative prognostic framework consistently outperformed the conventional AJCC TNM staging system in clinical utility throughout all observation periods, while simultaneously offering practical guidance for therapeutic strategies. Complementing these findings, the integrated discrimination improvement metrics quantitatively confirmed enhanced predictive performance: Training cohort IDI measurements progressively increased from 16.1 % (1-year) to 17.2 % (3-year OS) with statistical significance (p < 0.05 for all intervals), while internal validation demonstrated comparable improvements ranging from 14.9 % to 15.6 % across corresponding timepoints (all p < 0.05), as detailed in Fig. 6.
DiscussionLung cancer ranks as the second most prevalent malignancy in both genders. Current epidemiological data indicate approximately 155,000 fatalities in the United States during 2017 were linked to this disease, representing one-quarter of total cancer mortality.20–22 Metastatic progression accounts for the majority of lung cancer fatalities,23 with cerebral metastases being particularly prevalent – clinical studies reveal up to 50 % of patients eventually develop Brain Metastases (BM).24–26 Recent years have witnessed a marked rise in Lung Cancer Brain Metastasis (LCBM) cases, significantly compromising five-year survival rates while simultaneously straining healthcare infrastructure.27 This epidemiological shift stems from three interrelated causes: advancements in therapeutic interventions prolonging patient survival, enhanced diagnostic imaging enabling earlier metastasis detection, and evolving tumor biology facilitating neural tissue infiltration.
Advancements in neuroimaging modalities have enhanced the identification frequency of cerebral metastases,28 with demographic shifts toward older populations and socioeconomic development contributing to this epidemiological pattern.29,30
Therapeutic interventions and holistic care approaches for pulmonary carcinoma metastases continue to present significant clinical obstacles, requiring integrated treatment protocols. Consequently, investigating risk determinants and disease characteristics in pulmonary malignancies, coupled with precise prognostication of cerebral metastasis progression and survival outcomes, carries crucial implications for therapeutic planning within personalized oncology paradigms.
Extensive research has explored both carcinogenic elements and prognostic indicators influencing pulmonary malignancies. The pathogenesis of this disease involves complex interactions between environmental exposures and hereditary predispositions. International epidemiological investigations consistently identify tobacco use as the principal causative agent in lung cancer development.31 Concurrently, residential radon accumulation has emerged as the secondary etiological determinant following cigarette exposure, representing the predominant risk factor among non-smoking populations.32 Genomic instability and somatic mutations further contribute significantly to oncogenesis, with comprehensive molecular profiling revealing multiple driver mutations such as EGFR, ALK, ROS1, KRAS, BRAF, C-MET, RET, and HER2.33 These evidence-based findings regarding pulmonary carcinogenesis have gained universal recognition within clinical oncology practice.
Extensive research efforts have been dedicated to identifying prognostic indicators in lung cancer management. Research findings indicate that untreated individuals with cerebral metastases face a particularly bleak outlook, surviving an average of 1‒2 months post-diagnosis. Clinical evidence demonstrates that Whole-Brain Radiation Therapy (WBRT) extends median survival to 4‒6 months, solidifying its position as the primary treatment approach for multiple brain metastases cases.34,35 Cytotoxic drug regimens substantially impact clinical outcomes in metastatic patients, with platinum-based combinations historically serving as the cornerstone therapy for advanced NSCLC. Despite therapeutic advancements, metastatic NSCLC maintains a discouraging 5-year survival rate of approximately 7 %.36 Surgical intervention studies by Bryan and Donington revealed improved survival outcomes and localized disease control when integrated into multimodality treatment plans for operable stage IIIA malignancies.37
Previous research by Chati et al. established a predictive nomogram framework for advanced non-small cell lung cancer patients, incorporating variables such as demographic characteristics, tumor staging parameters, and therapeutic approaches.38 Subsequent investigations by Li Z and Mu X expanded this methodology to prognostic modeling in small cell and large cell carcinomas.39,40 Current research in this field presents two primary constraints: insufficient utilization of large-scale datasets for lung cancer prognostic analysis, coupled with methodological limitations in existing models that have overlooked the integration of both NSCLC and SCLC subtypes. The present investigation addresses these gaps by incorporating four major histological subtypes (adenocarcinoma, squamous cell carcinoma, SCLC, and LCLC) concurrently within the analytical framework while accounting for metastatic progression, thereby enhancing prognostic prediction comprehensiveness.
Nevertheless, scholarly publications demonstrate a notable scarcity of studies examining BM in terminal-phase pulmonary carcinoma. Contemporary explorations within this field mainly emphasize two crucial dimensions. Initially, insufficient attention has been directed toward comprehensive investigation of elevated-risk prognostic indicators correlated with cerebral metastases in lung malignancies. Moreover, the dynamic interactions between these autonomous predictive variables remain insufficiently mapped. Subsequently, despite artificial intelligence and extensive data processing demonstrating remarkable analytical capacities with substantial progress in recent years, structured explorations of cerebral metastasis prediction systems utilizing AI methodologies in lung cancer remain underrepresented. Therefore, integrative studies addressing these gaps are critically imperative to improve prognostic evaluation and metastatic pattern anticipation in pulmonary oncology.
By employing comprehensive data analysis from the SEER registry, this investigation utilized logistic regression modeling to pinpoint significant predictors of cerebral metastasis. Thirteen clinically relevant parameters spanning demographic characteristics, tumor biology, and therapeutic interventions were analyzed, including patient age, gender, ethnicity, tumor origin, laterality, differentiation status, histological subtype, tumor extent (T-stage), nodal involvement (N-stage), surgical intervention, radiotherapy administration, systemic treatment history, and marital status. Inter-variable associations were visualized through Spearman's rank correlation heatmaps, with Fig. 2A demonstrating minimal collinearity among the selected variables. Multivariate analysis identified eight clinically significant predictors of cerebral dissemination after adjusting for potential confounders: patient age at diagnosis, tumor histopathology, differentiation grade, primary tumor extent, regional lymph node involvement, surgical resection status, radiation therapy implementation, and chemotherapy administration history. The development of predictive frameworks for cerebral metastasis in progressive pulmonary malignancies represents a critical advancement comparable in importance to the identification of these prognostic determinants.
Within this analytical framework, six predictive models were developed using machine learning techniques. An internal validation process employing a 10-fold cross-validation approach was implemented (Fig. 3A), through which the Gradient Boosting Machine (GBM) algorithm demonstrated superior predictive capability compared to other candidates. Building upon these findings, the authors developed a publicly accessible web-based prediction platform utilizing the GBM architecture (https://lung-cancer-brain-m- bm7dssal6atqu6qdmung2h.streamlit.app/). This computational system allows precise estimation of cerebral metastasis probability through comprehensive clinical parameter analysis. Medical practitioners can utilize the web-based interface to input patient-specific clinical data and obtain instant risk stratification results, thereby enhancing therapeutic planning and clinical management strategies.
Subsequently, Cox proportional hazards regression modeling was utilized to pinpoint 13 distinct prognostic indicators linked to pulmonary carcinoma outcomes, while simultaneously developing time-dependent survival probability estimates spanning 1-, 2-, and 3-year intervals. The graphical nomogram facilitates simultaneous incorporation of diverse clinical parameters through its visual analytics framework, offering an integrated predictive system for customized survival projections. This multidimensional assessment methodology empowers clinicians to perform risk stratification with enhanced precision based on patients' biological profiles, thereby enabling optimized therapeutic regimen customization. Such granular prognostic insights significantly improve collaborative treatment planning processes between medical teams and affected individuals, fostering evidence-based clinical interventions that elevate the quality of oncological care delivery.41,42
This study boasts several prominent strengths. First, a machine-driven statistical model was developed to forecast brain metastasis in lung cancer patients. To the best of our knowledge, this represents the inaugural effort to leverage machine learning for constructing a predictive model of lung cancer brain metastasis, thereby expanding the understanding of advanced lung cancer. Second, the study further probed the associations among independent high-risk factors, offering novel avenues for future clinical research. Specifically, clinical research ought not only to examine the occurrence of metastasis in patients but also account for the relationships between distinct high-risk factors, and gain a deeper comprehension of their mutual influences. This would help to further mitigate adverse factors potentially inducing metastasis during the perioperative phase. Additionally, the authors have crafted not only a predictive model for lung cancer brain metastasis but also a nomogram for prognosticating lung cancer survival time. The integration of these two models can substantially aid clinicians in making more well-informed decisions for lung cancer patients. Fourth, considering the recent release of the ninth edition of the AJCC lung cancer staging system, the clinical data employed this ninth edition TNM staging system, facilitating more remarkable outcomes.
LimitationNevertheless, this investigation presents inherent methodological constraints that warrant recognition. While advancing the field, the research encounters two primary limitations. Current machine learning methodologies predominantly utilize statistical approaches or opaque model architectures,43 introducing significant theoretical limitations in predictive accuracy. This opacity restricts the mechanistic interpretation of critical prognostic factors (including surgical interventions and tumor staging) in cerebral metastasis development, complicating clinical implementation of predictive analytics. Additionally, being conducted at a single institution, the research features a geographically constrained participant pool that may limit demographic diversity and generalizability of findings.
The utilization of machine learning algorithms on large-scale datasets may enhance predictive accuracy.44 The validation protocol relied solely on internal evaluations within the SEER database (partitioned into training and internal validation cohorts), lacking external verification through independent multicenter studies ‒ a limitation that affects the broader applicability of both the GBM algorithm and prognostic nomogram. Fourthly, potential temporal confounding arises from the SEER registry's absence of time-stamped clinical records. The chronological relationship between therapeutic interventions (surgical procedures, radiotherapy, and chemotherapy) and subsequent BM diagnosis could not be definitively established, potentially introducing measurement bias when considering these treatments as baseline predictors, despite adherence to SEER methodological standards. Fifthly, critical clinical variables such as ECOG performance scores, detailed smoking history metrics (including pack-year calculations), and molecular profiling data (EGFR/ALK mutation status) were excluded due to database limitations, constraining the model's clinical specificity. Furthermore, the age stratification threshold was determined algorithmically using X-tile software rather than being grounded in established clinical parameters.
The absence of robust theoretical foundations and potentially undermining consistency among different cohorts represent key limitations. Sixth, the omission of cases containing “unknown” predictor variables (such as AJCC stages and histology) in complete-case analytical approaches could introduce selection bias, as patients with missing data might constitute distinct clinical subgroups not fully representative of real-world clinical scenarios. Subsequent investigations will tackle these limitations through multicenter external validation studies, incorporating temporally annotated data streams, and integration of previously unaccounted clinical parameters to strengthen predictive model reliability.
ConclusionConsequently, this research developed a predictive algorithm using machine learning techniques to assess cerebral metastasis risks in lung cancer patients, incorporating eight routinely available clinical-pathological parameters from standard medical practice. Among various models evaluated, the Gradient Boosting Machine exhibited superior predictive accuracy. This optimized GBM framework demonstrates clinical utility for anticipating neurological metastases, enabling physicians to enhance precision in therapeutic decision-making for pulmonary carcinoma cases. Additionally, investigators created a comprehensive prognostic scoring system through big data analytics. The integrated application of these predictive tools facilitates simultaneous assessment of metastasis probability and survival outcomes, providing multidimensional clinical insights for patient management.
Ethics approval and consent to participateNot applicable. The authors confirm that no experimental procedures involving humans or animals were conducted during this research.
Consent for publicationAll authors take full responsibility for ensuring the validity and integrity of this work. Any issues concerning accuracy or ethical standards have been thoroughly examined and addressed.
Availability of data and materialsThe research data supporting this study are openly accessible through the SEER database (https://seer.cancer.gov). Specific configurations used in SEER*Stat analyses, computational scripts, or additional methodological details may be requested from the corresponding author. Full technical documentation and analytical resources will be made available to facilitate replication studies.
FundingChina-Japan Union Hospital of Jilin University MED+XInterdisciplinary Discipline Cultivation Program No 2025022. Clinical Comparative Study of Single-port thoracoscopic radical resection of lung Cancer and traditional thoracoscopic radical resection of lung cancer (No.SCZSY201611). The Special Project for Health Research Talents of Jilin Province [construction of 3D model for precise navigation of thoracoscopic lung segmental (subsegmental) resection], funded by the Jilin Provincial Department of Finance (No. 2023SCZ27). Research on the prediction model of invasive Adenocarcinoma of the lung based on multimodal deep learning of clinical-radiomics-genomics (No. YXJL-2022-0080-0443).
Data availabilityThe datasets generated and/or analyzed during the current study are available from the corresponding author upon reasonable request.
CRediT authorship contribution statementYukai Zeng: Visualization. Fengwu Lin: Funding acquisition. Lening Zhang: Supervision. Jingyuan Jiang: Writing – review & editing. Chen Zhang: Validation. Dongliang Qin: Data curation. Zhe Zhang: Writing – original draft.
The authors declare no conflicts of interest.
Gratitude is extended to all patients, investigators, and institutions involved in this study, with special acknowledgment of the SEER database.











