骨肉瘤患者2年肺转移风险机器学习预测模型的构建
Construction of a Machine Learning Prediction Model for the 2-Year Lung Metastasis Risk in Patients with Osteosarcoma
DOI: 10.12677/acm.2026.1641549, PDF,   
作者: 王 杰, 訾贵洋, 岳 斌*:青岛大学附属医院骨与软组织肿瘤科,山东 青岛;陈仲毅:青岛大学青岛医学院,山东 青岛
关键词: 骨肉瘤肺转移机器学习风险预测可解释人工智能Osteosarcoma Pulmonary Metastasis Machine Learning Risk Prediction Explainable Artificial Intelligence (XAI)
摘要: 目的:构建基于机器学习技术的骨肉瘤患者2年肺转移风险预测模型,并精准识别影响肺转移发生的关键预测指标,为临床早期干预提供参考依据。方法:回顾性收集青岛大学附属医院2016年1月至2025年12月期间收治的骨肉瘤患者临床资料。肺转移的定义为随访2年内通过影像学检查明确证实存在肺部转移病灶。采用Boruta算法(实施10折交叉验证,筛选被10折中超过9折判定为重要的特征)、相关性分析及LASSO回归三种方法联合筛选核心预测变量,构建8种不同的机器学习模型。结果:经多步特征筛选后,最终保留6个关键预测特征,分别为白蛋白、软组织受累、血红蛋白、碱性磷酸酶、C反应蛋白及乳酸脱氢酶。在构建的8种机器学习模型中,逻辑回归(LR)模型表现最优,其训练集曲线下面积(AUC)为0.93,测试集AUC为0.92,敏感性达92%,特异性为82%,展现出良好的泛化能力和预测可靠性。结论:基于多层感知机(MLP)构建的可解释性模型,能够有效识别骨肉瘤肺转移的高危患者,该模型预测效能稳定,具备较高的临床推广应用价值。
Abstract: Objective: To develop a machine learning-based model for predicting the 2-year risk of pulmonary metastasis in patients with osteosarcoma, and to accurately identify key predictive indicators associated with pulmonary metastasis, thereby providing evidence for early clinical intervention. Methods: Clinical data of patients with osteosarcoma treated at the Affiliated Hospital of Qingdao University from January 2016 to December 2025 were retrospectively collected. Pulmonary metastasis was defined as the presence of lung metastatic lesions confirmed by imaging examinations within a 2-year follow-up period. Core predictive variables were jointly screened using the Boruta algorithm (with 10-fold cross-validation, selecting features identified as important in more than 9 of the 10 folds), correlation analysis, and least absolute shrinkage and selection operator (LASSO) regression. Subsequently, eight different machine learning models were constructed. Results: After multistep feature selection, six key predictive features were ultimately retained: albumin, soft tissue involvement, hemoglobin, alkaline phosphatase, C-reactive protein, and lactate dehydrogenase. Among the eight constructed machine learning models, the logistic regression (LR) model demonstrated the best performance, with an area under the curve (AUC) of 0.93 in the training set and 0.92 in the test set. The model achieved a sensitivity of 92% and a specificity of 82%, indicating good generalization ability and reliable predictive performance. Conclusion: An interpretable model based on a multilayer perceptron (MLP) can effectively identify osteosarcoma patients at high risk of pulmonary metastasis. The model demonstrates stable predictive performance and has considerable potential for clinical application.
文章引用:王杰, 陈仲毅, 訾贵洋, 岳斌. 骨肉瘤患者2年肺转移风险机器学习预测模型的构建[J]. 临床医学进展, 2026, 16(4): 2920-2929. https://doi.org/10.12677/acm.2026.1641549

参考文献

[1] 王凤伟, 王臻, 李靖, 等. 早期进展性骨肉瘤发生率与预后的单中心回顾性研究[J]. 中国骨与关节杂志, 2021, 10(8): 572-576.
[2] 何涛, 唐冰, 熊敏, 等. 基于SEER数据库成人骨肉瘤肺转移病人的预后分析[J]. 骨科, 2024, 15(6): 534-540.
[3] 李文乐, 胡朝晖, 王永辉, 等. 列线图在骨肿瘤疾病的应用与研究进展[J]. 中国骨与关节杂志, 2021, 10(8): 624-629.
[4] 陈维, 潘德润, 陈卫国. 人工智能在骨肿瘤影像学中的应用研究进展[J]. 临床放射学杂志, 2025, 44(5): 953-956.
[5] 刘璐, 郑新宇. 乳腺癌患者预后影响因素的 Cox 模型分析[J]. 中国卫生统计, 2021, 38(3): 458-461
[6] 丁云. 炎症和营养相关指标与胃癌患者预后的相关性研究[D]: [硕士学位论文]. 恩施: 湖北民族大学, 2022.
[7] Huang, G., Zhang, X., Xu, Y., Chen, S., Cao, Q., Liu, W., et al. (2024) Prognostic and Predictive Value of Super-Enhancer-Derived Signatures for Survival and Lung Metastasis in Osteosarcoma. Journal of Translational Medicine, 22, Article No. 88. [Google Scholar] [CrossRef] [PubMed]
[8] Pereira, H.M., Leite Duarte, M.E., Ribeiro Damasceno, I., de Oliveira Moura Santos, L.A. and Nogueira-Barbosa, M.H. (2021) Machine Learning-Based CT Radiomics Features for the Prediction of Pulmonary Metastasis in Osteosarcoma. The British Journal of Radiology, 94, 20201391. [Google Scholar] [CrossRef] [PubMed]
[9] Zheng, S., Chen, L., Wang, J., Wang, H., Hu, Z., Li, W., et al. (2023) A Clinical Prediction Model for Lung Metastasis Risk in Osteosarcoma: A Multicenter Retrospective Study. Frontiers in Oncology, 13, Article 1001219. [Google Scholar] [CrossRef] [PubMed]
[10] Su, Z., Huang, F., Yin, C., Yu, Y. and Yu, C. (2023) Clinical Model of Pulmonary Metastasis in Patients with Osteosarcoma: A New Multiple Machine Learning-Based Risk Prediction. Journal of Orthopaedic Surgery, 31, Article 10225536231177102. [Google Scholar] [CrossRef] [PubMed]
[11] Zumárraga, J.P., Baptista, A.M., Rosa, L.P.D.L., Caiero, M.T. and Camargo, O.P.D. (2016) Serum Values of Alkaline Phosphatase and Lactate Dehydrogenase in Osteosarcoma. Acta Ortopédica Brasileira, 24, 142-146. [Google Scholar] [CrossRef] [PubMed]
[12] Li, W., Liu, W., Hussain Memon, F., Wang, B., Xu, C., Dong, S., et al. (2022) An External-Validated Prediction Model to Predict Lung Metastasis among Osteosarcoma: A Multicenter Analysis Based on Machine Learning. Computational Intelligence and Neuroscience, 2022, 2220527. [Google Scholar] [CrossRef] [PubMed]