选择性共形预测在乳腺癌高风险人群风险评估中的方法研究
Selective Conformal Prediction for Risk Assessment in High-Risk Breast Cancer Subgroups
摘要: 在医学风险预测中,为个体化预测结果提供可靠的不确定性量化,是评估模型临床实用性的关键环节。本文聚焦于选择性共形预测方法在乳腺癌高风险人群风险评估中的应用,旨在通过构建覆盖率可控的预测区间,实现对风险模型预测性能的稳健评估,并提升其在高风险人群中的可解释性。我们首先系统回顾了分裂共形预测的基本原理,进一步引入选择性共形预测方法,在保证理论覆盖率的同时,实现高风险个体的有效识别与重点覆盖,并针对该人群给出最大置信度与预测可信度等单指标不确定性度量,为临床干预提供参考信息。结合基于临床指标的风险预测模型,本文利用威斯康星州乳腺癌数据开展实证分析。结果表明,所提出的方法在高风险个体的预测准确率与不确定性控制方面均优于分裂共形预测,具有良好的实际应用潜力。本研究为医学诊断场景下的不确定性建模提供了新的统计工具,并为高风险人群的精准防控提供了方法上的支持。
Abstract: In biomedical research, reliable quantification of predictive uncertainty is a key criterion for evaluating the clinical utility of risk prediction models. This study focuses on the application of selective conformal prediction to risk assessment in high-risk breast cancer populations. The objective is to construct prediction sets with controllable coverage, enabling robust evaluation of model performance and enhancing interpretability for high-risk subgroups. We first review the split conformal prediction method and then introduce a selective conformal prediction framework that achieves both theoretical coverage guarantees and effective identification with prioritized coverage of high-risk individuals. For these individuals, we provide uncertainty measures such as maximum confidence and credibility scores to inform targeted clinical interventions. Using a risk prediction model incorporating clinical indicators, we conduct real data analysis based on the Wisconsin Breast Cancer dataset. Results demonstrate that the proposed method outperforms standard split conformal prediction in terms of predictive accuracy and uncertainty control for high-risk cases, showing strong potential for practical applications. This work offers a novel statistical tool for uncertainty modeling in medical diagnostics and provides methodological support for precision prevention and control in high-risk populations.
文章引用:王子雯, 彭诗淇, 曹雅琦. 选择性共形预测在乳腺癌高风险人群风险评估中的方法研究[J]. 统计学与应用, 2025, 14(9): 216-227. https://doi.org/10.12677/sa.2025.149270

参考文献

[1] 金璋, 王学钦. 共型预测方法的发展和应用[J]. 中国科学: 数学, 2024, 54(12): 2121-2140.
[2] Jung, S., Park, K. and Kim, B. (2021) Clustering on the Torus by Conformal Prediction. The Annals of Applied Statistics, 15, 1583-1600. [Google Scholar] [CrossRef
[3] Angelopoulos, A.N., Bates, S., Fisch, A., Lei, L. and Schuster, T. (2022) Conformal Risk Control. arXiv, 2208.02814.
[4] Barber, R.F., Candès, E.J., Ramdas, A. and Tibshirani, R.J. (2023) Conformal Prediction Beyond Exchangeability. The Annals of Statistics, 51, 816-845. [Google Scholar] [CrossRef
[5] Hu, X. and Lei, J. (2024) A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum. Journal of the American Statistical Association, 119, 1136-1154. [Google Scholar] [CrossRef
[6] Zaffran, M., Dieuleveut, A., Josse, J. and Romano, Y. (2023) Conformal Prediction with Missing Values. arXiv, 2305.19034.
[7] Yang, Y. and Kuchibhotla, A.K. (2024) Selection and Aggregation of Conformal Prediction Sets. Journal of the American Statistical Association, 120, 435-447. [Google Scholar] [CrossRef
[8] Zhu, K., Yang, S. and Wang, X. (2024) Enhancing Statistical Validity and Power in Hybrid Controlled Trials: A Randomization Inference Approach with Conformal Selective Borrowing. arXiv, 2410.11713.
[9] Feng, J., Sondhi, A., Perry, J. and Simon, N. (2021) Selective Prediction-Set Models with Coverage Rate Guarantees. Computing Research Repository, 79, 811-825. [Google Scholar] [CrossRef] [PubMed]
[10] Bao, Y., Huo, Y., Ren, H. and Zou, C. (2024) Selective Conformal Inference with False Coverage-Statement Rate Control. Biometrika, 111, 727-742. [Google Scholar] [CrossRef
[11] Lei, J., G’Sell, M., Rinaldo, A., Tibshirani, R.J. and Wasserman, L. (2018) Distribution-Free Predictive Inference for Regression. Journal of the American Statistical Association, 113, 1094-1111. [Google Scholar] [CrossRef
[12] Wolberg, W.H., Street, W.N. and Mangasarian, O.L. (1995) Breast Cancer Wisconsin (Diagnostic) Dataset. University of Wisconsin-Madison.
[13] Basu, A., Bose, S. and Purkayastha, S. (2004) Robust Discriminant Analysis Using Weighted Likelihood Estimators. Journal of Statistical Computation and Simulation, 74, 445-460. [Google Scholar] [CrossRef
[14] Barrett, J.E. (2017) Information‐Adaptive Clinical Trials with Selective Recruitment and Binary Outcomes. Statistics in Medicine, 36, 2803-2813. [Google Scholar] [CrossRef] [PubMed]
[15] 夏露竹, 曾文君, 曹雅琦. 共形预测框架下的预测区间估计方法及乳腺癌应用研究[J]. 应用统计与数据科学, 2025, 1(3): 90-95.