基于秃鹰搜索的抗乳腺癌候选药物优化建模
Optimization Modeling of Anti-Breast Cancer Candidate Drugs Based on Bald Eagle Search
摘要: 乳腺癌在全球范围内已取代肺癌成为最常见的癌症,并且其死亡率居高不下。因此,利用机器学习和智能优化算法等技术筛选乳腺癌药物对于推动乳腺癌治疗药物的发展至关重要。本文提出了一种基于改进的随机森林算法构建ERa活性预测模型的方法,并筛选出对生物活性最具影响力的前20个分子描述符。然后,使用该模型对50个化合物的IC50值和对应的pIC50值进行预测。同时,借助支持向量机(SVM)和Adaboost二分类模型,对化合物Caco-2、CYP3A4、hERG、HOB、MN的5种成分进行分别预测,并建立ADMET分类预测模型。最后,利用秃鹰搜索算法构建化合物筛选模型,使用黑鹰搜索算法融合前两个模型,解决各类复杂数值优化问题,以找到可行性药物操作变量范围。实验结果表明,所提出的预测模型具有很高的准确性,可应用于抗乳腺癌药物的研发。
Abstract: Breast cancer has replaced lung cancer as the most common cancer worldwide, and its mortality rate remains high. Therefore, the selection of breast cancer drugs using techniques such as machine learning and intelligent optimization algorithms is of great significance to drive the development of breast cancer treatment drugs. In this paper, we propose a method based on the improved random forest algorithm to construct an ERa activity prediction model and select the top 20 most influential molecular descriptors for biological activity. Subsequently, using this model, we predict the IC50 values and corresponding pIC50 values of 50 compounds. Furthermore, with the aid of support vector machine (SVM) and Adaboost binary classification models, we predict the five components (Caco-2, CYP3A4, hERG, HOB, MN) of the compounds separately and establish an ADMET classifica-tion prediction model. Finally, we construct a compound screening model using the Bald Eagle search algorithm and integrate it with the previous two models using the Black Hawk search algo-rithm to address various complex numerical optimization problems and determine the feasible range of drug operating variables. Experimental results demonstrate that the proposed prediction model exhibits high accuracy and can be applied to the development of anti- breast cancer drugs.
文章引用:龙楷潮, 袁学枫, 张利. 基于秃鹰搜索的抗乳腺癌候选药物优化建模[J]. 建模与仿真, 2023, 12(4): 3930-3942. https://doi.org/10.12677/MOS.2023.124359

参考文献

[1] Wang, B.J., Shen, Y.H., Liu, T.Y. and Li, T. (2021) ERα Promotes Transcription of Tumor Suppressor Gene ApoA-I by Estab-lishing H3K27ac-Enriched Chromatin Microenvironment in Breast Cancer Cells. Journal of Zhejiang University- Science B, 22, 1034-1044. [Google Scholar] [CrossRef
[2] Khamouli, S., et al. (2022) QSAR Modeling, Molecular Docking, ADMET Prediction and Molecular Dynamics Simulations of Some 6-Arylquinazolin-4-Amine Derivatives as DYRK1A Inhib-itors. Journal of Molecular Structure, 1258, Article ID: 132659. [Google Scholar] [CrossRef
[3] 顾耀文, 张博文, 郑思, 杨丰春, 李姣. 基于图注意力网络的药物ADMET分类预测模型构建方法[J]. 数据分析与知识发现, 2021, 5(8): 76-85.
[4] 俞青芬. 人工神经网络在吡喃酮类化合物生物活性预测中的应用[J]. 江汉大学学报(自然科学版), 2017, 45(5): 418-423. [Google Scholar] [CrossRef
[5] 王玉成, 冯志宏, 赵娜娜, 汪鸣明, 叶晓东. 基于RegNet-1d模型和积分梯度法的ERα拮抗剂的生物活性预测方法[P]. 中国专利, CN114121177A. 2022-03-01.
[6] 沈杰. 药物ADMET理论预测方法开发和靶向雌激素受体的药物设计研究[D]: [博士学位论文]. 上海: 华东理工大学, 2011.
[7] 于娜. 常用的特征筛选方法研究[J]. 科技资讯, 2020, 18(36): 231-233.
[8] 李颜平, 吴刚. 基于典型数据集的数据预处理方法对比分析[J]. 沈阳工业大学学报, 2022, 44(2): 185-192.
[9] Hu, L., Gao, L.B., Li, Y.H., Zhang, P. and Gao, W.F. (2022) Feature-Specific Mutual Information Variation for Multi-Label Feature Selection. Information Sciences, 593, 449-471. [Google Scholar] [CrossRef
[10] 王璐, 孙聚波. Lasso回归方法在特征变量选择中的应用[J]. 吉林工程技术师范学院学报, 2021, 37(12): 109-112.
[11] Naila, S., et al. (2020) A Rapid Recognition Meth-od for Rice False Smut Based on HOG Features and SVM Classification. Journal of Physics: Conference Series, 1576, Article ID: 012018. [Google Scholar] [CrossRef
[12] Li, W. and Jiao, G. (2020) Prediction of Poor Students’ Classifi-cation Based on Adaboost Algorithm Integrated Learning Model. Journal of Physics Conference Series, 1574, Article ID: 012172. [Google Scholar] [CrossRef
[13] 贾鹤鸣, 姜子超, 李瑶. 基于改进秃鹰搜索算法的同步优化特征选择[J/OL]. 控制与决策: 1-9. 2021-10-18.[CrossRef
[14] Alsattar, H.A., Zaidan, A.A. and Zaidan, B.B. (2020) Novel Meta-Heuristic Bald Eagle Search Optimisation Algorithm. Artificial Intelligence Review, 53, 2237-2264. [Google Scholar] [CrossRef
[15] Shar, P.A., Tao, W.Y., Gao, S., et al. (2016) Pred-Binding: Large-Scale Protein-Ligand Binding Affinity Prediction. Journal of Enzyme Inhibition and Medicinal Chemistry, 31, 1443-1450. [Google Scholar] [CrossRef] [PubMed]