基于归一化欧氏距离和谱分邻的融合网络对癌症亚型聚类
Clustering of Cancer Subtypes Based on Similarity Network Fusion of Normalized Euclidean Distance and Spectral Neighbor
DOI: 10.12677/PM.2019.910137, PDF,    国家自然科学基金支持
作者: 杨 宽:中国海洋大学数学科学学院,山东 青岛;赵亚萍:青岛外事服务职业学校,山东 青岛
关键词: 亚型网络融合归一化欧氏距离邻居谱聚类相似网络Subtype Network Fusion Normalization Euclidean Distance Neighbor Spectral Clustering Similarity Network
摘要: 相似网络融合(SNF)是一种确定癌症亚型的有效聚类方法,可以将病人不同数据类型的相似网络融合成一个相似网络,融合后的相似网络包含病人的所有信息。本文使用归一化的欧氏距离和重新定义的病人的邻居,提出了基于归一化欧氏距离和谱聚类分邻的相似网络融合(NSSNF),减少了相似网络中数据分析时产生的噪声,并增加了不同相似网络的数据间的互补性,最后,针对数据库TCGA中的五种癌症数据,利用NSSNF方法进行数据分析,评价指标DB和CH的结果表明NSSNF方法优于传统SNF方法、NSNF方法和CSNF方法。
Abstract: Similarity network fusion (SNF) is an effective clustering method to identify cancer subtypes. By us-ing SNF, patient similarity networks for each of their data types are integrated into a single similari-ty network, which contains all the information of patients. In this paper, a similarity network fusion based on normalized Euclidean distance and spectral clustering (NSSNF) is proposed by using nor-malized Euclidean distance and redefined neighbor of the patient to reduce the noise of data analy-sis in similarity networks and increase the complementarity between the data from different simi-larity networks. Finally, for the five cancer data types from the TCGA database, the data analysis was performed by the NSSNF method, and the results of the evaluation indexes DB and CH showed that the NSSNF method is superior to the SNF method, NSNF method and CSNF method.
文章引用:杨宽, 赵亚萍. 基于归一化欧氏距离和谱分邻的融合网络对癌症亚型聚类[J]. 理论数学, 2019, 9(10): 1115-1122. https://doi.org/10.12677/PM.2019.910137

参考文献

[1] Wu, D., Wang, D.C., Cheng, Y., et al. (2016) Roles of Tumor Heterogeneity in the Development of Drug Resistance: A Call for Precision Therapy. Seminars in Cancer Biology, 42, 13-19. [Google Scholar] [CrossRef] [PubMed]
[2] 周彩存, 刘桑田. 肺癌的靶向治疗与精准医学[J]. 医学研究生学报, 2017, 30(11): 1132-1139.
[3] Wang, B., Mezlini, A., Demir, F., et al. (2014) Similarity Network Fusion for Aggregating Data Types on a Genomic Scale. Nature Methods, 11, 333-337. [Google Scholar] [CrossRef] [PubMed]
[4] Jiang, X.P. and Hu, X.H. (2014) Inferring Microbial Interaction Net-works Based on Consensus Similarity Network Fusion. College of Computing and Informatics, 57, 1115-1120. [Google Scholar] [CrossRef] [PubMed]
[5] Zhang, Y., Hua, X. and Jiang, X.P. (2017) Multi-View Cluster-ing of Microbiome Samples by Robust Similarity Network Fusion and Spectral Clustering. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 14, 264-271. [Google Scholar] [CrossRef
[6] Zhao, Y.P., Zhao, X.Q., Wang, K.D., et al. (2017) Clustering Cancer Subtypes Based on Neighborhood-Information Embedded Similarity Network Fusion. Periodical of Ocean University of China, A1, 155-160.
[7] 张月, 邹焕新, 邵宁远, 等. 基于相似度网络融合的极化SAR图像地物分类[J]. 系统工程与电子技术, 2018, 40(2): 295-302.
[8] Chen, N. (2019) CI-SNF: Exploiting Contextual Information to Improve SNF Based Information Retrieval. Information Fusion, 52, 175-186. [Google Scholar] [CrossRef
[9] Qin, D.F., Wengert, C., van Gool, L., et al. (2013) Query Adaptive Similarity for Scale Object Retrieval. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Portland, 23-28 June 2013, 1610-1617.
[10] Von Luxburg, U. (2007) A Tutorial on Spectral Clustering. Statistics and Computing, 17, 395-416. [Google Scholar] [CrossRef
[11] Nigro, J.M., et al. (2005) Integrated Array-Comparative Genomic Hybridization and Expression Array Profiles Identify Clinically Relevant Molecular Subtypes of Glioblastoma. Cancer Research, 65, 1678-1686. [Google Scholar] [CrossRef
[12] Davies, D.L. and Bouldin, D.W. (1979) A Cluster Separation Measure. IEEE Transactions on Pattern Analysis Machine Intelligence, 1, 224. [Google Scholar] [CrossRef
[13] Caliski, T. and Harabasz, J. (1974) A Dendrite Method for Cluster Analysis. Communications in Statistics, 3, 1-27. [Google Scholar] [CrossRef