基于序列关联的蛋白质亚细胞定位识别
Recognition of Subcellular Localization of Proteins Using of Sequences Fusion
摘要: 对未知蛋白的功能注释是蛋白质组学的主要目标。一个关键的注释是蛋白质亚细胞定位的预测。应用基于序列关联的二次判别分析方法进行蛋白质亚细胞定位预测,对4个植物定位类型进行5-fold交叉检验。
Abstract:
Functional annotation of unknown proteins is a major goal in proteomics. A key annotation is the prediction of a protein’s subcellular localization. We used the method of Increment of Diversity with Quad-ratic Discriminant analysis (IDQD) to predict subcellular localization of proteins which are recognized by the four plant categories and three non-plant and obtained accuracy 87.4(±0.5)% and 91.2(±0.2)%, respectively in 5-fold cross-validation test. Our result is better than comparable existing methods.
参考文献
|
[1]
|
K. Nakai. Protein sorting signals and prediction of subcellular localization. Advances in Protein Chemistry, 2000, 54: 277-344.
|
|
[2]
|
K. C. Chou, H. B. Shen. Review: Recent progresses in protein subcellular location prediction. Analytical Biochemistry, 2007, 370(1): 1-16.
|
|
[3]
|
贾芸, 赵巨东, 吕军. 基于N端信号的蛋白质亚细胞定位识别[J]. 内蒙古工业大学学报(自然门科学版), 2008, 27(2): 81-87.
|
|
[4]
|
O. Emanuelsson, H. Niclsen, S. Brunak, et al. Predicting subcellular localization of proteins using amino acid terminal amino acid sequence. Journal of Molecular Biology, 2000, 300(4): 1005-1016.
|
|
[5]
|
罗辽复. 生命进化的物理观[M]. 上海: 上海科学技术出版社, 2000: 169-189.
|
|
[6]
|
R. R. Laxton. The measure of diversity. Journal of theoretical biology, 1978, 70(1): 51-67.
|
|
[7]
|
徐克学. 生物数学[M]. 北京: 科学出版社, 1999: 278-286.
|
|
[8]
|
L. R. Zhang, L. F. Luo. Splice site prediction with quadratic discriminant analysis using diversity measure. Nucleic Acids Research, 2003, 31(21): 6214-6220.
|
|
[9]
|
吕军, 罗辽复. 人类PolII启动子的识别[J]. 生物化学与生物物理进展, 2005, 32: 1185-1191.
|
|
[10]
|
边肇祺, 张学工等. 模式识别[M]. 北京: 清华大学出版社, 2004: 9-42.
|