基于序列关联的蛋白质亚细胞定位识别
Recognition of Subcellular Localization of Proteins Using of Sequences Fusion
DOI: 10.12677/hjcb.2011.11001, PDF, HTML, 下载: 3,517  浏览: 13,380  科研立项经费支持
作者: 贾芸
关键词: 亚细胞定位F值二次判别分析
Subcellular Localization; F-Value; Quadratic Discriminant Analysis
摘要: 对未知蛋白的功能注释是蛋白质组学的主要目标。一个关键的注释是蛋白质亚细胞定位的预测。应用基于序列关联的二次判别分析方法进行蛋白质亚细胞定位预测,对4个植物定位类型进行5-fold交叉检验。
Abstract: Functional annotation of unknown proteins is a major goal in proteomics. A key annotation is the prediction of a protein’s subcellular localization. We used the method of Increment of Diversity with Quad-ratic Discriminant analysis (IDQD) to predict subcellular localization of proteins which are recognized by the four plant categories and three non-plant and obtained accuracy 87.4(±0.5)% and 91.2(±0.2)%, respectively in 5-fold cross-validation test. Our result is better than comparable existing methods.
文章引用:贾芸. 基于序列关联的蛋白质亚细胞定位识别[J]. 计算生物学, 2011, 1(1): 1-3. http://dx.doi.org/10.12677/hjcb.2011.11001

参考文献

[1] K. Nakai. Protein sorting signals and prediction of subcellular localization. Advances in Protein Chemistry, 2000, 54: 277-344.
[2] K. C. Chou, H. B. Shen. Review: Recent progresses in protein subcellular location prediction. Analytical Biochemistry, 2007, 370(1): 1-16.
[3] 贾芸, 赵巨东, 吕军. 基于N端信号的蛋白质亚细胞定位识别[J]. 内蒙古工业大学学报(自然门科学版), 2008, 27(2): 81-87.
[4] O. Emanuelsson, H. Niclsen, S. Brunak, et al. Predicting subcellular localization of proteins using amino acid terminal amino acid sequence. Journal of Molecular Biology, 2000, 300(4): 1005-1016.
[5] 罗辽复. 生命进化的物理观[M]. 上海: 上海科学技术出版社, 2000: 169-189.
[6] R. R. Laxton. The measure of diversity. Journal of theoretical biology, 1978, 70(1): 51-67.
[7] 徐克学. 生物数学[M]. 北京: 科学出版社, 1999: 278-286.
[8] L. R. Zhang, L. F. Luo. Splice site prediction with quadratic discriminant analysis using diversity measure. Nucleic Acids Research, 2003, 31(21): 6214-6220.
[9] 吕军, 罗辽复. 人类PolII启动子的识别[J]. 生物化学与生物物理进展, 2005, 32: 1185-1191.
[10] 边肇祺, 张学工等. 模式识别[M]. 北京: 清华大学出版社, 2004: 9-42.