标题:
编码和非编码DNA序列的可视化分析The Visual Analysis of Coding and Non-Coding DNA Sequences
作者:
刘玉倩, 郑智捷
关键字:
非编码序列, 图形表示方法, 概率测量Non-Coding Sequences, Graphic Representation Technique, Probability Measurements
期刊名称:
《Hans Journal of Computational Biology》, Vol.4 No.2, 2014-06-12
摘要:
DNA序列作为一种复杂的遗传信息,其具体特性不仅体现在编码序列之中,也包含在非编码序列之中。在高等生物体中主要基因成分为非编码序列,在ENCODE计划中,有证据表明,在人类基因中有98%为非编码形式,其中80%具有功能性,所以对编码区和非编码区的研究已经成为一类重要研究热点。本文提供的模型和实验结果,使用图形表示方法对编码区以及非编码区基因的差异进行区分。该模型采用的是对编码区以及非编码区的DNA序列进行分段概率测量,从而对不同的基因特征分布进行比较。
DNA sequences include complex genetic information; their specific characteristics are contained in both the coding and non-coding sequences. Major gene components in higher levels of organisms are composed of non-coding sequences. In ENCODE project, there are evidences that 98% of the human genomes are non-coding forms and 80% of them with functions, so the research on coding region and non-coding region has become an important research hotspot. This paper provides models and experiment results which using visual representation techniques to distinguish differences between coding and non-coding sequences. This model uses probability measurements on the DNA sequences to coding and non-coding regions respectively to distinguish patterns identified from different sequences.