Protein Coding Sequence Identification by Simultaneously Characterizing the Periodic and Random Features of DNA Sequences
[摘要] Most codon indices used today are based on highly biased nonrandom usage of codons in coding regions. The background of a coding or noncoding DNA sequence, however, is fairly random, and can be characterized as a random fractal. When a gene-finding algorithm incorporates multiple sources of informationabout coding regions, it becomes more successful. It is thus highly desirable to develop new and efficient codon indices by simultaneously characterizing the fractal and periodic features of a DNA sequence. In this paper, we describe a novel way of achieving this goal. The efficiency of the new codon index is evaluated by studying all of the 16 yeast chromosomes. In particular, we show that the method automatically and correctly identifies which of the threereading frames is the one that contains a gene.
[发布日期] [发布机构]
[效力级别] [学科分类] 基础医学
[关键词] [时效性]