已收录 268921 条政策
 政策提纲
  • 暂无提纲
Statistical extraction of Drosophila cis-regulatory modules using exhaustive assessment of local word frequency
[摘要]

Background

Transcription regulatory regions in higher eukaryotes are often represented by cis-regulatory modules (CRM) and are responsible for the formation of specific spatial and temporal gene expression patterns. These extended, ~1 KB, regions are found far from coding sequences and cannot be extracted from genome on the basis of their relative position to the coding regions.

Results

To explore the feasibility of CRM extraction from a genome, we generated an original training set, containing annotated sequence data for most of the known developmental CRMs from Drosophila. Based on this set of experimental data, we developed a strategy for statistical extraction of cis-regulatory modules from the genome, using exhaustive analysis of local word frequency (LWF). To assess the performance of our analysis, we measured the correlation between predictions generated by the LWF algorithm and the distribution of conserved non-coding regions in a number of Drosophila developmental genes.

Conclusions

In most of the cases tested, we observed high correlation (up to 0.6–0.8, measured on the entire gene locus) between the two independent techniques. We discuss computational strategies available for extraction of Drosophila CRMs and possible extensions of these methods.

[发布日期] 2003-12-22 [发布机构] 
[效力级别]  [学科分类] 
[关键词]  [时效性] 
   浏览次数:2      统一登录查看全文      激活码登录查看全文