已收录 273081 条政策
 政策提纲
  • 暂无提纲
Simultaneous computational discovery of DNA regulatory motifs and transcription factor binding constraints at high spatial resolution
[摘要] I present three novel computational methods to address the challenge of identifying protein-DNA interactions at high spatial resolution from noisy ChIP-Seq data. I first present the genome positioning system (GPS) algorithm which predicts protein-DNA interaction events from ChIP-Seq data using a single-base resolution generative probabilistic model. Using synthetic and actual ChIP-Seq data, I show that GPS improves the effective spatial resolution and accuracy in resolving proximal binding events when comparing with existing methods. Second, I present the k-mer set motif (KSM) representation and the k-mer motif alignment and clustering (KMAC) method which discovers DNA-binding motifs from ChIP-Seq derived sequences. I demonstrate that the KSM model is more predictive than the widely used position weight matrix model, and that KMAC outperforms other existing motif discovery programs in recovering known motifs from a large collection of human ChIP-Seq experiments. Finally, I present an integrative method, genome wide event finding and motif discovery (GEM), which models ChIP data with explanatory motifs and binding events at high spatial resolution. The GEM model links binding event discovery and motif discovery with positional priors in the context of a generative probabilistic model of ChIP data and genome sequence. I show that GEM further improve upon previous methods for processing ChIP-Seq and ChIP-exo data to yield unsurpassed spatial resolution and discovery of proximal binding events. GEM enables a systematic analysis of in vivo transcription factor binding to discover hundreds of spatial binding constraints between factors in human and mouse cells, including known factor pairs and novel pairs such as c-Fos:c-Jun/USF1, CTCF/Egr1, and HNF4a/FOXA1. I also discovered a complex spatial binding relationship involved 6 key regulatory factors in mouse embryonic stem (ES) cell that is likely to be functional in ES cell gene regulation. Such computational discoveries propose testable models for regulatory factor interactions that will help elucidate genome function and the implementation of combinatorial control.
[发布日期]  [发布机构] Massachusetts Institute of Technology
[效力级别]  [学科分类] 
[关键词]  [时效性] 
   浏览次数:3      统一登录查看全文      激活码登录查看全文