Genome-wide analysis of transcription factor binding sites based on ChIP-Seq data

Academic Article

Abstract

  • Molecular interactions between protein complexes and DNA mediate essential gene-regulatory functions. Uncovering such interactions by chromatin immunoprecipitation coupled with massively parallel sequencing (ChIP-Seq) has recently become the focus of intense interest. We here introduce quantitative enrichment of sequence tags (QuEST), a powerful statistical framework based on the kernel density estimation approach, which uses ChIP-Seq data to determine positions where protein complexes contact DNA. Using QuEST, we discovered several thousand binding sites for the human transcription factors SRF, GABP and NRSF at an average resolution of about 20 base pairs. MEME motif-discovery tool-based analyses of the QuEST-identified sequences revealed DNA binding by cofactors of SRF, providing evidence that cofactor binding specificity can be obtained from ChIP-Seq data. By combining QuEST analyses with Gene Ontology (GO) annotations and expression data, we illustrate how general functions of transcription factors can be inferred.
  • Published In

  • PLoS Medicine  Journal
  • Digital Object Identifier (doi)

    Pubmed Id

  • 23625490
  • Author List

  • Valouev A; Johnson DS; Sundquist A; Medina C; Anton E; Batzoglou S; Myers RM; Sidow A
  • Start Page

  • 829
  • End Page

  • 834
  • Volume

  • 5
  • Issue

  • 9