Identification of genes within CpG-enriched DNA from human chromosome 4p16.3

Academic Article


  • We combined the isolation of gene-enriched genomic DNA with gene prediction by computer to search for genes in a cosmid contig covering one million base pairs in the Huntington disease region on chromosome 4. Our aim was to develop a simple, robust strategy to identify genes adjacent to CpG Islands without first characterizing undermethylated regions with multiple rare-cutter restriction enzyme sites. We cloned DNA adjacent to the rare-cutter restriction enzyme sites Eagl and Sacll, which are predicted to cut more frequently within CpG Islands and relied solely on minimal sequence analysis to determine the likely coding potential of the DNA next to these sites. Our results indicated that isolating fragments with a single rare-cutter restriction enzyme site was sufficient to provide a high likelihood of identifying genes. Of the 42 CpG-selected clones analyzed, we determined that 17 contained exons as determined by sequence identity to known genes in this region, sequence identity to gene fragments isolated by direct cDNA selection in our laboratory, and/or their ability to detect transcripts on Northern blots. Analysis of the sequences with the BLAST and GRAIL programs provided additional independent evidence that 15 of these 17 clones contain coding sequences and that nine other clones are likely to contain sequences coding for portions of new genes. By mapping these clones to an EcoRI restriction map of the region, we determined a detailed localization for each of the exons and estimate that there are a minimum of seven genes that contain CpG-rich DNA between D4S126 and D4S181. © 1994 Oxford University Press.
  • Published In

    Digital Object Identifier (doi)

    Author List

  • John RM; Robbins CA; Myers RM
  • Start Page

  • 1611
  • End Page

  • 1616
  • Volume

  • 3
  • Issue

  • 9