Integration of Cot Analysis, DNA Cloning, and High-Throughput Sequencing Facilitates Genome Characterization and Gene Discovery

  1. Daniel G. Peterson1,4,
  2. Stefan R. Schulze1,3,
  3. Erica B. Sciara1,3,
  4. Scott A. Lee1,
  5. John E. Bowers1,
  6. Alexander Nagel2,
  7. Ning Jiang2,
  8. Deanne C. Tibbitts1,
  9. Susan R. Wessler2, and
  10. Andrew H. Paterson1,2
  1. 1Center for Applied Genetic Technologies and Department of Crop and Soil Sciences, University of Georgia, Athens, Georgia 30602, USA; 2Department of Botany and Department of Genetics, University of Georgia, Athens, Georgia 30602, USA

Abstract

Cot-based sequence discovery represents a powerful means by which both low-copy and repetitive sequences can be selectively and efficiently fractionated, cloned, and characterized. Based upon the results of a Cot analysis, hydroxyapatite chromatography was used to fractionate sorghum (Sorghum bicolor) genomic DNA into highly repetitive (HR), moderately repetitive (MR), and single/low-copy (SL) sequence components that were consequently cloned to produce HRCot, MRCot, and SLCot genomic libraries. Filter hybridization (blotting) and sequence analysis both show that the HRCot library is enriched in sequences traditionally found in high-copy number (e.g., retroelements, rDNA, centromeric repeats), the SLCot library is enriched in low-copy sequences (e.g., genes and “nonrepetitive ESTs”), and the MRCot library contains sequences of moderate redundancy. The Cot analysis suggests that the sorghum genome is approximately 700 Mb (in agreement with previous estimates) and that HR, MR, and SL components comprise 15%, 41%, and 24% of sorghum DNA, respectively. Unlike previously described techniques to sequence the low-copy components of genomes, sequencing of Cot components is independent of expression and methylation patterns that vary widely among DNA elements, developmental stages, and taxa. High-throughput sequencing of Cot clones may be a means of “capturing” the sequence complexity of eukaryotic genomes at unprecedented efficiency.

[Online supplementary material is available at www.genome.org. The sequence data described in this paper have been submitted to the GenBank under accession nos.AZ921847-AZ923007. Reagents, samples, and unpublished information freely provided by H. Ma and J. Messing.]

Footnotes

  • 3  These authors contributed equally to the research. They are listed in alphabetical order.

  • 4  Corresponding author.

  • E-MAIL dgp{at}arches.uga.edu; FAX (706) 583-0160.

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.226102. Article published online before print in April 2002.

    • Received November 28, 2001.
    • Accepted March 6, 2002.
| Table of Contents

Preprint Server