Reconstructing large regions of an ancestral mammalian genome in silico

  1. Mathieu Blanchette1,4,5,
  2. Eric D. Green2,
  3. Webb Miller3, and
  4. David Haussler1,5
  1. 1 Howard Hughes Medical Institute, University of California, Santa Cruz, California 95064, USA
  2. 2 National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
  3. 3 Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA

Abstract

It is believed that most modern mammalian lineages arose from a series of rapid speciation events near the Cretaceous-Tertiary boundary. It is shown that such a phylogeny makes the common ancestral genome sequence an ideal target for reconstruction. Simulations suggest that with methods currently available, we can expect to get 98% of the bases correct in reconstructing megabase-scale euchromatic regions of an eutherian ancestral genome from the genomes of ∼20 optimally chosen modern mammals. Using actual genomic sequences from 19 extant mammals, we reconstruct 1.1 Mb of ancient genome sequence around the CFTR locus. Detailed examination suggests the reconstruction is accurate and that it allows us to identify features in modern species, such as remnants of ancient transposon insertions, that were not identified by direct analysis. Tracing the predicted evolutionary history of the bases in the reconstructed region, estimates are made of the amount of DNA turnover due to insertion, deletion, and substitution in the different placental mammalian lineages since the common eutherian ancestor, showing considerable variation between lineages. In coming years, such reconstructions may help in identifying and understanding the genetic features common to eutherian mammals and may shed light on the evolution of human or primate-specific traits.

Footnotes

  • [Supplemental material is available online at www.genome.org and http://genome.ucsc.edu/ancestors.]

  • Article and publication are at http://www.genome.org/cgi/doi/10.1101/gr.2800104.

  • 4 Present address: McGill University, Montreal, Quebec H3A 2B4 Canada.

  • 5 Corresponding authors. E-mail haussler{at}soe.ucsc.edu; fax (831) 459-4829. E-mail blanchem{at}mcb.mcgill.ca; fax (514) 398-3387.

    • Accepted October 5, 2004.
    • Received May 18, 2004.
| Table of Contents

Preprint Server