Discovery of functional elements in 12 Drosophila genomes using evolutionary signatures

Nature. 2007 Nov 8;450(7167):219-32. doi: 10.1038/nature06340.

Abstract

Sequencing of multiple related species followed by comparative genomics analysis constitutes a powerful approach for the systematic understanding of any genome. Here, we use the genomes of 12 Drosophila species for the de novo discovery of functional elements in the fly. Each type of functional element shows characteristic patterns of change, or 'evolutionary signatures', dictated by its precise selective constraints. Such signatures enable recognition of new protein-coding genes and exons, spurious and incorrect gene annotations, and numerous unusual gene structures, including abundant stop-codon readthrough. Similarly, we predict non-protein-coding RNA genes and structures, and new microRNA (miRNA) genes. We provide evidence of miRNA processing and functionality from both hairpin arms and both DNA strands. We identify several classes of pre- and post-transcriptional regulatory motifs, and predict individual motif instances with high confidence. We also study how discovery power scales with the divergence and number of species compared, and we provide general guidelines for comparative studies.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Animals
  • Base Sequence
  • Binding Sites
  • Conserved Sequence
  • Drosophila / classification*
  • Drosophila / genetics*
  • Drosophila Proteins / genetics
  • Evolution, Molecular*
  • Exons / genetics
  • Gene Expression Regulation / genetics
  • Genes, Insect / genetics
  • Genome, Insect / genetics*
  • Genomics*
  • MicroRNAs / genetics
  • Molecular Sequence Data
  • Organ Specificity
  • Phylogeny
  • Untranslated Regions / genetics

Substances

  • Drosophila Proteins
  • MicroRNAs
  • Untranslated Regions