A comprehensive classification and evolutionary analysis of plant homeobox genes

Mol Biol Evol. 2009 Dec;26(12):2775-94. doi: 10.1093/molbev/msp201. Epub 2009 Sep 4.

Abstract

The full complement of homeobox transcription factor sequences, including genes and pseudogenes, was determined from the analysis of 10 complete genomes from flowering plants, moss, Selaginella, unicellular green algae, and red algae. Our exhaustive genome-wide searches resulted in the discovery in each class of a greater number of homeobox genes than previously reported. All homeobox genes can be unambiguously classified by sequence evolutionary analysis into 14 distinct classes also characterized by conserved intron-exon structure and by unique codomain architectures. We identified many new genes belonging to previously defined classes (HD-ZIP I to IV, BEL, KNOX, PLINC, WOX). Other newly identified genes allowed us to characterize PHD, DDT, NDX, and LD genes as members of four new evolutionary classes and to define two additional classes, which we named SAWADEE and PINTOX. Our comprehensive analysis allowed us to identify several newly characterized conserved motifs, including novel zinc finger motifs in SAWADEE and DDT. Members of the BEL and KNOX classes were found in Chlorobionta (green plants) and in Rhodophyta. We found representatives of the DDT, WOX, and PINTOX classes only in green plants, including unicellular green algae, moss, and vascular plants. All 14 homeobox gene classes were represented in flowering plants, Selaginella, and moss, suggesting that they had already differentiated in the last common ancestor of moss and vascular plants.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Motifs / genetics
  • Amino Acid Sequence
  • Evolution, Molecular*
  • Genes, Homeobox / genetics*
  • Genes, Plant / genetics*
  • Homeodomain Proteins / chemistry
  • Homeodomain Proteins / classification*
  • Homeodomain Proteins / genetics*
  • Introns / genetics
  • Leucine Zippers / genetics
  • Likelihood Functions
  • Models, Genetic
  • Molecular Sequence Data
  • Phylogeny
  • Plants / genetics*
  • Protein Structure, Tertiary
  • Sequence Alignment
  • Zinc Fingers / genetics

Substances

  • Homeodomain Proteins