Fraction of proteins encoded by bacterial and archaeal genomes that can be placed into discrete Clusters of Orthologous Groups of proteins (COGs)

Range 55 % - 83 %
Organism prokaryote
Reference Peterson SN, Fraser CM. The complexity of simplicity. Genome Biol. 2001 2(2):COMMENT2002 p.7 left column top paragraphPubMed ID11182883
Primary Source [19] Koonin EV. How many genes can make a cell - the minimal gene set concept. Annu Rev Genomics Hum Genet. 2000 1 :99-116 DOI: 10.1146/annurev.genom.1.1.99PubMed ID11701626
Comments P.6 right column bottom paragraph: "The added sensitivity derived from the COGs analysis arises because COGs are not dependent on BLAST cutoff scores and arbitrary sequence-relatedness criteria: genes with relatively unimpressive sequence identity can be placed into an orthology group. Some interesting facts arose from this analysis [primary source]. First, 55-83% of proteins encoded by bacterial and archaeal genomes can be placed into discrete COGs, suggesting that many genes present in bacteria and archaea are highly conserved." Primary source p.103 bottom paragraph: "The current collection of COGs shows two striking and, in a sense, opposing trends that are relevant for the discussion of the minimal-gene-set concept (35: link ). First, it is notable that 55%–83% of the proteins encoded in each of the bacterial and archaeal genomes belong to the COGs, which, it should be emphasized, by definition include representatives of at least three phylogenetically distant clades (Table 1)."
Entered by Uri M
ID 116960