Genome size of different strains and pan-genome size (total number of genes)

Range genome size 4.5Mbp - >5.5Mbp: number of genes >15,000 unique proteins
Organism Bacteria Escherichia coli
Reference Monk JM et al., Multi-omics Quantification of Species Variation of Escherichia coli Links Molecular Features with Strain Phenotypes. Cell Syst. 2016 Sep 28 3(3):238-251.e12. doi: 10.1016/j.cels.2016.08.013. p.238 left column bottom paragraph & right column top paragraphPubMed ID27667363
Primary Source Lukjancenko O, Wassenaar TM, Ussery DW. Comparison of 61 sequenced Escherichia coli genomes. Microb Ecol. 2010 Nov60(4):708-20. doi: 10.1007/s00248-010-9717-3. AND Gordienko EN, Kazanov MD, Gelfand MS. Evolution of pan-genomes of Escherichia coli, Shigella spp., and Salmonella enterica. J Bacteriol. 2013 Jun195(12):2786-92. doi: 10.1128/JB.02285-12.PubMed ID20623278, 23585535
Comments P.238 left column bottom paragraph & right column top paragraph: "Escherichia coli are widely used as a model prokaryote for physiology studies. Some strains are important pathogens and others are key host strains for metabolic engineering and synthetic biology. This diversity in lifestyle and application reflects the high level of genetic diversity within the species. Thanks to the genomics revolution in microbiology that has enabled sequencing of diverse strains for any species, it is now known that the genomes of different strains of E. coli range in size from 4.5 to over 5.5 Mbp, and the species has a pan-genome composed of more than 15,000 unique proteins (primary sources). Part of this large pan-genome consists of unique metabolic capabilities that have been shown to have important implications for infectious disease studies and pathogenic niches (Monk et al., 2013, Baumler et al., 2011 and Vieira et al., 2011). This metabolic diversity is likely to be equally impactful on synthetic biology applications (Lee and Kim, 2015). The massive genomic diversity of the E. coli species provides a deep pool of strains to use for basic research and for metabolic engineering and synthetic biology applications. It also raises an important question: what range of phenotypic behaviors exist and how can these be leveraged to further exploit E. coli as a model organism and host strain?"
Entered by Uri M
ID 113214