||Blattner FR, Plunkett G 3rd, Bloch CA, Perna NT, Burland V, Riley M, Collado-Vides J, Glasner JD, Rode CK, Mayhew GF, Gregor J, Davis NW, Kirkpatrick HA, Goeden MA, Rose DJ, Mau B, Shao Y. The complete genome sequence of Escherichia coli K-12. Science. 1997 Sep 5 277(5331):1453-62. p.1458 right column 2nd paragraphPubMed ID9278503
||Abstract: "The 4,639,221-base pair sequence of Escherichia coli K-12 is presented." P.1458 right column 2nd paragraph: "Of the 4288 ORFs annotated in the sequence, 1853 are previously described genes. (A complete listing of E. coli ORFs is available at link and is likely to change as functional data accumulate.) The distribution of start codons is as follows: ATG, 3542 GTG, 612 and TTG, 130. There is also one ATT and possibly a CTG (ref 44)." ATG fraction: 3542/4288=82.6%. GTG fraction: 612/4288=14.3%. TTG fraction: 130/4288=3%.