Original estimate of genome size

Value 1.25e+8 bp
Organism Thale cress Arabidopsis thaliana
Reference Arabidopsis Genome Initiative. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000 Dec 14;408(6814):796-815. p.797 left column, top paragraph. PubMed ID 11130711
Method P.796 right column 2nd paragraph: "[Researchers] used large-insert bacterial artificial chromosome (BAC), phage (P1) and transformation-competent artificial chromosome (TAC) libraries (refs 9-12) as the primary substrates for sequencing. Early stages of genome sequencing used 79 cosmid clones. Physical maps of the genome of accession Columbia were assembled by restriction fragment 'fingerprint' analysis of BAC clones (ref 13), by hybridization (ref 14) or polymerase chain reaction (PCR) (ref 15) of sequence-tagged sites and by hybridization and Southern blotting (ref 16)."
Comments P.797 left column, top paragraph: "Estimates of the unsequenced centromeric and rDNA repeat regions measure roughly 10 megabases (Mb), yielding a genome size of about 125Mb, in the range of the 50-150Mb haploid content estimated by different methods (ref 18-Meyerowitz, E. M. & Somerville, C. R. Arabidopsis (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1994)). In general, features such as gene density, expression levels and repeat distribution are very consistent across the five chromosomes (Fig. 1), and these are described in detail in reports on individual chromosomes (refs 4-8) and in the analysis of centromere, telomere and rDNA sequences." See BNID 100400 for estimate of 157Mb according to flow cytometry.
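The arithmetic behind the quoted estimate can be sketched as follows. This is only an illustration: the 115.4 Mb sequenced length is an assumption taken from the abstract of the cited paper rather than from this entry, and the function and variable names are hypothetical.

```python
# All lengths are in megabases (Mb).
SEQUENCED_MB = 115.4        # assumption: sequenced length from the paper's abstract, not this entry
UNSEQUENCED_MB = 10.0       # "roughly 10 megabases" of centromeric and rDNA repeats (Comments)
FLOW_CYTOMETRY_MB = 157.0   # flow cytometry estimate cited from BNID 100400


def estimate_genome_size_mb(sequenced_mb: float, unsequenced_mb: float) -> float:
    """Add the sequenced length to the estimated unsequenced length."""
    return sequenced_mb + unsequenced_mb


total_mb = estimate_genome_size_mb(SEQUENCED_MB, UNSEQUENCED_MB)
total_bp = total_mb * 1e6                  # convert Mb to bp
ratio = FLOW_CYTOMETRY_MB / total_mb       # flow cytometry relative to the sequence-based sum

print(f"Sequence-based estimate: {total_mb:.1f} Mb ({total_bp:.3g} bp)")
print(f"Flow cytometry / sequence-based: {ratio:.2f}")
```

Under these assumptions the sum rounds to the "about 125Mb" quoted above, and the flow cytometry figure is roughly a quarter larger.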
