Number of genes

Please note: The entry will be shown to all once approved by the database administrator.
Value 3767 unitless
Organism Bacteria Caulobacter crescentus
Reference Nierman et al., Complete genome sequence of Caulobacter crescentus. Proc Natl Acad Sci U S A. 2001 Mar 27 98(7):4136-41 free full text article table 1 (p. 4138 in pdf) Table - link PubMed ID11259647
Method The genome sequence of C. crescentus CB15 was determined by the whole genome random sequencing method (Heidelberg et al., 2000 PMID 10952301). ORFs were identified by using GLIMMER (6). Annotation of the identified ORFs was accomplished by manual curation of the outputs of a variety of similarity searches. Searches of the predicted coding regions were performed with BLASTP, as previously described (7). The protein– protein matches are aligned with blastoextendorepraze, a modified Smith–Waterman (8) algorithm that maximally extends regions of similarity across frameshifts. Gene identification is facilitated by searching against a database of nonredundant bacterial proteins (nraa) developed at TIGR and curated from the public archives GenBank, Genpept, PIR, and SwissProt. Searches matching entries in nraa have the corresponding role, gene common name, percent identity and similarity of match, pairwise sequence alignment, and taxonomy associated with the match assigned to the predicted coding region and stored in the database. (Numbers in brackets point to refs in article).
Comments A total of 3,767 predicted ORFs were identified, of which 2,030 (53.9%) are assigned putative functions, 725 (19.2%) have matches to hypothetical proteins, and 1,012 (26.9%) have no database match (Fig. 4, published as supplemental data on the PNAS web site, www.pnas.org).
Entered by Uri M
ID 105498