Number of genes

Please note: The entry will be shown to all once approved by the database administrator.
Value 23300 unitless
Organism Sea urchin Strongylocentrotus purpuratus
Reference Sea Urchin Genome Sequencing Consortium and researchers, The genome of the sea urchin Strongylocentrotus purpuratus. Science. 2006 Nov 10 314(5801):941-52. p. 942 right column 2nd paragraphPubMed ID17095691
Method To estimate the number of genes in the S. purpuratus genome, researchers began with the 28,900 gene models in the official gene set (OGS) and reduced this by the 5% redundancy found by mapping to the v 2.0 assembly, then increased it by a few percent for the new genes observed in the Ensembl set from the v 2.0 assembly compared with v 0.5. From manual analysis of well-characterized gene sets (e.g., ciliary, cell cycle control, and RNA metabolism genes), they estimated that, in addition to redundancy, another 25% of the genes in the OGS were fragments, pseudogenes, or otherwise not valid. Finally, whole-genome tiling microarray analysis showed 10% of the transcriptionally active regions (long open reading frames, not small RNAs) were not represented by genes in the OGS. Taken together, this analysis gave an estimate of about 23,300 genes for S. purpuratus. Information on all annotated genes can be found at (http://sugp.caltech.edu/resources/annotation.php).
Comments The overall trends in gene structure were similar to those seen in the human genome. The statistics of the Ensembl predictions from the whole-genome shotgun (WGS) assembly revealed an average of 8.3 exons and 7.3 introns per transcript (see SOM). The average gene length was 7.7 kb with an average primary transcript length of 8.9 kb. A broad distribution of all exon lengths peaked at around 100 to 115 nucleotides, whereas that for introns at around 750 nucleotides. The smaller average intron size relative to humans' was consistent with the trend that intron size is correlated with genome size. See BNID 105517
Entered by Uri M
ID 105518