||Human Homo sapiens
||Venter et al, celera genomics, The sequence of the human genome.
Science. 2001 Feb 16 291(5507):1304-51. p.1321 left column 2nd paragraphPubMed ID11181995
||P.1317 right column 2nd paragraph: "[Researchers] have developed a rule-based expert system, called Otto, to identify and characterize genes in the human genome (ref 60)." P.1317 right column 5th paragraph: "Initially, gene boundaries are predicted on the basis of examination of sets of overlapping protein and EST matches generated by a computational pipeline (ref 62). This pipeline searches the scaffold sequences against protein, EST, and genome-sequence databases to define regions of sequence similarity and runs three de novo gene-prediction programs."
||p.1321 left column 2nd paragraph: "[Researchers] estimate the average span for a "typical" gene in the human DNA sequence to be about 27,894 bases. This is based on the average span covered by RefSeq transcripts, used because it represents [their] highest confidence set." For value of 10-15 kbp see BNID 104316. Only 5% of average gene encodes protein, BNID 101970. Note-p.1327 table 11 gives average gene size of 27 kbp