Annotating non-coding regions of the genome

Roger P Alexander; Gang Fang; Joel Rozowsky; Michael Snyder; Mark B Gerstein

doi:10.1038/nrg2814

Annotating non-coding regions of the genome

Nat Rev Genet. 2010 Aug;11(8):559-71. doi: 10.1038/nrg2814. Epub 2010 Jul 13.

Authors

Roger P Alexander¹, Gang Fang, Joel Rozowsky, Michael Snyder, Mark B Gerstein

Affiliation

¹ Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut 06520, USA.

PMID: 20628352
DOI: 10.1038/nrg2814

Abstract

Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Review

MeSH terms

Animals
Chromosome Mapping
Conserved Sequence
DNA Transposable Elements
DNA, Intergenic / genetics*
Genome, Human*
Genomics / methods*
Genomics / trends
Humans
Pseudogenes
Regulatory Elements, Transcriptional
Sequence Alignment
Sequence Analysis, DNA
Tandem Repeat Sequences

Substances

DNA Transposable Elements
DNA, Intergenic