Large-scale identification of novel transcripts in the human genome

Brock A. Peters, Brad St. Croix, Tobias Sjöblom, Jordan M. Cummins, Natalie Silliman, Janine Ptak, Saurabh Saha, Kenneth W Kinzler, Christos Hatzis, Victor E Velculescu

Research output: Contribution to journalArticle

Abstract

Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remains to be fully determined. We used LongSAGE to analyze 660,357 human transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated transcripts suggests that 85% were differentially expressed in various tissue types and that fewer than 20% would have been detected by ab initio gene predictions. These studies suggest that the human genome contains on the order of twice as many transcribed regions as are currently annotated and that experimental approaches will be required to fully elucidate the novel genes corresponding to these transcripts.

Original languageEnglish (US)
Pages (from-to)287-292
Number of pages6
JournalGenome Research
Volume17
Issue number3
DOIs
StatePublished - Mar 2007

Fingerprint

Human Genome
Genes
Genome
Databases
Messenger RNA
Brain

ASJC Scopus subject areas

  • Genetics

Cite this

Peters, B. A., St. Croix, B., Sjöblom, T., Cummins, J. M., Silliman, N., Ptak, J., ... Velculescu, V. E. (2007). Large-scale identification of novel transcripts in the human genome. Genome Research, 17(3), 287-292. https://doi.org/10.1101/gr.5486607

Large-scale identification of novel transcripts in the human genome. / Peters, Brock A.; St. Croix, Brad; Sjöblom, Tobias; Cummins, Jordan M.; Silliman, Natalie; Ptak, Janine; Saha, Saurabh; Kinzler, Kenneth W; Hatzis, Christos; Velculescu, Victor E.

In: Genome Research, Vol. 17, No. 3, 03.2007, p. 287-292.

Research output: Contribution to journalArticle

Peters, BA, St. Croix, B, Sjöblom, T, Cummins, JM, Silliman, N, Ptak, J, Saha, S, Kinzler, KW, Hatzis, C & Velculescu, VE 2007, 'Large-scale identification of novel transcripts in the human genome', Genome Research, vol. 17, no. 3, pp. 287-292. https://doi.org/10.1101/gr.5486607
Peters BA, St. Croix B, Sjöblom T, Cummins JM, Silliman N, Ptak J et al. Large-scale identification of novel transcripts in the human genome. Genome Research. 2007 Mar;17(3):287-292. https://doi.org/10.1101/gr.5486607
Peters, Brock A. ; St. Croix, Brad ; Sjöblom, Tobias ; Cummins, Jordan M. ; Silliman, Natalie ; Ptak, Janine ; Saha, Saurabh ; Kinzler, Kenneth W ; Hatzis, Christos ; Velculescu, Victor E. / Large-scale identification of novel transcripts in the human genome. In: Genome Research. 2007 ; Vol. 17, No. 3. pp. 287-292.
@article{5c13ddcd8a94456e9277111f2595f1cb,
title = "Large-scale identification of novel transcripts in the human genome",
abstract = "Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remains to be fully determined. We used LongSAGE to analyze 660,357 human transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated transcripts suggests that 85{\%} were differentially expressed in various tissue types and that fewer than 20{\%} would have been detected by ab initio gene predictions. These studies suggest that the human genome contains on the order of twice as many transcribed regions as are currently annotated and that experimental approaches will be required to fully elucidate the novel genes corresponding to these transcripts.",
author = "Peters, {Brock A.} and {St. Croix}, Brad and Tobias Sj{\"o}blom and Cummins, {Jordan M.} and Natalie Silliman and Janine Ptak and Saurabh Saha and Kinzler, {Kenneth W} and Christos Hatzis and Velculescu, {Victor E}",
year = "2007",
month = "3",
doi = "10.1101/gr.5486607",
language = "English (US)",
volume = "17",
pages = "287--292",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "3",

}

TY - JOUR

T1 - Large-scale identification of novel transcripts in the human genome

AU - Peters, Brock A.

AU - St. Croix, Brad

AU - Sjöblom, Tobias

AU - Cummins, Jordan M.

AU - Silliman, Natalie

AU - Ptak, Janine

AU - Saha, Saurabh

AU - Kinzler, Kenneth W

AU - Hatzis, Christos

AU - Velculescu, Victor E

PY - 2007/3

Y1 - 2007/3

N2 - Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remains to be fully determined. We used LongSAGE to analyze 660,357 human transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated transcripts suggests that 85% were differentially expressed in various tissue types and that fewer than 20% would have been detected by ab initio gene predictions. These studies suggest that the human genome contains on the order of twice as many transcribed regions as are currently annotated and that experimental approaches will be required to fully elucidate the novel genes corresponding to these transcripts.

AB - Although the sequencing of the human genome has been completed, the number and identity of genes contained within it remains to be fully determined. We used LongSAGE to analyze 660,357 human transcripts from human brain mRNA and identified expression of 17,409 known genes and >15,000 different transcripts that were not annotated in genome databases. Analysis of a subset of these unannotated transcripts suggests that 85% were differentially expressed in various tissue types and that fewer than 20% would have been detected by ab initio gene predictions. These studies suggest that the human genome contains on the order of twice as many transcribed regions as are currently annotated and that experimental approaches will be required to fully elucidate the novel genes corresponding to these transcripts.

UR - http://www.scopus.com/inward/record.url?scp=33847693138&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33847693138&partnerID=8YFLogxK

U2 - 10.1101/gr.5486607

DO - 10.1101/gr.5486607

M3 - Article

C2 - 17267814

AN - SCOPUS:33847693138

VL - 17

SP - 287

EP - 292

JO - Genome Research

JF - Genome Research

SN - 1088-9051

IS - 3

ER -