The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium; The Human Cancer Genome Project Sequencing Consortium

doi:10.1073/pnas.1233632100

The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium, The Human Cancer Genome Project Sequencing Consortium

Research output: Contribution to journal › Article › peer-review

93 Scopus citations

Abstract

Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that < 1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.

Original language	English (US)
Pages (from-to)	13418-13423
Number of pages	6
Journal	Proceedings of the National Academy of Sciences of the United States of America
Volume	100
Issue number	23
DOIs	https://doi.org/10.1073/pnas.1233632100
State	Published - Nov 11 2003
Externally published	Yes

ASJC Scopus subject areas

General

Access to Document

10.1073/pnas.1233632100

Cite this

The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium, & The Human Cancer Genome Project Sequencing Consortium (2003). The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags. Proceedings of the National Academy of Sciences of the United States of America, 100(23), 13418-13423. https://doi.org/10.1073/pnas.1233632100

The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags. / The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium; The Human Cancer Genome Project Sequencing Consortium.
In: Proceedings of the National Academy of Sciences of the United States of America, Vol. 100, No. 23, 11.11.2003, p. 13418-13423.

Research output: Contribution to journal › Article › peer-review

The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium & The Human Cancer Genome Project Sequencing Consortium 2003, 'The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags', Proceedings of the National Academy of Sciences of the United States of America, vol. 100, no. 23, pp. 13418-13423. https://doi.org/10.1073/pnas.1233632100

The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium, The Human Cancer Genome Project Sequencing Consortium. The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags. Proceedings of the National Academy of Sciences of the United States of America. 2003 Nov 11;100(23):13418-13423. doi: 10.1073/pnas.1233632100

The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium ; The Human Cancer Genome Project Sequencing Consortium. / The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags. In: Proceedings of the National Academy of Sciences of the United States of America. 2003 ; Vol. 100, No. 23. pp. 13418-13423.

@article{00128797c67f4541b6194462b88d1cc7,

title = "The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags",

abstract = "Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that < 1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.",

author = "{The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium} and {The Human Cancer Genome Project Sequencing Consortium} and Helena Brentani and Caballero, {Ot{\'a}via L.} and Camargo, {Anamaria A.} and {Da Silva}, {Aline M.} and {Da Silva}, {Wilson Ara{\'u}jo} and Neto, {Emmanuel Dias} and Marco Grivet and Arthur Gruber and Guimaraes, {Pedro Edson Moreira} and Winston Hide and Christian Iseli and Jongeneel, {C. Victor} and Janet Kelso and Nagai, {Maria Aparecida} and Ojopi, {Elida Paula Benquique} and Osorio, {Elisson C.} and Reis, {Eduardo M.R.} and Riggins, {Gregory J.} and Simpson, {Andrew John George} and {De Souza}, Sandro and Stevenson, {Brian J.} and Strausberg, {Robert L.} and Tajara, {Eloiza H.} and Sergio Verjovski-Almeida and Acencio, {Marcio Luis} and Bengtson, {Mario Henrique} and Fabiana Bettoni and Bodmer, {Walter F.} and Briones, {Marcelo R.S.} and Camargo, {Luiz Paulo} and Webster Cavenee and Cerutti, {Janete M.} and Andrade, {Luıs Eduardo Coelho} and {dos Santos}, {Paulo Cesar Costa} and {Ramos Costa}, {Maria Cristina} and {da Silva}, {Israel Tojal} and Estecio, {Marcos Roberto H.} and Ferreira, {Karine Sa} and Furnari, {Frank B.} and Milton Faria and Galante, {Pedro A.F.} and Guimaraes, {Gustavo S.} and Holanda, {Adriano Jesus} and Kimura, {Edna Teruko} and Leerkes, {Maarten R.} and Maciel, {Rui M.B.} and Martins, {Elizabeth A.L.} and Massirer, {Katlin Brauer} and Melo, {Analy S.A.} and Paquola, {Apua C.M.}",

year = "2003",

month = nov,

day = "11",

doi = "10.1073/pnas.1233632100",

language = "English (US)",

volume = "100",

pages = "13418--13423",

journal = "Proceedings of the National Academy of Sciences of the United States of America",

issn = "0027-8424",

publisher = "National Academy of Sciences",

number = "23",

}

TY - JOUR

T1 - The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

AU - The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium

AU - The Human Cancer Genome Project Sequencing Consortium

AU - Brentani, Helena

AU - Caballero, Otávia L.

AU - Camargo, Anamaria A.

AU - Da Silva, Aline M.

AU - Da Silva, Wilson Araújo

AU - Neto, Emmanuel Dias

AU - Grivet, Marco

AU - Gruber, Arthur

AU - Guimaraes, Pedro Edson Moreira

AU - Hide, Winston

AU - Iseli, Christian

AU - Jongeneel, C. Victor

AU - Kelso, Janet

AU - Nagai, Maria Aparecida

AU - Ojopi, Elida Paula Benquique

AU - Osorio, Elisson C.

AU - Reis, Eduardo M.R.

AU - Riggins, Gregory J.

AU - Simpson, Andrew John George

AU - De Souza, Sandro

AU - Stevenson, Brian J.

AU - Strausberg, Robert L.

AU - Tajara, Eloiza H.

AU - Verjovski-Almeida, Sergio

AU - Acencio, Marcio Luis

AU - Bengtson, Mario Henrique

AU - Bettoni, Fabiana

AU - Bodmer, Walter F.

AU - Briones, Marcelo R.S.

AU - Camargo, Luiz Paulo

AU - Cavenee, Webster

AU - Cerutti, Janete M.

AU - Andrade, Luıs Eduardo Coelho

AU - dos Santos, Paulo Cesar Costa

AU - Ramos Costa, Maria Cristina

AU - da Silva, Israel Tojal

AU - Estecio, Marcos Roberto H.

AU - Ferreira, Karine Sa

AU - Furnari, Frank B.

AU - Faria, Milton

AU - Galante, Pedro A.F.

AU - Guimaraes, Gustavo S.

AU - Holanda, Adriano Jesus

AU - Kimura, Edna Teruko

AU - Leerkes, Maarten R.

AU - Maciel, Rui M.B.

AU - Martins, Elizabeth A.L.

AU - Massirer, Katlin Brauer

AU - Melo, Analy S.A.

AU - Paquola, Apua C.M.

PY - 2003/11/11

Y1 - 2003/11/11

N2 - Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that < 1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.

AB - Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that < 1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.

UR - http://www.scopus.com/inward/record.url?scp=0345686789&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0345686789&partnerID=8YFLogxK

U2 - 10.1073/pnas.1233632100

DO - 10.1073/pnas.1233632100

M3 - Article

C2 - 14593198

AN - SCOPUS:0345686789

SN - 0027-8424

VL - 100

SP - 13418

EP - 13423

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

IS - 23

ER -

The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this