TY - JOUR
T1 - The generation and utilization of a cancer-oriented representation of the human transcriptome by using expressed sequence tags
AU - The Human Cancer Genome Project/Cancer Genome Anatomy Project Annotation Consortium
AU - The Human Cancer Genome Project Sequencing Consortium
AU - Brentani, Helena
AU - Caballero, Otávia L.
AU - Camargo, Anamaria A.
AU - Da Silva, Aline M.
AU - Da Silva, Wilson Araújo
AU - Neto, Emmanuel Dias
AU - Grivet, Marco
AU - Gruber, Arthur
AU - Guimaraes, Pedro Edson Moreira
AU - Hide, Winston
AU - Iseli, Christian
AU - Jongeneel, C. Victor
AU - Kelso, Janet
AU - Nagai, Maria Aparecida
AU - Ojopi, Elida Paula Benquique
AU - Osorio, Elisson C.
AU - Reis, Eduardo M.R.
AU - Riggins, Gregory J.
AU - Simpson, Andrew John George
AU - De Souza, Sandro
AU - Stevenson, Brian J.
AU - Strausberg, Robert L.
AU - Tajara, Eloiza H.
AU - Verjovski-Almeida, Sergio
AU - Acencio, Marcio Luis
AU - Bengtson, Mario Henrique
AU - Bettoni, Fabiana
AU - Bodmer, Walter F.
AU - Briones, Marcelo R.S.
AU - Camargo, Luiz Paulo
AU - Cavenee, Webster
AU - Cerutti, Janete M.
AU - Andrade, Luıs Eduardo Coelho
AU - dos Santos, Paulo Cesar Costa
AU - Ramos Costa, Maria Cristina
AU - da Silva, Israel Tojal
AU - Estecio, Marcos Roberto H.
AU - Ferreira, Karine Sa
AU - Furnari, Frank B.
AU - Faria, Milton
AU - Galante, Pedro A.F.
AU - Guimaraes, Gustavo S.
AU - Holanda, Adriano Jesus
AU - Kimura, Edna Teruko
AU - Leerkes, Maarten R.
AU - Maciel, Rui M.B.
AU - Martins, Elizabeth A.L.
AU - Massirer, Katlin Brauer
AU - Melo, Analy S.A.
AU - Paquola, Apua C.M.
PY - 2003/11/11
Y1 - 2003/11/11
N2 - Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that < 1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.
AB - Whereas genome sequencing defines the genetic potential of an organism, transcript sequencing defines the utilization of this potential and links the genome with most areas of biology. To exploit the information within the human genome in the fight against cancer, we have deposited some two million expressed sequence tags (ESTs) from human tumors and their corresponding normal tissues in the public databases. The data currently define ≈23,500 genes, of which only ≈1,250 are still represented only by ESTs. Examination of the EST coverage of known cancer-related (CR) genes reveals that < 1% do not have corresponding ESTs, indicating that the representation of genes associated with commonly studied tumors is high. The careful recording of the origin of all ESTs we have produced has enabled detailed definition of where the genes they represent are expressed in the human body. More than 100,000 ESTs are available for seven tissues, indicating a surprising variability of gene usage that has led to the discovery of a significant number of genes with restricted expression, and that may thus be therapeutically useful. The ESTs also reveal novel nonsynonymous germline variants (although the one-pass nature of the data necessitates careful validation) and many alternatively spliced transcripts. Although widely exploited by the scientific community, vindicating our totally open source policy, the EST data generated still provide extensive information that remains to be systematically explored, and that may further facilitate progress toward both the understanding and treatment of human cancers.
UR - http://www.scopus.com/inward/record.url?scp=0345686789&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0345686789&partnerID=8YFLogxK
U2 - 10.1073/pnas.1233632100
DO - 10.1073/pnas.1233632100
M3 - Article
C2 - 14593198
AN - SCOPUS:0345686789
SN - 0027-8424
VL - 100
SP - 13418
EP - 13423
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 23
ER -