The impact of SNPs on the interpretation of SAGE and MPSS experimental data

Ana Paula M Silva, Jorge E S De Souza, Pedro A F Galante, Gregory J Riggins, Sandro J. De Souza, Anamaria A. Camargo

Research output: Contribution to journalArticle

Abstract

Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS) are powerful techniques for gene expression analysis. A crucial step in analyzing SAGE and MPSS data is the assignment of experimentally obtained tags to a known transcript. However, tag to transcript assignment is not a straightforward process since alternative tags for a given transcript can also be experimentally obtained. Here, we have evaluated the impact of Single Nucleotide Polymorphisms (SNPs) on the generation of alternative SAGE and MPSS tags. This was achieved through the construction of a reference database of SNP-associated alternative tags, which has been integrated with SAGE Genie. A total of 2020 SNP-associated alternative tags were catalogued in our reference database and at least one SNP-associated alternative tag was observed for ∼8.6% of all known human genes. A significant fraction (61.9%) of these alternative tags matched a list of experimentally obtained tags, validating their existence. In addition, the origin of four out of five SNP-associated alternative MPSS tags was experimentally confirmed through the use of the GLGI-MPSS protocol (Generation of Long cDNA fragments for Gene Identification). The availability of our SNP-associated alternative tag database will certainly improve the interpretation of SAGE and MPSS experiments.

Original languageEnglish (US)
Pages (from-to)6104-6110
Number of pages7
JournalNucleic Acids Research
Volume32
Issue number20
DOIs
StatePublished - 2004

Fingerprint

High-Throughput Nucleotide Sequencing
Single Nucleotide Polymorphism
Gene Expression
Databases
Genes
Complementary DNA

ASJC Scopus subject areas

  • Genetics

Cite this

Silva, A. P. M., De Souza, J. E. S., Galante, P. A. F., Riggins, G. J., De Souza, S. J., & Camargo, A. A. (2004). The impact of SNPs on the interpretation of SAGE and MPSS experimental data. Nucleic Acids Research, 32(20), 6104-6110. https://doi.org/10.1093/nar/gkh937

The impact of SNPs on the interpretation of SAGE and MPSS experimental data. / Silva, Ana Paula M; De Souza, Jorge E S; Galante, Pedro A F; Riggins, Gregory J; De Souza, Sandro J.; Camargo, Anamaria A.

In: Nucleic Acids Research, Vol. 32, No. 20, 2004, p. 6104-6110.

Research output: Contribution to journalArticle

Silva, APM, De Souza, JES, Galante, PAF, Riggins, GJ, De Souza, SJ & Camargo, AA 2004, 'The impact of SNPs on the interpretation of SAGE and MPSS experimental data', Nucleic Acids Research, vol. 32, no. 20, pp. 6104-6110. https://doi.org/10.1093/nar/gkh937
Silva, Ana Paula M ; De Souza, Jorge E S ; Galante, Pedro A F ; Riggins, Gregory J ; De Souza, Sandro J. ; Camargo, Anamaria A. / The impact of SNPs on the interpretation of SAGE and MPSS experimental data. In: Nucleic Acids Research. 2004 ; Vol. 32, No. 20. pp. 6104-6110.
@article{5693a47b1616435aa30b0dd7fce94912,
title = "The impact of SNPs on the interpretation of SAGE and MPSS experimental data",
abstract = "Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS) are powerful techniques for gene expression analysis. A crucial step in analyzing SAGE and MPSS data is the assignment of experimentally obtained tags to a known transcript. However, tag to transcript assignment is not a straightforward process since alternative tags for a given transcript can also be experimentally obtained. Here, we have evaluated the impact of Single Nucleotide Polymorphisms (SNPs) on the generation of alternative SAGE and MPSS tags. This was achieved through the construction of a reference database of SNP-associated alternative tags, which has been integrated with SAGE Genie. A total of 2020 SNP-associated alternative tags were catalogued in our reference database and at least one SNP-associated alternative tag was observed for ∼8.6{\%} of all known human genes. A significant fraction (61.9{\%}) of these alternative tags matched a list of experimentally obtained tags, validating their existence. In addition, the origin of four out of five SNP-associated alternative MPSS tags was experimentally confirmed through the use of the GLGI-MPSS protocol (Generation of Long cDNA fragments for Gene Identification). The availability of our SNP-associated alternative tag database will certainly improve the interpretation of SAGE and MPSS experiments.",
author = "Silva, {Ana Paula M} and {De Souza}, {Jorge E S} and Galante, {Pedro A F} and Riggins, {Gregory J} and {De Souza}, {Sandro J.} and Camargo, {Anamaria A.}",
year = "2004",
doi = "10.1093/nar/gkh937",
language = "English (US)",
volume = "32",
pages = "6104--6110",
journal = "Nucleic Acids Research",
issn = "1362-4962",
publisher = "Oxford University Press",
number = "20",

}

TY - JOUR

T1 - The impact of SNPs on the interpretation of SAGE and MPSS experimental data

AU - Silva, Ana Paula M

AU - De Souza, Jorge E S

AU - Galante, Pedro A F

AU - Riggins, Gregory J

AU - De Souza, Sandro J.

AU - Camargo, Anamaria A.

PY - 2004

Y1 - 2004

N2 - Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS) are powerful techniques for gene expression analysis. A crucial step in analyzing SAGE and MPSS data is the assignment of experimentally obtained tags to a known transcript. However, tag to transcript assignment is not a straightforward process since alternative tags for a given transcript can also be experimentally obtained. Here, we have evaluated the impact of Single Nucleotide Polymorphisms (SNPs) on the generation of alternative SAGE and MPSS tags. This was achieved through the construction of a reference database of SNP-associated alternative tags, which has been integrated with SAGE Genie. A total of 2020 SNP-associated alternative tags were catalogued in our reference database and at least one SNP-associated alternative tag was observed for ∼8.6% of all known human genes. A significant fraction (61.9%) of these alternative tags matched a list of experimentally obtained tags, validating their existence. In addition, the origin of four out of five SNP-associated alternative MPSS tags was experimentally confirmed through the use of the GLGI-MPSS protocol (Generation of Long cDNA fragments for Gene Identification). The availability of our SNP-associated alternative tag database will certainly improve the interpretation of SAGE and MPSS experiments.

AB - Serial Analysis of Gene Expression (SAGE) and Massively Parallel Signature Sequencing (MPSS) are powerful techniques for gene expression analysis. A crucial step in analyzing SAGE and MPSS data is the assignment of experimentally obtained tags to a known transcript. However, tag to transcript assignment is not a straightforward process since alternative tags for a given transcript can also be experimentally obtained. Here, we have evaluated the impact of Single Nucleotide Polymorphisms (SNPs) on the generation of alternative SAGE and MPSS tags. This was achieved through the construction of a reference database of SNP-associated alternative tags, which has been integrated with SAGE Genie. A total of 2020 SNP-associated alternative tags were catalogued in our reference database and at least one SNP-associated alternative tag was observed for ∼8.6% of all known human genes. A significant fraction (61.9%) of these alternative tags matched a list of experimentally obtained tags, validating their existence. In addition, the origin of four out of five SNP-associated alternative MPSS tags was experimentally confirmed through the use of the GLGI-MPSS protocol (Generation of Long cDNA fragments for Gene Identification). The availability of our SNP-associated alternative tag database will certainly improve the interpretation of SAGE and MPSS experiments.

UR - http://www.scopus.com/inward/record.url?scp=11344268655&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=11344268655&partnerID=8YFLogxK

U2 - 10.1093/nar/gkh937

DO - 10.1093/nar/gkh937

M3 - Article

C2 - 15562001

AN - SCOPUS:11344268655

VL - 32

SP - 6104

EP - 6110

JO - Nucleic Acids Research

JF - Nucleic Acids Research

SN - 1362-4962

IS - 20

ER -