A method for identifying splice sites and translationai start sites in eukaryotic mrna

Steven L. Salzberg

doi:10.1093/bioinformatics/13.4.365

A method for identifying splice sites and translationai start sites in eukaryotic mrna

Steven L. Salzberg

Research output: Contribution to journal › Article › peer-review

105 Scopus citations

Abstract

This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compute the most likely sequence, new consensus sequences emerge. The consensus sequence information is summarized in conditional probability matrices which, when used to locate signals in uncharacterized genomic DNA, have greater sensitivity and specificity than conventional matrices. Species-specific versions of these matrices are especially effective at distinguishing true and false sites.

Original language	English (US)
Pages (from-to)	365-376
Number of pages	12
Journal	Bioinformatics
Volume	13
Issue number	4
DOIs	https://doi.org/10.1093/bioinformatics/13.4.365
State	Published - 1997
Externally published	Yes

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

Access to Document

10.1093/bioinformatics/13.4.365

Cite this

@article{1850787c8e0547629fda8808aaba9b07,

title = "A method for identifying splice sites and translationai start sites in eukaryotic mrna",

abstract = "This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compute the most likely sequence, new consensus sequences emerge. The consensus sequence information is summarized in conditional probability matrices which, when used to locate signals in uncharacterized genomic DNA, have greater sensitivity and specificity than conventional matrices. Species-specific versions of these matrices are especially effective at distinguishing true and false sites.",

author = "Salzberg, {Steven L.}",

note = "Funding Information: Thanks to Simon Kasif. Art Delcher and the anonymous reviewers for many helpful suggestions. This material is based on work supported by the National Science foundation under grant no. IRI-9530462. and by the National Center for Human Genome Research at the National Institutes of Health under grant no. KO1-HGO0022-1.",

year = "1997",

doi = "10.1093/bioinformatics/13.4.365",

language = "English (US)",

volume = "13",

pages = "365--376",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "4",

}

TY - JOUR

T1 - A method for identifying splice sites and translationai start sites in eukaryotic mrna

AU - Salzberg, Steven L.

N1 - Funding Information: Thanks to Simon Kasif. Art Delcher and the anonymous reviewers for many helpful suggestions. This material is based on work supported by the National Science foundation under grant no. IRI-9530462. and by the National Center for Human Genome Research at the National Institutes of Health under grant no. KO1-HGO0022-1.

PY - 1997

Y1 - 1997

N2 - This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compute the most likely sequence, new consensus sequences emerge. The consensus sequence information is summarized in conditional probability matrices which, when used to locate signals in uncharacterized genomic DNA, have greater sensitivity and specificity than conventional matrices. Species-specific versions of these matrices are especially effective at distinguishing true and false sites.

AB - This paper describes a new method for determining the consensus sequences that signal the start of translation and the boundaries between exons and introns (donor and acceptor sites) in eukaryotic mRNA. The method takes into account the dependencies between adjacent bases, in contrast to the usual technique of considering each position independently. When coupled with a dynamic program to compute the most likely sequence, new consensus sequences emerge. The consensus sequence information is summarized in conditional probability matrices which, when used to locate signals in uncharacterized genomic DNA, have greater sensitivity and specificity than conventional matrices. Species-specific versions of these matrices are especially effective at distinguishing true and false sites.

UR - http://www.scopus.com/inward/record.url?scp=0030869663&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0030869663&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/13.4.365

DO - 10.1093/bioinformatics/13.4.365

M3 - Article

C2 - 9283751

AN - SCOPUS:0030869663

SN - 1367-4803

VL - 13

SP - 365

EP - 376

JO - Bioinformatics

JF - Bioinformatics

IS - 4

ER -

A method for identifying splice sites and translationai start sites in eukaryotic mrna

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this