Complete sequence of the Rous sarcoma virus env gene: Identification of structural and functional regions of its product

E. Hunter, E. Hill, J Marie Hardwick, A. Bhown, D. E. Schwartz, R. Tizard

Research output: Contribution to journal (Article)

Abstract

The amino-terminal amino acid sequences of gp85 and gp37, the envelope glycoproteins of Rous sarcoma virus (RSV), were determined. Alignment of these sequences with the amino acid sequence predicted from the complete nucleotide sequence of the Prague strain of RSV, subgroup C (PR-C), has allowed us to delineate the env gene-coding region of this virus. The coding sequences for gp85 and gp37 have been placed in an open reading frame that extends from nucleotide 5045 to nucleotide 6862 and predict sizes of 341 amino acids (36,962 molecular weight) for gp85 and 198 amino acids (21,566 molecular weight) for gp37. Carbohydrate makes a significant contribution to the observed molecular weights of these polypeptides: the amino acid sequence contains 14 potential glycosylation sites (Asn-X-Ser/Thr) in gp85 and two in gp37. Experiments aimed at estimating the number of carbohydrate side chains yielded results consistent with most or all of these sites being occupied. Although an initiation codon is located early (codon 4) in the open reading frame, it is likely that splicing yields an mRNA on which translation initiates at the same AUG as that of the gag gene to produce a nascent polypeptide in which gp85 is preceded by a 62-amino-acid-long leader peptide. This leader contains the hydrophobic sequence (signal sequence) necessary for translocation across the endoplasmic reticulum and is completely removed from the env gene product during translation. The polyprotein precursor, Pr95(env), is cleaved to gp85 and gp37 at the carboxyl side of the basic sequence: -Arg-Arg-Lys-Arg-. gp85 is attached through a disulphide linkage to gp37, and although the positions of the cysteines involved in this linkage are not known, the presence of a 27-amino-acid-long hydrophobic region at the carboxy-terminus of gp37 is consistent with its role as a membrane anchor for the viral glycoprotein complex. The location of host range variable regions with respect to the possible tertiary structure of the complex is discussed.
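
The sequence features named in the abstract (Asn-X-Ser/Thr glycosylation sites, the -Arg-Arg-Lys-Arg- cleavage site, and the open reading frame coordinates) can be illustrated with a minimal Python sketch. This is not the authors' method, only an illustration under stated assumptions: the function names and the `TOY` sequence are hypothetical, `TOY` is not the RSV env sequence, and excluding proline at the X position is a common convention that the abstract itself does not state.

```python
import re

# Asn-X-Ser/Thr motif in one-letter code. The lookahead allows overlapping
# motifs to be reported. Excluding Pro at X is an assumption (a common
# convention), not something stated in the abstract.
SEQUON = re.compile(r"(?=N[^P][ST])")


def find_sequons(protein):
    """Return 1-based positions of the Asn in each Asn-X-Ser/Thr motif."""
    return [m.start() + 1 for m in SEQUON.finditer(protein)]


def split_at_basic_site(protein, site="RRKR"):
    """Split a precursor at the carboxyl side of the first `site` match.

    The basic sequence stays on the amino-terminal fragment, mirroring
    cleavage "at the carboxyl side" of -Arg-Arg-Lys-Arg-.
    """
    idx = protein.find(site)
    if idx == -1:
        raise ValueError("cleavage site not found: " + site)
    cut = idx + len(site)
    return protein[:cut], protein[cut:]


def orf_codons(start_nt, end_nt):
    """Number of codons in an ORF given inclusive nucleotide coordinates."""
    length = end_nt - start_nt + 1
    if length % 3 != 0:
        raise ValueError("ORF length is not a multiple of 3")
    return length // 3


# Hypothetical toy precursor for illustration only; NOT the RSV env sequence.
TOY = "MKNATLLNGSPQRRKRSVNPTAANVTGLLLLLLLL"

sites = find_sequons(TOY)            # "NPT" is skipped because X is Pro
head, tail = split_at_basic_site(TOY)
codons = orf_codons(5045, 6862)      # coordinates quoted in the abstract
```

With the coordinates from the abstract, the open reading frame spans 1,818 nucleotides, or 606 codons, while the two mature products together account for 341 + 198 = 539 residues. Running `find_sequons` on real gp85 and gp37 sequences would be the way to compare against the 14 and two potential sites reported above.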

Original language: English (US)
Pages (from-to): 920-936
Number of pages: 17
Journal: Journal of Virology
Volume: 46
Issue number: 3
State: Published - 1983
Externally published: Yes

Fingerprint

Rous sarcoma virus
env Genes
Amino Acids
Amino Acid Sequence
amino acids
amino acid sequences
Molecular Weight
molecular weight
Protein Sorting Signals
signal peptide
translation (genetics)
Open Reading Frames
linkage (genetics)
open reading frames
glycoproteins
polypeptides
genes
Nucleotides
nucleotides
Carbohydrates

ASJC Scopus subject areas

  • Immunology

Cite this

Complete sequence of the Rous sarcoma virus env gene: Identification of structural and functional regions of its product. / Hunter, E.; Hill, E.; Hardwick, J Marie; Bhown, A.; Schwartz, D. E.; Tizard, R.

In: Journal of Virology, Vol. 46, No. 3, 1983, p. 920-936.

@article{0ccea21172f6476a9a9bebea4d19e0d3,
title = "Complete sequence of the Rous sarcoma virus env gene: Identification of structural and functional regions of its product",
author = "E. Hunter and E. Hill and Hardwick, {J Marie} and A. Bhown and Schwartz, {D. E.} and R. Tizard",
year = "1983",
language = "English (US)",
volume = "46",
pages = "920--936",
journal = "Journal of Virology",
issn = "0022-538X",
publisher = "American Society for Microbiology",
number = "3",

}

TY - JOUR

T1 - Complete sequence of the Rous sarcoma virus env gene

T2 - Identification of structural and functional regions of its product

AU - Hunter, E.

AU - Hill, E.

AU - Hardwick, J Marie

AU - Bhown, A.

AU - Schwartz, D. E.

AU - Tizard, R.

PY - 1983

Y1 - 1983

UR - http://www.scopus.com/inward/record.url?scp=0020575030&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0020575030&partnerID=8YFLogxK

M3 - Article

C2 - 6304351

AN - SCOPUS:0020575030

VL - 46

SP - 920

EP - 936

JO - Journal of Virology

JF - Journal of Virology

SN - 0022-538X

IS - 3

ER -