The GC skew near Pol II start sites and its association with SP1-binding site variants

Yu A. Medvedeva, I. V. Kulakovskii, N. Yu Oparina, Alexander Favorov, V. Yu Makeev

Research output: Contribution to journalArticle

Abstract

Nucleotide sequences of DNA within clusters of transcription start sites identified by the Cap Analysis of Gene Expression (CAGE) have some distinctive features. DNA within such clusters is enriched in cytosine and guanine, and its GC-skew agrees with selection of the coding strand for which the G content exceeds the C content. On the other hand, for the coding strand the frequency of tracts of the avoided cytosine, normalized to the expectation calculated from the local content of the nucleotide in the cluster, is significantly higher than that of the tracts of the preferred guanine. Similarly, the statistical significance of the C-rich variant of binding site for transcription factor Sp1 in the coding strand is higher than that of the G-rich variant. Yet it is unlikely that the choice of the Sp1 site variant is induced by the coding strand selection. Rather, it is more likely that both variants are more or less equiprobable, and the Sp1 functional binding works as a selection factor, which counteracts the mutations bringing about the GC-skew.

Original languageEnglish (US)
Pages (from-to)901-907
Number of pages7
JournalBiophysics
Volume55
Issue number6
DOIs
StatePublished - Dec 2010
Externally publishedYes

Fingerprint

Cytosine
Guanine
Binding Sites
Sp1 Transcription Factor
Transcription Initiation Site
DNA
Base Composition
Nucleotides
Gene Expression
Mutation

Keywords

  • CAGE
  • cap analysis of gene expression
  • Homo sapiens
  • Sp1
  • transcription factor

ASJC Scopus subject areas

  • Biophysics

Cite this

The GC skew near Pol II start sites and its association with SP1-binding site variants. / Medvedeva, Yu A.; Kulakovskii, I. V.; Oparina, N. Yu; Favorov, Alexander; Makeev, V. Yu.

In: Biophysics, Vol. 55, No. 6, 12.2010, p. 901-907.

Research output: Contribution to journalArticle

Medvedeva, Yu A. ; Kulakovskii, I. V. ; Oparina, N. Yu ; Favorov, Alexander ; Makeev, V. Yu. / The GC skew near Pol II start sites and its association with SP1-binding site variants. In: Biophysics. 2010 ; Vol. 55, No. 6. pp. 901-907.
@article{5df605d48aaa458880b483f654949d97,
title = "The GC skew near Pol II start sites and its association with SP1-binding site variants",
abstract = "Nucleotide sequences of DNA within clusters of transcription start sites identified by the Cap Analysis of Gene Expression (CAGE) have some distinctive features. DNA within such clusters is enriched in cytosine and guanine, and its GC-skew agrees with selection of the coding strand for which the G content exceeds the C content. On the other hand, for the coding strand the frequency of tracts of the avoided cytosine, normalized to the expectation calculated from the local content of the nucleotide in the cluster, is significantly higher than that of the tracts of the preferred guanine. Similarly, the statistical significance of the C-rich variant of binding site for transcription factor Sp1 in the coding strand is higher than that of the G-rich variant. Yet it is unlikely that the choice of the Sp1 site variant is induced by the coding strand selection. Rather, it is more likely that both variants are more or less equiprobable, and the Sp1 functional binding works as a selection factor, which counteracts the mutations bringing about the GC-skew.",
keywords = "CAGE, cap analysis of gene expression, Homo sapiens, Sp1, transcription factor",
author = "Medvedeva, {Yu A.} and Kulakovskii, {I. V.} and Oparina, {N. Yu} and Alexander Favorov and Makeev, {V. Yu}",
year = "2010",
month = "12",
doi = "10.1134/S0006350910060023",
language = "English (US)",
volume = "55",
pages = "901--907",
journal = "Biophysics (Russian Federation)",
issn = "0006-3509",
publisher = "Maik Nauka-Interperiodica Publishing",
number = "6",

}

TY - JOUR

T1 - The GC skew near Pol II start sites and its association with SP1-binding site variants

AU - Medvedeva, Yu A.

AU - Kulakovskii, I. V.

AU - Oparina, N. Yu

AU - Favorov, Alexander

AU - Makeev, V. Yu

PY - 2010/12

Y1 - 2010/12

N2 - Nucleotide sequences of DNA within clusters of transcription start sites identified by the Cap Analysis of Gene Expression (CAGE) have some distinctive features. DNA within such clusters is enriched in cytosine and guanine, and its GC-skew agrees with selection of the coding strand for which the G content exceeds the C content. On the other hand, for the coding strand the frequency of tracts of the avoided cytosine, normalized to the expectation calculated from the local content of the nucleotide in the cluster, is significantly higher than that of the tracts of the preferred guanine. Similarly, the statistical significance of the C-rich variant of binding site for transcription factor Sp1 in the coding strand is higher than that of the G-rich variant. Yet it is unlikely that the choice of the Sp1 site variant is induced by the coding strand selection. Rather, it is more likely that both variants are more or less equiprobable, and the Sp1 functional binding works as a selection factor, which counteracts the mutations bringing about the GC-skew.

AB - Nucleotide sequences of DNA within clusters of transcription start sites identified by the Cap Analysis of Gene Expression (CAGE) have some distinctive features. DNA within such clusters is enriched in cytosine and guanine, and its GC-skew agrees with selection of the coding strand for which the G content exceeds the C content. On the other hand, for the coding strand the frequency of tracts of the avoided cytosine, normalized to the expectation calculated from the local content of the nucleotide in the cluster, is significantly higher than that of the tracts of the preferred guanine. Similarly, the statistical significance of the C-rich variant of binding site for transcription factor Sp1 in the coding strand is higher than that of the G-rich variant. Yet it is unlikely that the choice of the Sp1 site variant is induced by the coding strand selection. Rather, it is more likely that both variants are more or less equiprobable, and the Sp1 functional binding works as a selection factor, which counteracts the mutations bringing about the GC-skew.

KW - CAGE

KW - cap analysis of gene expression

KW - Homo sapiens

KW - Sp1

KW - transcription factor

UR - http://www.scopus.com/inward/record.url?scp=79953135834&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79953135834&partnerID=8YFLogxK

U2 - 10.1134/S0006350910060023

DO - 10.1134/S0006350910060023

M3 - Article

VL - 55

SP - 901

EP - 907

JO - Biophysics (Russian Federation)

JF - Biophysics (Russian Federation)

SN - 0006-3509

IS - 6

ER -