Power law exponents characterizing human DNA

A. Provata, Th Oikonomou

Research output: Contribution to journalArticle

9 Citations (Scopus)

Abstract

The size distributions of all known coding and noncoding DNA sequences are studied in all human chromosomes. In a unified approach, both introns and intergenic regions are treated as noncoding regions. The distributions of noncoding segments Pnc (S) of size S present long tails Pnc (S) ∼ S-1- μnc, with exponents μnc ranging between 0.71 (for chromosome 13) and 1.2 (for chromosome 19). On the contrary, the exponential, short-range decay terms dominate in the distributions of coding (exon) segments Pc (S) in all chromosomes. Aiming to address the emergence of these statistical features, minimal, stochastic, mean-field models are proposed, based on randomly aggregating DNA strings with duplication, influx and outflux of genomic segments. These minimal models produce both the short-range statistics in the coding and the observed power law and fractal statistics in the noncoding DNA. The minimal models also demonstrate that although the two systems (coding and noncoding) coexist, alternating on the same linear chain, they act independently: the coding as a closed, equilibrium system and the noncoding as an open, out-of-equilibrium one.

Original languageEnglish
Article number056102
JournalPhysical Review E - Statistical, Nonlinear, and Soft Matter Physics
Volume75
Issue number5
DOIs
Publication statusPublished - May 3 2007
Externally publishedYes

Fingerprint

chromosomes
Power Law
coding
deoxyribonucleic acid
Coding
Exponent
Chromosome
exponents
Minimal Model
statistics
Statistics
Mean-field Model
Duplication
DNA Sequence
Range of data
Genomics
Tail
Fractal
fractals
strings

ASJC Scopus subject areas

  • Physics and Astronomy(all)
  • Condensed Matter Physics
  • Statistical and Nonlinear Physics
  • Mathematical Physics

Cite this

Power law exponents characterizing human DNA. / Provata, A.; Oikonomou, Th.

In: Physical Review E - Statistical, Nonlinear, and Soft Matter Physics, Vol. 75, No. 5, 056102, 03.05.2007.

Research output: Contribution to journalArticle

@article{24ef86671aba491793356d3b01c2720d,
title = "Power law exponents characterizing human DNA",
abstract = "The size distributions of all known coding and noncoding DNA sequences are studied in all human chromosomes. In a unified approach, both introns and intergenic regions are treated as noncoding regions. The distributions of noncoding segments Pnc (S) of size S present long tails Pnc (S) ∼ S-1- μnc, with exponents μnc ranging between 0.71 (for chromosome 13) and 1.2 (for chromosome 19). On the contrary, the exponential, short-range decay terms dominate in the distributions of coding (exon) segments Pc (S) in all chromosomes. Aiming to address the emergence of these statistical features, minimal, stochastic, mean-field models are proposed, based on randomly aggregating DNA strings with duplication, influx and outflux of genomic segments. These minimal models produce both the short-range statistics in the coding and the observed power law and fractal statistics in the noncoding DNA. The minimal models also demonstrate that although the two systems (coding and noncoding) coexist, alternating on the same linear chain, they act independently: the coding as a closed, equilibrium system and the noncoding as an open, out-of-equilibrium one.",
author = "A. Provata and Th Oikonomou",
year = "2007",
month = "5",
day = "3",
doi = "10.1103/PhysRevE.75.056102",
language = "English",
volume = "75",
journal = "Physical review. E",
issn = "2470-0045",
publisher = "American Physical Society",
number = "5",

}

TY - JOUR

T1 - Power law exponents characterizing human DNA

AU - Provata, A.

AU - Oikonomou, Th

PY - 2007/5/3

Y1 - 2007/5/3

N2 - The size distributions of all known coding and noncoding DNA sequences are studied in all human chromosomes. In a unified approach, both introns and intergenic regions are treated as noncoding regions. The distributions of noncoding segments Pnc (S) of size S present long tails Pnc (S) ∼ S-1- μnc, with exponents μnc ranging between 0.71 (for chromosome 13) and 1.2 (for chromosome 19). On the contrary, the exponential, short-range decay terms dominate in the distributions of coding (exon) segments Pc (S) in all chromosomes. Aiming to address the emergence of these statistical features, minimal, stochastic, mean-field models are proposed, based on randomly aggregating DNA strings with duplication, influx and outflux of genomic segments. These minimal models produce both the short-range statistics in the coding and the observed power law and fractal statistics in the noncoding DNA. The minimal models also demonstrate that although the two systems (coding and noncoding) coexist, alternating on the same linear chain, they act independently: the coding as a closed, equilibrium system and the noncoding as an open, out-of-equilibrium one.

AB - The size distributions of all known coding and noncoding DNA sequences are studied in all human chromosomes. In a unified approach, both introns and intergenic regions are treated as noncoding regions. The distributions of noncoding segments Pnc (S) of size S present long tails Pnc (S) ∼ S-1- μnc, with exponents μnc ranging between 0.71 (for chromosome 13) and 1.2 (for chromosome 19). On the contrary, the exponential, short-range decay terms dominate in the distributions of coding (exon) segments Pc (S) in all chromosomes. Aiming to address the emergence of these statistical features, minimal, stochastic, mean-field models are proposed, based on randomly aggregating DNA strings with duplication, influx and outflux of genomic segments. These minimal models produce both the short-range statistics in the coding and the observed power law and fractal statistics in the noncoding DNA. The minimal models also demonstrate that although the two systems (coding and noncoding) coexist, alternating on the same linear chain, they act independently: the coding as a closed, equilibrium system and the noncoding as an open, out-of-equilibrium one.

UR - http://www.scopus.com/inward/record.url?scp=34248218098&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34248218098&partnerID=8YFLogxK

U2 - 10.1103/PhysRevE.75.056102

DO - 10.1103/PhysRevE.75.056102

M3 - Article

AN - SCOPUS:34248218098

VL - 75

JO - Physical review. E

JF - Physical review. E

SN - 2470-0045

IS - 5

M1 - 056102

ER -