Cargando…

The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties

BACKGROUND: The sequencing of genomes provides us with an inventory of the 'molecular parts' in nature, such as protein families and folds, and their functions in living organisms. Through the analysis of such inventories, it has been shown that different genomes have very different usage...

Descripción completa

Detalles Bibliográficos
Autores principales: Luscombe, Nicholas M, Qian, Jiang, Zhang, Zhaolei, Johnson, Ted, Gerstein, Mark
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC126234/
https://www.ncbi.nlm.nih.gov/pubmed/12186647
_version_ 1782120323409248256
author Luscombe, Nicholas M
Qian, Jiang
Zhang, Zhaolei
Johnson, Ted
Gerstein, Mark
author_facet Luscombe, Nicholas M
Qian, Jiang
Zhang, Zhaolei
Johnson, Ted
Gerstein, Mark
author_sort Luscombe, Nicholas M
collection PubMed
description BACKGROUND: The sequencing of genomes provides us with an inventory of the 'molecular parts' in nature, such as protein families and folds, and their functions in living organisms. Through the analysis of such inventories, it has been shown that different genomes have very different usage of parts; for example, the common folds in the worm are very different from those in Escherichia coli. RESULTS: Despite these differences, we find that the genomic occurrence of generalized parts follows a well-known mathematical framework called the power law, with a few parts occurring many times and most occurring only a few times. This observation is true in a wide variety of genomic contexts. Earlier studies found power laws in a few specific cases, such as the occurrence of protein families. Here, we find many further cases of power-law behavior, for example in the occurrence of pseudogenes and in levels of gene expression. We show comprehensively that this behavior applies across many different genomes, for many different types of parts (DNA words, InterPro families, protein superfamilies and folds, pseudogene families and pseudomotifs), and for the many disparate attributes associated with these parts (their functions, interactions and expression levels). CONCLUSIONS: Power-law behavior provides a concise mathematical description of an important biological feature: the sheer dominance of a few members over the overall population. We present this behavior in a unified framework and propose that all these observations are connected to an underlying DNA duplication process as genomes evolved to their current state.
format Text
id pubmed-126234
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1262342002-09-25 The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties Luscombe, Nicholas M Qian, Jiang Zhang, Zhaolei Johnson, Ted Gerstein, Mark Genome Biol Research BACKGROUND: The sequencing of genomes provides us with an inventory of the 'molecular parts' in nature, such as protein families and folds, and their functions in living organisms. Through the analysis of such inventories, it has been shown that different genomes have very different usage of parts; for example, the common folds in the worm are very different from those in Escherichia coli. RESULTS: Despite these differences, we find that the genomic occurrence of generalized parts follows a well-known mathematical framework called the power law, with a few parts occurring many times and most occurring only a few times. This observation is true in a wide variety of genomic contexts. Earlier studies found power laws in a few specific cases, such as the occurrence of protein families. Here, we find many further cases of power-law behavior, for example in the occurrence of pseudogenes and in levels of gene expression. We show comprehensively that this behavior applies across many different genomes, for many different types of parts (DNA words, InterPro families, protein superfamilies and folds, pseudogene families and pseudomotifs), and for the many disparate attributes associated with these parts (their functions, interactions and expression levels). CONCLUSIONS: Power-law behavior provides a concise mathematical description of an important biological feature: the sheer dominance of a few members over the overall population. We present this behavior in a unified framework and propose that all these observations are connected to an underlying DNA duplication process as genomes evolved to their current state. BioMed Central 2002 2002-07-25 /pmc/articles/PMC126234/ /pubmed/12186647 Text en Copyright © 2002 Luscombe et al., licensee BioMed Central Ltd
spellingShingle Research
Luscombe, Nicholas M
Qian, Jiang
Zhang, Zhaolei
Johnson, Ted
Gerstein, Mark
The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
title The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
title_full The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
title_fullStr The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
title_full_unstemmed The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
title_short The dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
title_sort dominance of the population by a selected few: power-law behaviour applies to a wide variety of genomic properties
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC126234/
https://www.ncbi.nlm.nih.gov/pubmed/12186647
work_keys_str_mv AT luscombenicholasm thedominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT qianjiang thedominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT zhangzhaolei thedominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT johnsonted thedominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT gersteinmark thedominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT luscombenicholasm dominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT qianjiang dominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT zhangzhaolei dominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT johnsonted dominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties
AT gersteinmark dominanceofthepopulationbyaselectedfewpowerlawbehaviourappliestoawidevarietyofgenomicproperties