Cargando…

Genome-wide computational prediction of tandem gene arrays: application in yeasts

BACKGROUND: This paper describes an efficient in silico method for detecting tandem gene arrays (TGAs) in fully sequenced and compact genomes such as those of prokaryotes or unicellular eukaryotes. The originality of this method lies in the search of protein sequence similarities in the vicinity of...

Descripción completa

Detalles Bibliográficos
Autores principales: Despons, Laurence, Baret, Philippe V, Frangeul, Lionel, Louis, Véronique Leh, Durrens, Pascal, Souciet, Jean-Luc
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2822764/
https://www.ncbi.nlm.nih.gov/pubmed/20092627
http://dx.doi.org/10.1186/1471-2164-11-56
_version_ 1782177554048745472
author Despons, Laurence
Baret, Philippe V
Frangeul, Lionel
Louis, Véronique Leh
Durrens, Pascal
Souciet, Jean-Luc
author_facet Despons, Laurence
Baret, Philippe V
Frangeul, Lionel
Louis, Véronique Leh
Durrens, Pascal
Souciet, Jean-Luc
author_sort Despons, Laurence
collection PubMed
description BACKGROUND: This paper describes an efficient in silico method for detecting tandem gene arrays (TGAs) in fully sequenced and compact genomes such as those of prokaryotes or unicellular eukaryotes. The originality of this method lies in the search of protein sequence similarities in the vicinity of each coding sequence, which allows the prediction of tandem duplicated gene copies independently of their functionality. RESULTS: Applied to nine hemiascomycete yeast genomes, this method predicts that 2% of the genes are involved in TGAs and gene relics are present in 11% of TGAs. The frequency of TGAs with degenerated gene copies means that a significant fraction of tandem duplicated genes follows the birth-and-death model of evolution. A comparison of sequence identity distributions between sets of homologous gene pairs shows that the different copies of tandem arrayed paralogs are less divergent than copies of dispersed paralogs in yeast genomes. It suggests that paralogs included in tandem structures are more recent or more subject to the gene conversion mechanism than other paralogs. CONCLUSION: The method reported here is a useful computational tool to provide a database of TGAs composed of functional or nonfunctional gene copies. Such a database has obvious applications in the fields of structural and comparative genomics. Notably, a detailed study of the TGA catalog will make it possible to tackle the fundamental questions of the origin and evolution of tandem gene clusters.
format Text
id pubmed-2822764
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28227642010-02-17 Genome-wide computational prediction of tandem gene arrays: application in yeasts Despons, Laurence Baret, Philippe V Frangeul, Lionel Louis, Véronique Leh Durrens, Pascal Souciet, Jean-Luc BMC Genomics Methodology Article BACKGROUND: This paper describes an efficient in silico method for detecting tandem gene arrays (TGAs) in fully sequenced and compact genomes such as those of prokaryotes or unicellular eukaryotes. The originality of this method lies in the search of protein sequence similarities in the vicinity of each coding sequence, which allows the prediction of tandem duplicated gene copies independently of their functionality. RESULTS: Applied to nine hemiascomycete yeast genomes, this method predicts that 2% of the genes are involved in TGAs and gene relics are present in 11% of TGAs. The frequency of TGAs with degenerated gene copies means that a significant fraction of tandem duplicated genes follows the birth-and-death model of evolution. A comparison of sequence identity distributions between sets of homologous gene pairs shows that the different copies of tandem arrayed paralogs are less divergent than copies of dispersed paralogs in yeast genomes. It suggests that paralogs included in tandem structures are more recent or more subject to the gene conversion mechanism than other paralogs. CONCLUSION: The method reported here is a useful computational tool to provide a database of TGAs composed of functional or nonfunctional gene copies. Such a database has obvious applications in the fields of structural and comparative genomics. Notably, a detailed study of the TGA catalog will make it possible to tackle the fundamental questions of the origin and evolution of tandem gene clusters. BioMed Central 2010-01-21 /pmc/articles/PMC2822764/ /pubmed/20092627 http://dx.doi.org/10.1186/1471-2164-11-56 Text en Copyright ©2010 Despons et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Despons, Laurence
Baret, Philippe V
Frangeul, Lionel
Louis, Véronique Leh
Durrens, Pascal
Souciet, Jean-Luc
Genome-wide computational prediction of tandem gene arrays: application in yeasts
title Genome-wide computational prediction of tandem gene arrays: application in yeasts
title_full Genome-wide computational prediction of tandem gene arrays: application in yeasts
title_fullStr Genome-wide computational prediction of tandem gene arrays: application in yeasts
title_full_unstemmed Genome-wide computational prediction of tandem gene arrays: application in yeasts
title_short Genome-wide computational prediction of tandem gene arrays: application in yeasts
title_sort genome-wide computational prediction of tandem gene arrays: application in yeasts
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2822764/
https://www.ncbi.nlm.nih.gov/pubmed/20092627
http://dx.doi.org/10.1186/1471-2164-11-56
work_keys_str_mv AT desponslaurence genomewidecomputationalpredictionoftandemgenearraysapplicationinyeasts
AT baretphilippev genomewidecomputationalpredictionoftandemgenearraysapplicationinyeasts
AT frangeullionel genomewidecomputationalpredictionoftandemgenearraysapplicationinyeasts
AT louisveroniqueleh genomewidecomputationalpredictionoftandemgenearraysapplicationinyeasts
AT durrenspascal genomewidecomputationalpredictionoftandemgenearraysapplicationinyeasts
AT soucietjeanluc genomewidecomputationalpredictionoftandemgenearraysapplicationinyeasts