Cargando…

A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes

BACKGROUND: The C2H2 zinc-finger (ZNF) containing gene family is one of the largest and most complex gene families in metazoan genomes. These genes are known to exist in almost all eukaryotes, and they constitute a major subset of eukaryotic transcription factors. The genes of this family usually oc...

Descripción completa

Detalles Bibliográficos
Autores principales: Seetharam, Arun, Stuart, Gary W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3701560/
https://www.ncbi.nlm.nih.gov/pubmed/23800006
http://dx.doi.org/10.1186/1471-2164-14-420
_version_ 1782275665170530304
author Seetharam, Arun
Stuart, Gary W
author_facet Seetharam, Arun
Stuart, Gary W
author_sort Seetharam, Arun
collection PubMed
description BACKGROUND: The C2H2 zinc-finger (ZNF) containing gene family is one of the largest and most complex gene families in metazoan genomes. These genes are known to exist in almost all eukaryotes, and they constitute a major subset of eukaryotic transcription factors. The genes of this family usually occur as clusters in genomes and are thought to have undergone a massive expansion in vertebrates by multiple tandem duplication events (BMC Evol Biol 8:176, 2008). RESULTS: In this study, we combined two popular approaches for homolog detection, Reciprocal Best Hit (RBH) (Proc Natl Acad Sci USA 95:6239–6244, 1998) and Hidden–Markov model (HMM) profiles search (Bioinformatics 14:755-763, 1998), on a diverse set of complete genomes of 124 eukaryotic species ranging from excavates to humans to identify all detectable members of 37 C2H2 ZNF gene families. We succeeded in identifying 3,890 genes as distinct members of 37 C2H2 gene families. These 37 families are distributed among the eukaryotes as progressive additions of gene blocks with increasing complexity of the organisms. The first block featuring the protists had 7 families, the second block featuring plants had 2 families, the third block featuring the fungi had 2 families (one of which was also present in plants) and the final block consisted of metazoans with 25 families. Among the metazoans, the simpler unicellular metazoans had just 15 of the 25 families while most of the bilaterians had all 25 families making up a total of 37 families. Multiple potential examples of lineage-specific gene duplications and gene losses were also observed. CONCLUSIONS: Our hybrid approach combines features of the both RBH and HMM methods for homolog detection. This largely automated technique is much faster than manual methods and is able to detect homologs accurately and efficiently among a diverse set of organisms. Our analysis of the 37 evolutionarily conserved C2H2 ZNF gene families revealed a stepwise appearance of ZNF families, agreeing well with the phylogenetic relationship of the organisms compared and their presumed stepwise increase in complexity (Science 300:1694, 2003).
format Online
Article
Text
id pubmed-3701560
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-37015602013-07-05 A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes Seetharam, Arun Stuart, Gary W BMC Genomics Research Article BACKGROUND: The C2H2 zinc-finger (ZNF) containing gene family is one of the largest and most complex gene families in metazoan genomes. These genes are known to exist in almost all eukaryotes, and they constitute a major subset of eukaryotic transcription factors. The genes of this family usually occur as clusters in genomes and are thought to have undergone a massive expansion in vertebrates by multiple tandem duplication events (BMC Evol Biol 8:176, 2008). RESULTS: In this study, we combined two popular approaches for homolog detection, Reciprocal Best Hit (RBH) (Proc Natl Acad Sci USA 95:6239–6244, 1998) and Hidden–Markov model (HMM) profiles search (Bioinformatics 14:755-763, 1998), on a diverse set of complete genomes of 124 eukaryotic species ranging from excavates to humans to identify all detectable members of 37 C2H2 ZNF gene families. We succeeded in identifying 3,890 genes as distinct members of 37 C2H2 gene families. These 37 families are distributed among the eukaryotes as progressive additions of gene blocks with increasing complexity of the organisms. The first block featuring the protists had 7 families, the second block featuring plants had 2 families, the third block featuring the fungi had 2 families (one of which was also present in plants) and the final block consisted of metazoans with 25 families. Among the metazoans, the simpler unicellular metazoans had just 15 of the 25 families while most of the bilaterians had all 25 families making up a total of 37 families. Multiple potential examples of lineage-specific gene duplications and gene losses were also observed. CONCLUSIONS: Our hybrid approach combines features of the both RBH and HMM methods for homolog detection. This largely automated technique is much faster than manual methods and is able to detect homologs accurately and efficiently among a diverse set of organisms. Our analysis of the 37 evolutionarily conserved C2H2 ZNF gene families revealed a stepwise appearance of ZNF families, agreeing well with the phylogenetic relationship of the organisms compared and their presumed stepwise increase in complexity (Science 300:1694, 2003). BioMed Central 2013-06-24 /pmc/articles/PMC3701560/ /pubmed/23800006 http://dx.doi.org/10.1186/1471-2164-14-420 Text en Copyright © 2013 Seetharam and Stuart; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Seetharam, Arun
Stuart, Gary W
A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
title A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
title_full A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
title_fullStr A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
title_full_unstemmed A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
title_short A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
title_sort study on the distribution of 37 well conserved families of c2h2 zinc finger genes in eukaryotes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3701560/
https://www.ncbi.nlm.nih.gov/pubmed/23800006
http://dx.doi.org/10.1186/1471-2164-14-420
work_keys_str_mv AT seetharamarun astudyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes
AT stuartgaryw astudyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes
AT seetharamarun studyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes
AT stuartgaryw studyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes