Cargando…
A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes
BACKGROUND: The C2H2 zinc-finger (ZNF) containing gene family is one of the largest and most complex gene families in metazoan genomes. These genes are known to exist in almost all eukaryotes, and they constitute a major subset of eukaryotic transcription factors. The genes of this family usually oc...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3701560/ https://www.ncbi.nlm.nih.gov/pubmed/23800006 http://dx.doi.org/10.1186/1471-2164-14-420 |
_version_ | 1782275665170530304 |
---|---|
author | Seetharam, Arun Stuart, Gary W |
author_facet | Seetharam, Arun Stuart, Gary W |
author_sort | Seetharam, Arun |
collection | PubMed |
description | BACKGROUND: The C2H2 zinc-finger (ZNF) containing gene family is one of the largest and most complex gene families in metazoan genomes. These genes are known to exist in almost all eukaryotes, and they constitute a major subset of eukaryotic transcription factors. The genes of this family usually occur as clusters in genomes and are thought to have undergone a massive expansion in vertebrates by multiple tandem duplication events (BMC Evol Biol 8:176, 2008). RESULTS: In this study, we combined two popular approaches for homolog detection, Reciprocal Best Hit (RBH) (Proc Natl Acad Sci USA 95:6239–6244, 1998) and Hidden–Markov model (HMM) profiles search (Bioinformatics 14:755-763, 1998), on a diverse set of complete genomes of 124 eukaryotic species ranging from excavates to humans to identify all detectable members of 37 C2H2 ZNF gene families. We succeeded in identifying 3,890 genes as distinct members of 37 C2H2 gene families. These 37 families are distributed among the eukaryotes as progressive additions of gene blocks with increasing complexity of the organisms. The first block featuring the protists had 7 families, the second block featuring plants had 2 families, the third block featuring the fungi had 2 families (one of which was also present in plants) and the final block consisted of metazoans with 25 families. Among the metazoans, the simpler unicellular metazoans had just 15 of the 25 families while most of the bilaterians had all 25 families making up a total of 37 families. Multiple potential examples of lineage-specific gene duplications and gene losses were also observed. CONCLUSIONS: Our hybrid approach combines features of the both RBH and HMM methods for homolog detection. This largely automated technique is much faster than manual methods and is able to detect homologs accurately and efficiently among a diverse set of organisms. Our analysis of the 37 evolutionarily conserved C2H2 ZNF gene families revealed a stepwise appearance of ZNF families, agreeing well with the phylogenetic relationship of the organisms compared and their presumed stepwise increase in complexity (Science 300:1694, 2003). |
format | Online Article Text |
id | pubmed-3701560 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37015602013-07-05 A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes Seetharam, Arun Stuart, Gary W BMC Genomics Research Article BACKGROUND: The C2H2 zinc-finger (ZNF) containing gene family is one of the largest and most complex gene families in metazoan genomes. These genes are known to exist in almost all eukaryotes, and they constitute a major subset of eukaryotic transcription factors. The genes of this family usually occur as clusters in genomes and are thought to have undergone a massive expansion in vertebrates by multiple tandem duplication events (BMC Evol Biol 8:176, 2008). RESULTS: In this study, we combined two popular approaches for homolog detection, Reciprocal Best Hit (RBH) (Proc Natl Acad Sci USA 95:6239–6244, 1998) and Hidden–Markov model (HMM) profiles search (Bioinformatics 14:755-763, 1998), on a diverse set of complete genomes of 124 eukaryotic species ranging from excavates to humans to identify all detectable members of 37 C2H2 ZNF gene families. We succeeded in identifying 3,890 genes as distinct members of 37 C2H2 gene families. These 37 families are distributed among the eukaryotes as progressive additions of gene blocks with increasing complexity of the organisms. The first block featuring the protists had 7 families, the second block featuring plants had 2 families, the third block featuring the fungi had 2 families (one of which was also present in plants) and the final block consisted of metazoans with 25 families. Among the metazoans, the simpler unicellular metazoans had just 15 of the 25 families while most of the bilaterians had all 25 families making up a total of 37 families. Multiple potential examples of lineage-specific gene duplications and gene losses were also observed. CONCLUSIONS: Our hybrid approach combines features of the both RBH and HMM methods for homolog detection. This largely automated technique is much faster than manual methods and is able to detect homologs accurately and efficiently among a diverse set of organisms. Our analysis of the 37 evolutionarily conserved C2H2 ZNF gene families revealed a stepwise appearance of ZNF families, agreeing well with the phylogenetic relationship of the organisms compared and their presumed stepwise increase in complexity (Science 300:1694, 2003). BioMed Central 2013-06-24 /pmc/articles/PMC3701560/ /pubmed/23800006 http://dx.doi.org/10.1186/1471-2164-14-420 Text en Copyright © 2013 Seetharam and Stuart; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Seetharam, Arun Stuart, Gary W A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes |
title | A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes |
title_full | A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes |
title_fullStr | A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes |
title_full_unstemmed | A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes |
title_short | A study on the distribution of 37 well conserved families of C2H2 zinc finger genes in eukaryotes |
title_sort | study on the distribution of 37 well conserved families of c2h2 zinc finger genes in eukaryotes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3701560/ https://www.ncbi.nlm.nih.gov/pubmed/23800006 http://dx.doi.org/10.1186/1471-2164-14-420 |
work_keys_str_mv | AT seetharamarun astudyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes AT stuartgaryw astudyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes AT seetharamarun studyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes AT stuartgaryw studyonthedistributionof37wellconservedfamiliesofc2h2zincfingergenesineukaryotes |