Cargando…
FUNYBASE: a FUNgal phYlogenomic dataBASE
BACKGROUND: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However the heterogeneity of data, including the quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations c...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2008
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2600828/ https://www.ncbi.nlm.nih.gov/pubmed/18954438 http://dx.doi.org/10.1186/1471-2105-9-456 |
_version_ | 1782162215612186624 |
---|---|
author | Marthey, Sylvain Aguileta, Gabriela Rodolphe, François Gendrault, Annie Giraud, Tatiana Fournier, Elisabeth Lopez-Villavicencio, Manuela Gautier, Angélique Lebrun, Marc-Henri Chiapello, Hélène |
author_facet | Marthey, Sylvain Aguileta, Gabriela Rodolphe, François Gendrault, Annie Giraud, Tatiana Fournier, Elisabeth Lopez-Villavicencio, Manuela Gautier, Angélique Lebrun, Marc-Henri Chiapello, Hélène |
author_sort | Marthey, Sylvain |
collection | PubMed |
description | BACKGROUND: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However the heterogeneity of data, including the quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations challenging. The aim of this study was to provide a reliable and integrated resource of orthologous gene families to perform comparative and phylogenetic analyses in fungi. DESCRIPTION: FUNYBASE is a database dedicated to the analysis of fungal single-copy genes extracted from available fungal genomes sequences, their classification into reliable clusters of orthologs, and the assessment of their informative value for phylogenetic reconstruction based on amino acid sequences. The current release of FUNYBASE contains two types of protein data: (i) a complete set of protein sequences extracted from 30 public fungal genomes and classified into clusters of orthologs using a robust automated procedure, and (ii) a subset of 246 reliable ortholog clusters present as single copy genes in 21 fungal genomes. For each of these 246 ortholog clusters, phylogenetic trees were reconstructed based on their amino acid sequences. To assess the informative value of each ortholog cluster, each was compared to a reference species tree constructed using a concatenation of roughly half of the 246 sequences that are best approximated by the WAG evolutionary model. The orthologs were classified according to a topological score, which measures their ability to recover the same topology as the reference species tree. The full results of these analyses are available on-line with a user-friendly interface that allows for searches to be performed by species name, the ortholog cluster, various keywords, or using the BLAST algorithm. Examples of fruitful utilization of FUNYBASE for investigation of fungal phylogenetics are also presented. CONCLUSION: FUNYBASE constitutes a novel and useful resource for two types of analyses: (i) comparative studies can be greatly facilitated by reliable clusters of orthologs across sets of user-defined fungal genomes, and (ii) phylogenetic reconstruction can be improved by identifying genes with the highest informative value at the desired taxonomic level. |
format | Text |
id | pubmed-2600828 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2008 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26008282008-12-12 FUNYBASE: a FUNgal phYlogenomic dataBASE Marthey, Sylvain Aguileta, Gabriela Rodolphe, François Gendrault, Annie Giraud, Tatiana Fournier, Elisabeth Lopez-Villavicencio, Manuela Gautier, Angélique Lebrun, Marc-Henri Chiapello, Hélène BMC Bioinformatics Database BACKGROUND: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However the heterogeneity of data, including the quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations challenging. The aim of this study was to provide a reliable and integrated resource of orthologous gene families to perform comparative and phylogenetic analyses in fungi. DESCRIPTION: FUNYBASE is a database dedicated to the analysis of fungal single-copy genes extracted from available fungal genomes sequences, their classification into reliable clusters of orthologs, and the assessment of their informative value for phylogenetic reconstruction based on amino acid sequences. The current release of FUNYBASE contains two types of protein data: (i) a complete set of protein sequences extracted from 30 public fungal genomes and classified into clusters of orthologs using a robust automated procedure, and (ii) a subset of 246 reliable ortholog clusters present as single copy genes in 21 fungal genomes. For each of these 246 ortholog clusters, phylogenetic trees were reconstructed based on their amino acid sequences. To assess the informative value of each ortholog cluster, each was compared to a reference species tree constructed using a concatenation of roughly half of the 246 sequences that are best approximated by the WAG evolutionary model. The orthologs were classified according to a topological score, which measures their ability to recover the same topology as the reference species tree. The full results of these analyses are available on-line with a user-friendly interface that allows for searches to be performed by species name, the ortholog cluster, various keywords, or using the BLAST algorithm. Examples of fruitful utilization of FUNYBASE for investigation of fungal phylogenetics are also presented. CONCLUSION: FUNYBASE constitutes a novel and useful resource for two types of analyses: (i) comparative studies can be greatly facilitated by reliable clusters of orthologs across sets of user-defined fungal genomes, and (ii) phylogenetic reconstruction can be improved by identifying genes with the highest informative value at the desired taxonomic level. BioMed Central 2008-10-27 /pmc/articles/PMC2600828/ /pubmed/18954438 http://dx.doi.org/10.1186/1471-2105-9-456 Text en Copyright © 2008 Marthey et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Database Marthey, Sylvain Aguileta, Gabriela Rodolphe, François Gendrault, Annie Giraud, Tatiana Fournier, Elisabeth Lopez-Villavicencio, Manuela Gautier, Angélique Lebrun, Marc-Henri Chiapello, Hélène FUNYBASE: a FUNgal phYlogenomic dataBASE |
title | FUNYBASE: a FUNgal phYlogenomic dataBASE |
title_full | FUNYBASE: a FUNgal phYlogenomic dataBASE |
title_fullStr | FUNYBASE: a FUNgal phYlogenomic dataBASE |
title_full_unstemmed | FUNYBASE: a FUNgal phYlogenomic dataBASE |
title_short | FUNYBASE: a FUNgal phYlogenomic dataBASE |
title_sort | funybase: a fungal phylogenomic database |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2600828/ https://www.ncbi.nlm.nih.gov/pubmed/18954438 http://dx.doi.org/10.1186/1471-2105-9-456 |
work_keys_str_mv | AT martheysylvain funybaseafungalphylogenomicdatabase AT aguiletagabriela funybaseafungalphylogenomicdatabase AT rodolphefrancois funybaseafungalphylogenomicdatabase AT gendraultannie funybaseafungalphylogenomicdatabase AT giraudtatiana funybaseafungalphylogenomicdatabase AT fournierelisabeth funybaseafungalphylogenomicdatabase AT lopezvillavicenciomanuela funybaseafungalphylogenomicdatabase AT gautierangelique funybaseafungalphylogenomicdatabase AT lebrunmarchenri funybaseafungalphylogenomicdatabase AT chiapellohelene funybaseafungalphylogenomicdatabase |