Cargando…

FUNYBASE: a FUNgal phYlogenomic dataBASE

BACKGROUND: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However the heterogeneity of data, including the quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations c...

Descripción completa

Detalles Bibliográficos
Autores principales: Marthey, Sylvain, Aguileta, Gabriela, Rodolphe, François, Gendrault, Annie, Giraud, Tatiana, Fournier, Elisabeth, Lopez-Villavicencio, Manuela, Gautier, Angélique, Lebrun, Marc-Henri, Chiapello, Hélène
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2600828/
https://www.ncbi.nlm.nih.gov/pubmed/18954438
http://dx.doi.org/10.1186/1471-2105-9-456
_version_ 1782162215612186624
author Marthey, Sylvain
Aguileta, Gabriela
Rodolphe, François
Gendrault, Annie
Giraud, Tatiana
Fournier, Elisabeth
Lopez-Villavicencio, Manuela
Gautier, Angélique
Lebrun, Marc-Henri
Chiapello, Hélène
author_facet Marthey, Sylvain
Aguileta, Gabriela
Rodolphe, François
Gendrault, Annie
Giraud, Tatiana
Fournier, Elisabeth
Lopez-Villavicencio, Manuela
Gautier, Angélique
Lebrun, Marc-Henri
Chiapello, Hélène
author_sort Marthey, Sylvain
collection PubMed
description BACKGROUND: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However the heterogeneity of data, including the quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations challenging. The aim of this study was to provide a reliable and integrated resource of orthologous gene families to perform comparative and phylogenetic analyses in fungi. DESCRIPTION: FUNYBASE is a database dedicated to the analysis of fungal single-copy genes extracted from available fungal genomes sequences, their classification into reliable clusters of orthologs, and the assessment of their informative value for phylogenetic reconstruction based on amino acid sequences. The current release of FUNYBASE contains two types of protein data: (i) a complete set of protein sequences extracted from 30 public fungal genomes and classified into clusters of orthologs using a robust automated procedure, and (ii) a subset of 246 reliable ortholog clusters present as single copy genes in 21 fungal genomes. For each of these 246 ortholog clusters, phylogenetic trees were reconstructed based on their amino acid sequences. To assess the informative value of each ortholog cluster, each was compared to a reference species tree constructed using a concatenation of roughly half of the 246 sequences that are best approximated by the WAG evolutionary model. The orthologs were classified according to a topological score, which measures their ability to recover the same topology as the reference species tree. The full results of these analyses are available on-line with a user-friendly interface that allows for searches to be performed by species name, the ortholog cluster, various keywords, or using the BLAST algorithm. Examples of fruitful utilization of FUNYBASE for investigation of fungal phylogenetics are also presented. CONCLUSION: FUNYBASE constitutes a novel and useful resource for two types of analyses: (i) comparative studies can be greatly facilitated by reliable clusters of orthologs across sets of user-defined fungal genomes, and (ii) phylogenetic reconstruction can be improved by identifying genes with the highest informative value at the desired taxonomic level.
format Text
id pubmed-2600828
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26008282008-12-12 FUNYBASE: a FUNgal phYlogenomic dataBASE Marthey, Sylvain Aguileta, Gabriela Rodolphe, François Gendrault, Annie Giraud, Tatiana Fournier, Elisabeth Lopez-Villavicencio, Manuela Gautier, Angélique Lebrun, Marc-Henri Chiapello, Hélène BMC Bioinformatics Database BACKGROUND: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However the heterogeneity of data, including the quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations challenging. The aim of this study was to provide a reliable and integrated resource of orthologous gene families to perform comparative and phylogenetic analyses in fungi. DESCRIPTION: FUNYBASE is a database dedicated to the analysis of fungal single-copy genes extracted from available fungal genomes sequences, their classification into reliable clusters of orthologs, and the assessment of their informative value for phylogenetic reconstruction based on amino acid sequences. The current release of FUNYBASE contains two types of protein data: (i) a complete set of protein sequences extracted from 30 public fungal genomes and classified into clusters of orthologs using a robust automated procedure, and (ii) a subset of 246 reliable ortholog clusters present as single copy genes in 21 fungal genomes. For each of these 246 ortholog clusters, phylogenetic trees were reconstructed based on their amino acid sequences. To assess the informative value of each ortholog cluster, each was compared to a reference species tree constructed using a concatenation of roughly half of the 246 sequences that are best approximated by the WAG evolutionary model. The orthologs were classified according to a topological score, which measures their ability to recover the same topology as the reference species tree. The full results of these analyses are available on-line with a user-friendly interface that allows for searches to be performed by species name, the ortholog cluster, various keywords, or using the BLAST algorithm. Examples of fruitful utilization of FUNYBASE for investigation of fungal phylogenetics are also presented. CONCLUSION: FUNYBASE constitutes a novel and useful resource for two types of analyses: (i) comparative studies can be greatly facilitated by reliable clusters of orthologs across sets of user-defined fungal genomes, and (ii) phylogenetic reconstruction can be improved by identifying genes with the highest informative value at the desired taxonomic level. BioMed Central 2008-10-27 /pmc/articles/PMC2600828/ /pubmed/18954438 http://dx.doi.org/10.1186/1471-2105-9-456 Text en Copyright © 2008 Marthey et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Marthey, Sylvain
Aguileta, Gabriela
Rodolphe, François
Gendrault, Annie
Giraud, Tatiana
Fournier, Elisabeth
Lopez-Villavicencio, Manuela
Gautier, Angélique
Lebrun, Marc-Henri
Chiapello, Hélène
FUNYBASE: a FUNgal phYlogenomic dataBASE
title FUNYBASE: a FUNgal phYlogenomic dataBASE
title_full FUNYBASE: a FUNgal phYlogenomic dataBASE
title_fullStr FUNYBASE: a FUNgal phYlogenomic dataBASE
title_full_unstemmed FUNYBASE: a FUNgal phYlogenomic dataBASE
title_short FUNYBASE: a FUNgal phYlogenomic dataBASE
title_sort funybase: a fungal phylogenomic database
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2600828/
https://www.ncbi.nlm.nih.gov/pubmed/18954438
http://dx.doi.org/10.1186/1471-2105-9-456
work_keys_str_mv AT martheysylvain funybaseafungalphylogenomicdatabase
AT aguiletagabriela funybaseafungalphylogenomicdatabase
AT rodolphefrancois funybaseafungalphylogenomicdatabase
AT gendraultannie funybaseafungalphylogenomicdatabase
AT giraudtatiana funybaseafungalphylogenomicdatabase
AT fournierelisabeth funybaseafungalphylogenomicdatabase
AT lopezvillavicenciomanuela funybaseafungalphylogenomicdatabase
AT gautierangelique funybaseafungalphylogenomicdatabase
AT lebrunmarchenri funybaseafungalphylogenomicdatabase
AT chiapellohelene funybaseafungalphylogenomicdatabase