Cargando…

PlantOrDB: a genome-wide ortholog database for land plants and green algae

BACKGROUND: Genes with different functions are originally generated from some ancestral genes by gene duplication, mutation and functional recombination. It is widely accepted that orthologs are homologous genes evolved from speciation events while paralogs are homologous genes resulted from gene du...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Lei, Ji, Guoli, Ye, Congting, Shu, Changlong, Zhang, Jie, Liang, Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481079/
https://www.ncbi.nlm.nih.gov/pubmed/26112452
http://dx.doi.org/10.1186/s12870-015-0531-4
_version_ 1782378236026552320
author Li, Lei
Ji, Guoli
Ye, Congting
Shu, Changlong
Zhang, Jie
Liang, Chun
author_facet Li, Lei
Ji, Guoli
Ye, Congting
Shu, Changlong
Zhang, Jie
Liang, Chun
author_sort Li, Lei
collection PubMed
description BACKGROUND: Genes with different functions are originally generated from some ancestral genes by gene duplication, mutation and functional recombination. It is widely accepted that orthologs are homologous genes evolved from speciation events while paralogs are homologous genes resulted from gene duplication events.With the rapid increase of genomic data, identifying and distinguishing these genes among different species is becoming an important part of functional genomics research. DESCRIPTION: Using 35 plant and 6 green algal genomes from Phytozome v9, we clustered 1,291,670 peptide sequences into 49,355 homologous gene families in terms of sequence similarity. For each gene family, we have generated a peptide sequence alignment and phylogenetic tree, and identified the speciation/duplication events for every node within the tree. For each node, we also identified and highlighted diagnostic characters that facilitate appropriate addition of a new query sequence into the existing phylogenetic tree and sequence alignment of its best matched gene family. Based on a desired species or subgroup of all species, users can view the phylogenetic tree, sequence alignment and diagnostic characters for a given gene family selectively. PlantOrDB not only allows users to identify orthologs or paralogs from phylogenetic trees, but also provides all orthologs that are built using Reciprocal Best Hit (RBH) pairwise alignment method. Users can upload their own sequences to find the best matched gene families, and visualize their query sequences within the relevant phylogenetic trees and sequence alignments. CONCLUSION: PlantOrDB (http://bioinfolab.miamioh.edu/plantordb) is a genome-wide ortholog database for land plants and green algae. PlantOrDB offers highly interactive visualization, accurate query classification and powerful search functions useful for functional genomic research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-015-0531-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4481079
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44810792015-06-27 PlantOrDB: a genome-wide ortholog database for land plants and green algae Li, Lei Ji, Guoli Ye, Congting Shu, Changlong Zhang, Jie Liang, Chun BMC Plant Biol Database BACKGROUND: Genes with different functions are originally generated from some ancestral genes by gene duplication, mutation and functional recombination. It is widely accepted that orthologs are homologous genes evolved from speciation events while paralogs are homologous genes resulted from gene duplication events.With the rapid increase of genomic data, identifying and distinguishing these genes among different species is becoming an important part of functional genomics research. DESCRIPTION: Using 35 plant and 6 green algal genomes from Phytozome v9, we clustered 1,291,670 peptide sequences into 49,355 homologous gene families in terms of sequence similarity. For each gene family, we have generated a peptide sequence alignment and phylogenetic tree, and identified the speciation/duplication events for every node within the tree. For each node, we also identified and highlighted diagnostic characters that facilitate appropriate addition of a new query sequence into the existing phylogenetic tree and sequence alignment of its best matched gene family. Based on a desired species or subgroup of all species, users can view the phylogenetic tree, sequence alignment and diagnostic characters for a given gene family selectively. PlantOrDB not only allows users to identify orthologs or paralogs from phylogenetic trees, but also provides all orthologs that are built using Reciprocal Best Hit (RBH) pairwise alignment method. Users can upload their own sequences to find the best matched gene families, and visualize their query sequences within the relevant phylogenetic trees and sequence alignments. CONCLUSION: PlantOrDB (http://bioinfolab.miamioh.edu/plantordb) is a genome-wide ortholog database for land plants and green algae. PlantOrDB offers highly interactive visualization, accurate query classification and powerful search functions useful for functional genomic research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-015-0531-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-26 /pmc/articles/PMC4481079/ /pubmed/26112452 http://dx.doi.org/10.1186/s12870-015-0531-4 Text en © Li et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Database
Li, Lei
Ji, Guoli
Ye, Congting
Shu, Changlong
Zhang, Jie
Liang, Chun
PlantOrDB: a genome-wide ortholog database for land plants and green algae
title PlantOrDB: a genome-wide ortholog database for land plants and green algae
title_full PlantOrDB: a genome-wide ortholog database for land plants and green algae
title_fullStr PlantOrDB: a genome-wide ortholog database for land plants and green algae
title_full_unstemmed PlantOrDB: a genome-wide ortholog database for land plants and green algae
title_short PlantOrDB: a genome-wide ortholog database for land plants and green algae
title_sort plantordb: a genome-wide ortholog database for land plants and green algae
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481079/
https://www.ncbi.nlm.nih.gov/pubmed/26112452
http://dx.doi.org/10.1186/s12870-015-0531-4
work_keys_str_mv AT lilei plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae
AT jiguoli plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae
AT yecongting plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae
AT shuchanglong plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae
AT zhangjie plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae
AT liangchun plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae