Cargando…
PlantOrDB: a genome-wide ortholog database for land plants and green algae
BACKGROUND: Genes with different functions are originally generated from some ancestral genes by gene duplication, mutation and functional recombination. It is widely accepted that orthologs are homologous genes evolved from speciation events while paralogs are homologous genes resulted from gene du...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481079/ https://www.ncbi.nlm.nih.gov/pubmed/26112452 http://dx.doi.org/10.1186/s12870-015-0531-4 |
_version_ | 1782378236026552320 |
---|---|
author | Li, Lei Ji, Guoli Ye, Congting Shu, Changlong Zhang, Jie Liang, Chun |
author_facet | Li, Lei Ji, Guoli Ye, Congting Shu, Changlong Zhang, Jie Liang, Chun |
author_sort | Li, Lei |
collection | PubMed |
description | BACKGROUND: Genes with different functions are originally generated from some ancestral genes by gene duplication, mutation and functional recombination. It is widely accepted that orthologs are homologous genes evolved from speciation events while paralogs are homologous genes resulted from gene duplication events.With the rapid increase of genomic data, identifying and distinguishing these genes among different species is becoming an important part of functional genomics research. DESCRIPTION: Using 35 plant and 6 green algal genomes from Phytozome v9, we clustered 1,291,670 peptide sequences into 49,355 homologous gene families in terms of sequence similarity. For each gene family, we have generated a peptide sequence alignment and phylogenetic tree, and identified the speciation/duplication events for every node within the tree. For each node, we also identified and highlighted diagnostic characters that facilitate appropriate addition of a new query sequence into the existing phylogenetic tree and sequence alignment of its best matched gene family. Based on a desired species or subgroup of all species, users can view the phylogenetic tree, sequence alignment and diagnostic characters for a given gene family selectively. PlantOrDB not only allows users to identify orthologs or paralogs from phylogenetic trees, but also provides all orthologs that are built using Reciprocal Best Hit (RBH) pairwise alignment method. Users can upload their own sequences to find the best matched gene families, and visualize their query sequences within the relevant phylogenetic trees and sequence alignments. CONCLUSION: PlantOrDB (http://bioinfolab.miamioh.edu/plantordb) is a genome-wide ortholog database for land plants and green algae. PlantOrDB offers highly interactive visualization, accurate query classification and powerful search functions useful for functional genomic research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-015-0531-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4481079 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-44810792015-06-27 PlantOrDB: a genome-wide ortholog database for land plants and green algae Li, Lei Ji, Guoli Ye, Congting Shu, Changlong Zhang, Jie Liang, Chun BMC Plant Biol Database BACKGROUND: Genes with different functions are originally generated from some ancestral genes by gene duplication, mutation and functional recombination. It is widely accepted that orthologs are homologous genes evolved from speciation events while paralogs are homologous genes resulted from gene duplication events.With the rapid increase of genomic data, identifying and distinguishing these genes among different species is becoming an important part of functional genomics research. DESCRIPTION: Using 35 plant and 6 green algal genomes from Phytozome v9, we clustered 1,291,670 peptide sequences into 49,355 homologous gene families in terms of sequence similarity. For each gene family, we have generated a peptide sequence alignment and phylogenetic tree, and identified the speciation/duplication events for every node within the tree. For each node, we also identified and highlighted diagnostic characters that facilitate appropriate addition of a new query sequence into the existing phylogenetic tree and sequence alignment of its best matched gene family. Based on a desired species or subgroup of all species, users can view the phylogenetic tree, sequence alignment and diagnostic characters for a given gene family selectively. PlantOrDB not only allows users to identify orthologs or paralogs from phylogenetic trees, but also provides all orthologs that are built using Reciprocal Best Hit (RBH) pairwise alignment method. Users can upload their own sequences to find the best matched gene families, and visualize their query sequences within the relevant phylogenetic trees and sequence alignments. CONCLUSION: PlantOrDB (http://bioinfolab.miamioh.edu/plantordb) is a genome-wide ortholog database for land plants and green algae. PlantOrDB offers highly interactive visualization, accurate query classification and powerful search functions useful for functional genomic research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-015-0531-4) contains supplementary material, which is available to authorized users. BioMed Central 2015-06-26 /pmc/articles/PMC4481079/ /pubmed/26112452 http://dx.doi.org/10.1186/s12870-015-0531-4 Text en © Li et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Database Li, Lei Ji, Guoli Ye, Congting Shu, Changlong Zhang, Jie Liang, Chun PlantOrDB: a genome-wide ortholog database for land plants and green algae |
title | PlantOrDB: a genome-wide ortholog database for land plants and green algae |
title_full | PlantOrDB: a genome-wide ortholog database for land plants and green algae |
title_fullStr | PlantOrDB: a genome-wide ortholog database for land plants and green algae |
title_full_unstemmed | PlantOrDB: a genome-wide ortholog database for land plants and green algae |
title_short | PlantOrDB: a genome-wide ortholog database for land plants and green algae |
title_sort | plantordb: a genome-wide ortholog database for land plants and green algae |
topic | Database |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4481079/ https://www.ncbi.nlm.nih.gov/pubmed/26112452 http://dx.doi.org/10.1186/s12870-015-0531-4 |
work_keys_str_mv | AT lilei plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae AT jiguoli plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae AT yecongting plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae AT shuchanglong plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae AT zhangjie plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae AT liangchun plantordbagenomewideorthologdatabaseforlandplantsandgreenalgae |