Cargando…

DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection

BACKGROUND: Orthologs are genes derived from the same ancestor gene loci after speciation events. Orthologous proteins usually have similar sequences and perform comparable biological functions. Therefore, ortholog identification is useful in annotations of newly sequenced genomes. With rapidly incr...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Ting-wen, Wu, Timothy H, Ng, Wailap V, Lin, Wen-chang
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957689/
https://www.ncbi.nlm.nih.gov/pubmed/21106128
http://dx.doi.org/10.1186/1471-2105-11-S7-S6
_version_ 1782188250205519872
author Chen, Ting-wen
Wu, Timothy H
Ng, Wailap V
Lin, Wen-chang
author_facet Chen, Ting-wen
Wu, Timothy H
Ng, Wailap V
Lin, Wen-chang
author_sort Chen, Ting-wen
collection PubMed
description BACKGROUND: Orthologs are genes derived from the same ancestor gene loci after speciation events. Orthologous proteins usually have similar sequences and perform comparable biological functions. Therefore, ortholog identification is useful in annotations of newly sequenced genomes. With rapidly increasing number of sequenced genomes, constructing or updating ortholog relationship between all genomes requires lots of effort and computation time. In addition, elucidating ortholog relationships between distantly related genomes is challenging because of the lower sequence similarity. Therefore, an efficient ortholog detection method that can deal with large number of distantly related genomes is desired. RESULTS: An efficient ortholog detection pipeline DODO (DOmain based Detection of Orthologs) is created on the basis of domain architectures in this study. Supported by domain composition, which usually directly related with protein function, DODO could facilitate orthologs detection across distantly related genomes. DODO works in two main steps. Starting from domain information, it first assigns protein groups according to their domain architectures and further identifies orthologs within those groups with much reduced complexity. Here DODO is shown to detect orthologs between two genomes in considerably shorter period of time than traditional methods of reciprocal best hits and it is more significant when analyzed a large number of genomes. The output results of DODO are highly comparable with other known ortholog databases. CONCLUSIONS: DODO provides a new efficient pipeline for detection of orthologs in a large number of genomes. In addition, a database established with DODO is also easier to maintain and could be updated relatively effortlessly. The pipeline of DODO could be downloaded from http://140.109.42.19:16080/dodo_web/home.htm
format Text
id pubmed-2957689
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29576892010-10-21 DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection Chen, Ting-wen Wu, Timothy H Ng, Wailap V Lin, Wen-chang BMC Bioinformatics Proceedings BACKGROUND: Orthologs are genes derived from the same ancestor gene loci after speciation events. Orthologous proteins usually have similar sequences and perform comparable biological functions. Therefore, ortholog identification is useful in annotations of newly sequenced genomes. With rapidly increasing number of sequenced genomes, constructing or updating ortholog relationship between all genomes requires lots of effort and computation time. In addition, elucidating ortholog relationships between distantly related genomes is challenging because of the lower sequence similarity. Therefore, an efficient ortholog detection method that can deal with large number of distantly related genomes is desired. RESULTS: An efficient ortholog detection pipeline DODO (DOmain based Detection of Orthologs) is created on the basis of domain architectures in this study. Supported by domain composition, which usually directly related with protein function, DODO could facilitate orthologs detection across distantly related genomes. DODO works in two main steps. Starting from domain information, it first assigns protein groups according to their domain architectures and further identifies orthologs within those groups with much reduced complexity. Here DODO is shown to detect orthologs between two genomes in considerably shorter period of time than traditional methods of reciprocal best hits and it is more significant when analyzed a large number of genomes. The output results of DODO are highly comparable with other known ortholog databases. CONCLUSIONS: DODO provides a new efficient pipeline for detection of orthologs in a large number of genomes. In addition, a database established with DODO is also easier to maintain and could be updated relatively effortlessly. The pipeline of DODO could be downloaded from http://140.109.42.19:16080/dodo_web/home.htm BioMed Central 2010-10-15 /pmc/articles/PMC2957689/ /pubmed/21106128 http://dx.doi.org/10.1186/1471-2105-11-S7-S6 Text en Copyright © 2010 Chen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Chen, Ting-wen
Wu, Timothy H
Ng, Wailap V
Lin, Wen-chang
DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection
title DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection
title_full DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection
title_fullStr DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection
title_full_unstemmed DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection
title_short DODO: an efficient orthologous genes assignment tool based on domain architectures. Domain based ortholog detection
title_sort dodo: an efficient orthologous genes assignment tool based on domain architectures. domain based ortholog detection
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2957689/
https://www.ncbi.nlm.nih.gov/pubmed/21106128
http://dx.doi.org/10.1186/1471-2105-11-S7-S6
work_keys_str_mv AT chentingwen dodoanefficientorthologousgenesassignmenttoolbasedondomainarchitecturesdomainbasedorthologdetection
AT wutimothyh dodoanefficientorthologousgenesassignmenttoolbasedondomainarchitecturesdomainbasedorthologdetection
AT ngwailapv dodoanefficientorthologousgenesassignmenttoolbasedondomainarchitecturesdomainbasedorthologdetection
AT linwenchang dodoanefficientorthologousgenesassignmenttoolbasedondomainarchitecturesdomainbasedorthologdetection