Cargando…

BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity

BACKGROUND: Identifying corresponding genes (orthologs) in different species is an important step in genome-wide comparative analysis. In particular, one-to-one correspondences between genes in different species greatly simplify certain problems such as transfer of function annotation and genome rea...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Melvin, Leong, Hon Wai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3403649/
https://www.ncbi.nlm.nih.gov/pubmed/23046607
http://dx.doi.org/10.1186/1752-0509-6-S1-S22
_version_ 1782238920779497472
author Zhang, Melvin
Leong, Hon Wai
author_facet Zhang, Melvin
Leong, Hon Wai
author_sort Zhang, Melvin
collection PubMed
description BACKGROUND: Identifying corresponding genes (orthologs) in different species is an important step in genome-wide comparative analysis. In particular, one-to-one correspondences between genes in different species greatly simplify certain problems such as transfer of function annotation and genome rearrangement studies. Positional homologs are the direct descendants of a single ancestral gene in the most recent common ancestor and by definition form one-to-one correspondence. RESULTS: In this work, we present a simple yet effective method (BBH-LS) for the identification of positional homologs from the comparative analysis of two genomes. Our BBH-LS method integrates sequence similarity and gene context similarity in order to get more accurate ortholog assignments. Specifically, BBH-LS applies the bidirectional best hit heuristic to a combination of sequence similarity and gene context similarity scores. CONCLUSION: We applied our method to the human, mouse, and rat genomes and found that BBH-LS produced the best results when using both sequence and gene context information equally. Compared to the state-of-the-art algorithms, such as MSOAR2, BBH-LS is able to identify more positional homologs with fewer false positives.
format Online
Article
Text
id pubmed-3403649
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34036492012-07-25 BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity Zhang, Melvin Leong, Hon Wai BMC Syst Biol Research BACKGROUND: Identifying corresponding genes (orthologs) in different species is an important step in genome-wide comparative analysis. In particular, one-to-one correspondences between genes in different species greatly simplify certain problems such as transfer of function annotation and genome rearrangement studies. Positional homologs are the direct descendants of a single ancestral gene in the most recent common ancestor and by definition form one-to-one correspondence. RESULTS: In this work, we present a simple yet effective method (BBH-LS) for the identification of positional homologs from the comparative analysis of two genomes. Our BBH-LS method integrates sequence similarity and gene context similarity in order to get more accurate ortholog assignments. Specifically, BBH-LS applies the bidirectional best hit heuristic to a combination of sequence similarity and gene context similarity scores. CONCLUSION: We applied our method to the human, mouse, and rat genomes and found that BBH-LS produced the best results when using both sequence and gene context information equally. Compared to the state-of-the-art algorithms, such as MSOAR2, BBH-LS is able to identify more positional homologs with fewer false positives. BioMed Central 2012-07-16 /pmc/articles/PMC3403649/ /pubmed/23046607 http://dx.doi.org/10.1186/1752-0509-6-S1-S22 Text en Copyright ©2012 Zhang and Leong; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Zhang, Melvin
Leong, Hon Wai
BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity
title BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity
title_full BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity
title_fullStr BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity
title_full_unstemmed BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity
title_short BBH-LS: an algorithm for computing positional homologs using sequence and gene context similarity
title_sort bbh-ls: an algorithm for computing positional homologs using sequence and gene context similarity
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3403649/
https://www.ncbi.nlm.nih.gov/pubmed/23046607
http://dx.doi.org/10.1186/1752-0509-6-S1-S22
work_keys_str_mv AT zhangmelvin bbhlsanalgorithmforcomputingpositionalhomologsusingsequenceandgenecontextsimilarity
AT leonghonwai bbhlsanalgorithmforcomputingpositionalhomologsusingsequenceandgenecontextsimilarity