Cargando…

GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters

Inference of multiple sequence alignments (MSAs) is a critical part of phylogenetic and comparative genomics studies. However, from the same set of sequences different MSAs are often inferred, depending on the methodologies used and the assumed parameters. Much effort has recently been devoted to im...

Descripción completa

Detalles Bibliográficos
Autores principales: Sela, Itamar, Ashkenazy, Haim, Katoh, Kazutaka, Pupko, Tal
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4489236/
https://www.ncbi.nlm.nih.gov/pubmed/25883146
http://dx.doi.org/10.1093/nar/gkv318
_version_ 1782379317038153728
author Sela, Itamar
Ashkenazy, Haim
Katoh, Kazutaka
Pupko, Tal
author_facet Sela, Itamar
Ashkenazy, Haim
Katoh, Kazutaka
Pupko, Tal
author_sort Sela, Itamar
collection PubMed
description Inference of multiple sequence alignments (MSAs) is a critical part of phylogenetic and comparative genomics studies. However, from the same set of sequences different MSAs are often inferred, depending on the methodologies used and the assumed parameters. Much effort has recently been devoted to improving the ability to identify unreliable alignment regions. Detecting such unreliable regions was previously shown to be important for downstream analyses relying on MSAs, such as the detection of positive selection. Here we developed GUIDANCE2, a new integrative methodology that accounts for: (i) uncertainty in the process of indel formation, (ii) uncertainty in the assumed guide tree and (iii) co-optimal solutions in the pairwise alignments, used as building blocks in progressive alignment algorithms. We compared GUIDANCE2 with seven methodologies to detect unreliable MSA regions using extensive simulations and empirical benchmarks. We show that GUIDANCE2 outperforms all previously developed methodologies. Furthermore, GUIDANCE2 also provides a set of alternative MSAs which can be useful for downstream analyses. The novel algorithm is implemented as a web-server, available at: http://guidance.tau.ac.il.
format Online
Article
Text
id pubmed-4489236
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44892362015-07-07 GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters Sela, Itamar Ashkenazy, Haim Katoh, Kazutaka Pupko, Tal Nucleic Acids Res Web Server issue Inference of multiple sequence alignments (MSAs) is a critical part of phylogenetic and comparative genomics studies. However, from the same set of sequences different MSAs are often inferred, depending on the methodologies used and the assumed parameters. Much effort has recently been devoted to improving the ability to identify unreliable alignment regions. Detecting such unreliable regions was previously shown to be important for downstream analyses relying on MSAs, such as the detection of positive selection. Here we developed GUIDANCE2, a new integrative methodology that accounts for: (i) uncertainty in the process of indel formation, (ii) uncertainty in the assumed guide tree and (iii) co-optimal solutions in the pairwise alignments, used as building blocks in progressive alignment algorithms. We compared GUIDANCE2 with seven methodologies to detect unreliable MSA regions using extensive simulations and empirical benchmarks. We show that GUIDANCE2 outperforms all previously developed methodologies. Furthermore, GUIDANCE2 also provides a set of alternative MSAs which can be useful for downstream analyses. The novel algorithm is implemented as a web-server, available at: http://guidance.tau.ac.il. Oxford University Press 2015-07-01 2015-04-16 /pmc/articles/PMC4489236/ /pubmed/25883146 http://dx.doi.org/10.1093/nar/gkv318 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Web Server issue
Sela, Itamar
Ashkenazy, Haim
Katoh, Kazutaka
Pupko, Tal
GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
title GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
title_full GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
title_fullStr GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
title_full_unstemmed GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
title_short GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
title_sort guidance2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters
topic Web Server issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4489236/
https://www.ncbi.nlm.nih.gov/pubmed/25883146
http://dx.doi.org/10.1093/nar/gkv318
work_keys_str_mv AT selaitamar guidance2accuratedetectionofunreliablealignmentregionsaccountingfortheuncertaintyofmultipleparameters
AT ashkenazyhaim guidance2accuratedetectionofunreliablealignmentregionsaccountingfortheuncertaintyofmultipleparameters
AT katohkazutaka guidance2accuratedetectionofunreliablealignmentregionsaccountingfortheuncertaintyofmultipleparameters
AT pupkotal guidance2accuratedetectionofunreliablealignmentregionsaccountingfortheuncertaintyofmultipleparameters