Cargando…

PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences

Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leadin...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhardwaj, Gaurav, Ko, Kyung Dae, Hong, Yoojin, Zhang, Zhenhai, Ho, Ngai Lam, Chintapalli, Sree V., Kline, Lindsay A., Gotlin, Matthew, Hartranft, David Nicholas, Patterson, Morgen E., Dave, Foram, Smith, Evan J., Holmes, Edward C., Patterson, Randen L., van Rossum, Damian B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3325999/
https://www.ncbi.nlm.nih.gov/pubmed/22514627
http://dx.doi.org/10.1371/journal.pone.0034261
_version_ 1782229486992883712
author Bhardwaj, Gaurav
Ko, Kyung Dae
Hong, Yoojin
Zhang, Zhenhai
Ho, Ngai Lam
Chintapalli, Sree V.
Kline, Lindsay A.
Gotlin, Matthew
Hartranft, David Nicholas
Patterson, Morgen E.
Dave, Foram
Smith, Evan J.
Holmes, Edward C.
Patterson, Randen L.
van Rossum, Damian B.
author_facet Bhardwaj, Gaurav
Ko, Kyung Dae
Hong, Yoojin
Zhang, Zhenhai
Ho, Ngai Lam
Chintapalli, Sree V.
Kline, Lindsay A.
Gotlin, Matthew
Hartranft, David Nicholas
Patterson, Morgen E.
Dave, Foram
Smith, Evan J.
Holmes, Edward C.
Patterson, Randen L.
van Rossum, Damian B.
author_sort Bhardwaj, Gaurav
collection PubMed
description Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT, T-COFFEE, CLUSTAL, and MUSCLE) and six commonly used programs of tree estimation (Distance-based: Neighbor-Joining; Character-based: PhyML, RAxML, GARLI, Maximum Parsimony, and Bayesian) against a novel MSA-independent method (PHYRN) described here. Strikingly, at “midnight zone” genetic distances (∼7% pairwise identity and 4.0 gaps per position), PHYRN returns high-resolution phylogenies that outperform traditional approaches. We reason this is due to PHRYN's capability to amplify informative positions, even at the most extreme levels of sequence divergence. We also assess the applicability of the PHYRN algorithm for inferring deep evolutionary relationships in the divergent DANGER protein superfamily, for which PHYRN infers a more robust tree compared to MSA-based approaches. Taken together, these results demonstrate that PHYRN represents a powerful mechanism for mapping uncharted frontiers in highly divergent protein sequence data sets.
format Online
Article
Text
id pubmed-3325999
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33259992012-04-18 PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences Bhardwaj, Gaurav Ko, Kyung Dae Hong, Yoojin Zhang, Zhenhai Ho, Ngai Lam Chintapalli, Sree V. Kline, Lindsay A. Gotlin, Matthew Hartranft, David Nicholas Patterson, Morgen E. Dave, Foram Smith, Evan J. Holmes, Edward C. Patterson, Randen L. van Rossum, Damian B. PLoS One Research Article Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT, T-COFFEE, CLUSTAL, and MUSCLE) and six commonly used programs of tree estimation (Distance-based: Neighbor-Joining; Character-based: PhyML, RAxML, GARLI, Maximum Parsimony, and Bayesian) against a novel MSA-independent method (PHYRN) described here. Strikingly, at “midnight zone” genetic distances (∼7% pairwise identity and 4.0 gaps per position), PHYRN returns high-resolution phylogenies that outperform traditional approaches. We reason this is due to PHRYN's capability to amplify informative positions, even at the most extreme levels of sequence divergence. We also assess the applicability of the PHYRN algorithm for inferring deep evolutionary relationships in the divergent DANGER protein superfamily, for which PHYRN infers a more robust tree compared to MSA-based approaches. Taken together, these results demonstrate that PHYRN represents a powerful mechanism for mapping uncharted frontiers in highly divergent protein sequence data sets. Public Library of Science 2012-04-13 /pmc/articles/PMC3325999/ /pubmed/22514627 http://dx.doi.org/10.1371/journal.pone.0034261 Text en This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Bhardwaj, Gaurav
Ko, Kyung Dae
Hong, Yoojin
Zhang, Zhenhai
Ho, Ngai Lam
Chintapalli, Sree V.
Kline, Lindsay A.
Gotlin, Matthew
Hartranft, David Nicholas
Patterson, Morgen E.
Dave, Foram
Smith, Evan J.
Holmes, Edward C.
Patterson, Randen L.
van Rossum, Damian B.
PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
title PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
title_full PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
title_fullStr PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
title_full_unstemmed PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
title_short PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
title_sort phyrn: a robust method for phylogenetic analysis of highly divergent sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3325999/
https://www.ncbi.nlm.nih.gov/pubmed/22514627
http://dx.doi.org/10.1371/journal.pone.0034261
work_keys_str_mv AT bhardwajgaurav phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT kokyungdae phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT hongyoojin phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT zhangzhenhai phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT hongailam phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT chintapallisreev phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT klinelindsaya phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT gotlinmatthew phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT hartranftdavidnicholas phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT pattersonmorgene phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT daveforam phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT smithevanj phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT holmesedwardc phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT pattersonrandenl phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences
AT vanrossumdamianb phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences