Cargando…
PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences
Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leadin...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3325999/ https://www.ncbi.nlm.nih.gov/pubmed/22514627 http://dx.doi.org/10.1371/journal.pone.0034261 |
_version_ | 1782229486992883712 |
---|---|
author | Bhardwaj, Gaurav Ko, Kyung Dae Hong, Yoojin Zhang, Zhenhai Ho, Ngai Lam Chintapalli, Sree V. Kline, Lindsay A. Gotlin, Matthew Hartranft, David Nicholas Patterson, Morgen E. Dave, Foram Smith, Evan J. Holmes, Edward C. Patterson, Randen L. van Rossum, Damian B. |
author_facet | Bhardwaj, Gaurav Ko, Kyung Dae Hong, Yoojin Zhang, Zhenhai Ho, Ngai Lam Chintapalli, Sree V. Kline, Lindsay A. Gotlin, Matthew Hartranft, David Nicholas Patterson, Morgen E. Dave, Foram Smith, Evan J. Holmes, Edward C. Patterson, Randen L. van Rossum, Damian B. |
author_sort | Bhardwaj, Gaurav |
collection | PubMed |
description | Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT, T-COFFEE, CLUSTAL, and MUSCLE) and six commonly used programs of tree estimation (Distance-based: Neighbor-Joining; Character-based: PhyML, RAxML, GARLI, Maximum Parsimony, and Bayesian) against a novel MSA-independent method (PHYRN) described here. Strikingly, at “midnight zone” genetic distances (∼7% pairwise identity and 4.0 gaps per position), PHYRN returns high-resolution phylogenies that outperform traditional approaches. We reason this is due to PHRYN's capability to amplify informative positions, even at the most extreme levels of sequence divergence. We also assess the applicability of the PHYRN algorithm for inferring deep evolutionary relationships in the divergent DANGER protein superfamily, for which PHYRN infers a more robust tree compared to MSA-based approaches. Taken together, these results demonstrate that PHYRN represents a powerful mechanism for mapping uncharted frontiers in highly divergent protein sequence data sets. |
format | Online Article Text |
id | pubmed-3325999 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-33259992012-04-18 PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences Bhardwaj, Gaurav Ko, Kyung Dae Hong, Yoojin Zhang, Zhenhai Ho, Ngai Lam Chintapalli, Sree V. Kline, Lindsay A. Gotlin, Matthew Hartranft, David Nicholas Patterson, Morgen E. Dave, Foram Smith, Evan J. Holmes, Edward C. Patterson, Randen L. van Rossum, Damian B. PLoS One Research Article Both multiple sequence alignment and phylogenetic analysis are problematic in the “twilight zone” of sequence similarity (≤25% amino acid identity). Herein we explore the accuracy of phylogenetic inference at extreme sequence divergence using a variety of simulated data sets. We evaluate four leading multiple sequence alignment (MSA) methods (MAFFT, T-COFFEE, CLUSTAL, and MUSCLE) and six commonly used programs of tree estimation (Distance-based: Neighbor-Joining; Character-based: PhyML, RAxML, GARLI, Maximum Parsimony, and Bayesian) against a novel MSA-independent method (PHYRN) described here. Strikingly, at “midnight zone” genetic distances (∼7% pairwise identity and 4.0 gaps per position), PHYRN returns high-resolution phylogenies that outperform traditional approaches. We reason this is due to PHRYN's capability to amplify informative positions, even at the most extreme levels of sequence divergence. We also assess the applicability of the PHYRN algorithm for inferring deep evolutionary relationships in the divergent DANGER protein superfamily, for which PHYRN infers a more robust tree compared to MSA-based approaches. Taken together, these results demonstrate that PHYRN represents a powerful mechanism for mapping uncharted frontiers in highly divergent protein sequence data sets. Public Library of Science 2012-04-13 /pmc/articles/PMC3325999/ /pubmed/22514627 http://dx.doi.org/10.1371/journal.pone.0034261 Text en This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. |
spellingShingle | Research Article Bhardwaj, Gaurav Ko, Kyung Dae Hong, Yoojin Zhang, Zhenhai Ho, Ngai Lam Chintapalli, Sree V. Kline, Lindsay A. Gotlin, Matthew Hartranft, David Nicholas Patterson, Morgen E. Dave, Foram Smith, Evan J. Holmes, Edward C. Patterson, Randen L. van Rossum, Damian B. PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences |
title | PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences |
title_full | PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences |
title_fullStr | PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences |
title_full_unstemmed | PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences |
title_short | PHYRN: A Robust Method for Phylogenetic Analysis of Highly Divergent Sequences |
title_sort | phyrn: a robust method for phylogenetic analysis of highly divergent sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3325999/ https://www.ncbi.nlm.nih.gov/pubmed/22514627 http://dx.doi.org/10.1371/journal.pone.0034261 |
work_keys_str_mv | AT bhardwajgaurav phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT kokyungdae phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT hongyoojin phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT zhangzhenhai phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT hongailam phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT chintapallisreev phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT klinelindsaya phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT gotlinmatthew phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT hartranftdavidnicholas phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT pattersonmorgene phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT daveforam phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT smithevanj phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT holmesedwardc phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT pattersonrandenl phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences AT vanrossumdamianb phyrnarobustmethodforphylogeneticanalysisofhighlydivergentsequences |