Cargando…

Tree and rate estimation by local evaluation of heterochronous nucleotide data

Motivation: Heterochronous gene sequence data is important for characterizing the evolutionary processes of fast-evolving organisms such as RNA viruses. A limited set of algorithms exists for estimating the rate of nucleotide substitution and inferring phylogenetic trees from such data. The authors...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Zhu, O'Brien, John D., Zheng, Xiaobin, Zhu, Huai-Qiu, She, Zhen-Su
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7187891/
https://www.ncbi.nlm.nih.gov/pubmed/17110369
http://dx.doi.org/10.1093/bioinformatics/btl577
_version_ 1783527245274415104
author Yang, Zhu
O'Brien, John D.
Zheng, Xiaobin
Zhu, Huai-Qiu
She, Zhen-Su
author_facet Yang, Zhu
O'Brien, John D.
Zheng, Xiaobin
Zhu, Huai-Qiu
She, Zhen-Su
author_sort Yang, Zhu
collection PubMed
description Motivation: Heterochronous gene sequence data is important for characterizing the evolutionary processes of fast-evolving organisms such as RNA viruses. A limited set of algorithms exists for estimating the rate of nucleotide substitution and inferring phylogenetic trees from such data. The authors here present a new method, Tree and Rate Estimation by Local Evaluation (TREBLE) that robustly calculates the rate of nucleotide substitution and phylogeny with several orders of magnitude improvement in computational time. Methods: For the basis of its rate estimation TREBLE novelly utilizes a geometric interpretation of the molecular clock assumption to deduce a local estimate of the rate of nucleotide substitution for triplets of dated sequences. Averaging the triplet estimates via a variance weighting yields a global estimate of the rate. From this value, an iterative refinement procedure relying on statistical properties of the triplets then generates a final estimate of the global rate of nucleotide substitution. The estimated global rate is then utilized to find the tree from the pairwise distance matrix via an UPGMA-like algorithm. Results: Simulation studies show that TREBLE estimates the rate of nucleotide substitution with point estimates comparable with the best of available methods. Confidence intervals are comparable with that of BEAST. TREBLE's phylogenetic reconstruction is significantly improved over the other distance matrix method but not as accurate as the Bayesian algorithm. Compared with three other algorithms, TREBLE reduces computational time by a minimum factor of 3000. Relative to the algorithm with the most accurate estimates for the rate of nucleotide substitution (i.e. BEAST), TREBLE is over 10 000 times more computationally efficient. Availability: Contact: jdobrien@ucla.edu
format Online
Article
Text
id pubmed-7187891
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-71878912020-04-29 Tree and rate estimation by local evaluation of heterochronous nucleotide data Yang, Zhu O'Brien, John D. Zheng, Xiaobin Zhu, Huai-Qiu She, Zhen-Su Bioinformatics Original Papers Motivation: Heterochronous gene sequence data is important for characterizing the evolutionary processes of fast-evolving organisms such as RNA viruses. A limited set of algorithms exists for estimating the rate of nucleotide substitution and inferring phylogenetic trees from such data. The authors here present a new method, Tree and Rate Estimation by Local Evaluation (TREBLE) that robustly calculates the rate of nucleotide substitution and phylogeny with several orders of magnitude improvement in computational time. Methods: For the basis of its rate estimation TREBLE novelly utilizes a geometric interpretation of the molecular clock assumption to deduce a local estimate of the rate of nucleotide substitution for triplets of dated sequences. Averaging the triplet estimates via a variance weighting yields a global estimate of the rate. From this value, an iterative refinement procedure relying on statistical properties of the triplets then generates a final estimate of the global rate of nucleotide substitution. The estimated global rate is then utilized to find the tree from the pairwise distance matrix via an UPGMA-like algorithm. Results: Simulation studies show that TREBLE estimates the rate of nucleotide substitution with point estimates comparable with the best of available methods. Confidence intervals are comparable with that of BEAST. TREBLE's phylogenetic reconstruction is significantly improved over the other distance matrix method but not as accurate as the Bayesian algorithm. Compared with three other algorithms, TREBLE reduces computational time by a minimum factor of 3000. Relative to the algorithm with the most accurate estimates for the rate of nucleotide substitution (i.e. BEAST), TREBLE is over 10 000 times more computationally efficient. Availability: Contact: jdobrien@ucla.edu Oxford University Press 2007-01-15 2006-11-16 /pmc/articles/PMC7187891/ /pubmed/17110369 http://dx.doi.org/10.1093/bioinformatics/btl577 Text en © 2006 The Author(s) This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. This article is made available via the PMC Open Access Subset for unrestricted re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the COVID-19 pandemic or until permissions are revoked in writing. Upon expiration of these permissions, PMC is granted a perpetual license to make this article available via PMC and Europe PMC, consistent with existing copyright protections.
spellingShingle Original Papers
Yang, Zhu
O'Brien, John D.
Zheng, Xiaobin
Zhu, Huai-Qiu
She, Zhen-Su
Tree and rate estimation by local evaluation of heterochronous nucleotide data
title Tree and rate estimation by local evaluation of heterochronous nucleotide data
title_full Tree and rate estimation by local evaluation of heterochronous nucleotide data
title_fullStr Tree and rate estimation by local evaluation of heterochronous nucleotide data
title_full_unstemmed Tree and rate estimation by local evaluation of heterochronous nucleotide data
title_short Tree and rate estimation by local evaluation of heterochronous nucleotide data
title_sort tree and rate estimation by local evaluation of heterochronous nucleotide data
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7187891/
https://www.ncbi.nlm.nih.gov/pubmed/17110369
http://dx.doi.org/10.1093/bioinformatics/btl577
work_keys_str_mv AT yangzhu treeandrateestimationbylocalevaluationofheterochronousnucleotidedata
AT obrienjohnd treeandrateestimationbylocalevaluationofheterochronousnucleotidedata
AT zhengxiaobin treeandrateestimationbylocalevaluationofheterochronousnucleotidedata
AT zhuhuaiqiu treeandrateestimationbylocalevaluationofheterochronousnucleotidedata
AT shezhensu treeandrateestimationbylocalevaluationofheterochronousnucleotidedata