Cargando…

DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood

BACKGROUND: The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility....

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Tane, Hao, Weilong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261585/
https://www.ncbi.nlm.nih.gov/pubmed/25260628
http://dx.doi.org/10.1186/1471-2105-15-320
_version_ 1782348298756030464
author Kim, Tane
Hao, Weilong
author_facet Kim, Tane
Hao, Weilong
author_sort Kim, Tane
collection PubMed
description BACKGROUND: The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility. Discrete characters often have different transition rate matrices, variable rates among sites and sometimes contain unobservable states. To obtain the ability to accurately estimate a variety of discrete characters, programs with sophisticated methodologies and flexible settings are desired. RESULTS: DiscML performs maximum likelihood estimation for evolutionary rates of discrete characters on a provided phylogeny with the options that correct for unobservable data, rate variations, and unknown prior root probabilities from the empirical data. It gives users options to customize the instantaneous transition rate matrices, or to choose pre-determined matrices from models such as birth-and-death (BD), birth-death-and-innovation (BDI), equal rates (ER), symmetric (SYM), general time-reversible (GTR) and all rates different (ARD). Moreover, we show application examples of DiscML on gene family data and on intron presence/absence data. CONCLUSION: DiscML was developed as a unified R program for estimating evolutionary rates of discrete characters with no restriction on the number of character states, and with flexibility to use different transition models. DiscML is ideal for the analyses of binary (1s/0s) patterns, multi-gene families, and multistate discrete morphological characteristics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-320) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4261585
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42615852014-12-10 DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood Kim, Tane Hao, Weilong BMC Bioinformatics Software BACKGROUND: The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility. Discrete characters often have different transition rate matrices, variable rates among sites and sometimes contain unobservable states. To obtain the ability to accurately estimate a variety of discrete characters, programs with sophisticated methodologies and flexible settings are desired. RESULTS: DiscML performs maximum likelihood estimation for evolutionary rates of discrete characters on a provided phylogeny with the options that correct for unobservable data, rate variations, and unknown prior root probabilities from the empirical data. It gives users options to customize the instantaneous transition rate matrices, or to choose pre-determined matrices from models such as birth-and-death (BD), birth-death-and-innovation (BDI), equal rates (ER), symmetric (SYM), general time-reversible (GTR) and all rates different (ARD). Moreover, we show application examples of DiscML on gene family data and on intron presence/absence data. CONCLUSION: DiscML was developed as a unified R program for estimating evolutionary rates of discrete characters with no restriction on the number of character states, and with flexibility to use different transition models. DiscML is ideal for the analyses of binary (1s/0s) patterns, multi-gene families, and multistate discrete morphological characteristics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-320) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-27 /pmc/articles/PMC4261585/ /pubmed/25260628 http://dx.doi.org/10.1186/1471-2105-15-320 Text en © Kim and Hao; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License(http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Kim, Tane
Hao, Weilong
DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
title DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
title_full DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
title_fullStr DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
title_full_unstemmed DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
title_short DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
title_sort discml: an r package for estimating evolutionary rates of discrete characters using maximum likelihood
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261585/
https://www.ncbi.nlm.nih.gov/pubmed/25260628
http://dx.doi.org/10.1186/1471-2105-15-320
work_keys_str_mv AT kimtane discmlanrpackageforestimatingevolutionaryratesofdiscretecharactersusingmaximumlikelihood
AT haoweilong discmlanrpackageforestimatingevolutionaryratesofdiscretecharactersusingmaximumlikelihood