Cargando…
DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood
BACKGROUND: The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility....
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261585/ https://www.ncbi.nlm.nih.gov/pubmed/25260628 http://dx.doi.org/10.1186/1471-2105-15-320 |
_version_ | 1782348298756030464 |
---|---|
author | Kim, Tane Hao, Weilong |
author_facet | Kim, Tane Hao, Weilong |
author_sort | Kim, Tane |
collection | PubMed |
description | BACKGROUND: The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility. Discrete characters often have different transition rate matrices, variable rates among sites and sometimes contain unobservable states. To obtain the ability to accurately estimate a variety of discrete characters, programs with sophisticated methodologies and flexible settings are desired. RESULTS: DiscML performs maximum likelihood estimation for evolutionary rates of discrete characters on a provided phylogeny with the options that correct for unobservable data, rate variations, and unknown prior root probabilities from the empirical data. It gives users options to customize the instantaneous transition rate matrices, or to choose pre-determined matrices from models such as birth-and-death (BD), birth-death-and-innovation (BDI), equal rates (ER), symmetric (SYM), general time-reversible (GTR) and all rates different (ARD). Moreover, we show application examples of DiscML on gene family data and on intron presence/absence data. CONCLUSION: DiscML was developed as a unified R program for estimating evolutionary rates of discrete characters with no restriction on the number of character states, and with flexibility to use different transition models. DiscML is ideal for the analyses of binary (1s/0s) patterns, multi-gene families, and multistate discrete morphological characteristics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-320) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4261585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42615852014-12-10 DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood Kim, Tane Hao, Weilong BMC Bioinformatics Software BACKGROUND: The study of discrete characters is crucial for the understanding of evolutionary processes. Even though great advances have been made in the analysis of nucleotide sequences, computer programs for non-DNA discrete characters are often dedicated to specific analyses and lack flexibility. Discrete characters often have different transition rate matrices, variable rates among sites and sometimes contain unobservable states. To obtain the ability to accurately estimate a variety of discrete characters, programs with sophisticated methodologies and flexible settings are desired. RESULTS: DiscML performs maximum likelihood estimation for evolutionary rates of discrete characters on a provided phylogeny with the options that correct for unobservable data, rate variations, and unknown prior root probabilities from the empirical data. It gives users options to customize the instantaneous transition rate matrices, or to choose pre-determined matrices from models such as birth-and-death (BD), birth-death-and-innovation (BDI), equal rates (ER), symmetric (SYM), general time-reversible (GTR) and all rates different (ARD). Moreover, we show application examples of DiscML on gene family data and on intron presence/absence data. CONCLUSION: DiscML was developed as a unified R program for estimating evolutionary rates of discrete characters with no restriction on the number of character states, and with flexibility to use different transition models. DiscML is ideal for the analyses of binary (1s/0s) patterns, multi-gene families, and multistate discrete morphological characteristics. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2105-15-320) contains supplementary material, which is available to authorized users. BioMed Central 2014-09-27 /pmc/articles/PMC4261585/ /pubmed/25260628 http://dx.doi.org/10.1186/1471-2105-15-320 Text en © Kim and Hao; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License(http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Kim, Tane Hao, Weilong DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood |
title | DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood |
title_full | DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood |
title_fullStr | DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood |
title_full_unstemmed | DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood |
title_short | DiscML: an R package for estimating evolutionary rates of discrete characters using maximum likelihood |
title_sort | discml: an r package for estimating evolutionary rates of discrete characters using maximum likelihood |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4261585/ https://www.ncbi.nlm.nih.gov/pubmed/25260628 http://dx.doi.org/10.1186/1471-2105-15-320 |
work_keys_str_mv | AT kimtane discmlanrpackageforestimatingevolutionaryratesofdiscretecharactersusingmaximumlikelihood AT haoweilong discmlanrpackageforestimatingevolutionaryratesofdiscretecharactersusingmaximumlikelihood |