Cargando…

Phylogenetic inference from binary sequences reduced by primary DNA sequences

Given a bi-classification of nucleotides, we can obtain a reduced binary sequence of a primary DNA sequence. This binary sequence will undoubtedly retain some biological information and lose the rest. Here we want to know what kind of and how much biological information an individual binary sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Zheng, Xiaoqi, Dou, Yongchao, Wang, Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7088321/
https://www.ncbi.nlm.nih.gov/pubmed/32214589
http://dx.doi.org/10.1007/s10910-008-9504-2
Descripción
Sumario:Given a bi-classification of nucleotides, we can obtain a reduced binary sequence of a primary DNA sequence. This binary sequence will undoubtedly retain some biological information and lose the rest. Here we want to know what kind of and how much biological information an individual binary sequence carries. Three classifications of nucleotides are explored in the present article. Phylogenetic trees are built from these binary sequences by the Neighbor-Joining (NJ) method, with evolutionary distance evaluated on the basis of a symbolic sequence complexity. We find that, for all data sets studied, binary sequences reduced by the purine/pyrimidine classification give reliable phylogeny (almost the same as that from the primary sequences), while the other two result in discrepancies at different levels. Some possible reasons and a simple model of sequence evolutionary are introduced to interpret this phenomenon.