Cargando…

An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction

Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Yao-Qun, Yu, Zu-Guo, Tang, Run-Bin, Han, Guo-Sheng, Anh, Vo V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568955/
https://www.ncbi.nlm.nih.gov/pubmed/34745231
http://dx.doi.org/10.3389/fgene.2021.766496
_version_ 1784594543587360768
author Wu, Yao-Qun
Yu, Zu-Guo
Tang, Run-Bin
Han, Guo-Sheng
Anh, Vo V.
author_facet Wu, Yao-Qun
Yu, Zu-Guo
Tang, Run-Bin
Han, Guo-Sheng
Anh, Vo V.
author_sort Wu, Yao-Qun
collection PubMed
description Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioinformatics. Here we propose a new alignment-free method for phylogenetic tree reconstruction based on whole genome sequences. A key component is a measure called information-entropy position-weighted k-mer relative measure (IEPWRMkmer), which combines the position-weighted measure of k-mers proposed by our group and the information entropy of frequency of k-mers. The Manhattan distance is used to calculate the pairwise distance between species. Finally, we use the Neighbor-Joining method to construct the phylogenetic tree. To evaluate the performance of this method, we perform phylogenetic analysis on two datasets used by other researchers. The results demonstrate that the IEPWRMkmer method is efficient and reliable. The source codes of our method are provided at https://github.com/ wuyaoqun37/IEPWRMkmer.
format Online
Article
Text
id pubmed-8568955
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-85689552021-11-06 An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction Wu, Yao-Qun Yu, Zu-Guo Tang, Run-Bin Han, Guo-Sheng Anh, Vo V. Front Genet Genetics Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioinformatics. Here we propose a new alignment-free method for phylogenetic tree reconstruction based on whole genome sequences. A key component is a measure called information-entropy position-weighted k-mer relative measure (IEPWRMkmer), which combines the position-weighted measure of k-mers proposed by our group and the information entropy of frequency of k-mers. The Manhattan distance is used to calculate the pairwise distance between species. Finally, we use the Neighbor-Joining method to construct the phylogenetic tree. To evaluate the performance of this method, we perform phylogenetic analysis on two datasets used by other researchers. The results demonstrate that the IEPWRMkmer method is efficient and reliable. The source codes of our method are provided at https://github.com/ wuyaoqun37/IEPWRMkmer. Frontiers Media S.A. 2021-10-22 /pmc/articles/PMC8568955/ /pubmed/34745231 http://dx.doi.org/10.3389/fgene.2021.766496 Text en Copyright © 2021 Wu, Yu, Tang, Han and Anh. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wu, Yao-Qun
Yu, Zu-Guo
Tang, Run-Bin
Han, Guo-Sheng
Anh, Vo V.
An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
title An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
title_full An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
title_fullStr An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
title_full_unstemmed An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
title_short An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
title_sort information-entropy position-weighted k-mer relative measure for whole genome phylogeny reconstruction
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568955/
https://www.ncbi.nlm.nih.gov/pubmed/34745231
http://dx.doi.org/10.3389/fgene.2021.766496
work_keys_str_mv AT wuyaoqun aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT yuzuguo aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT tangrunbin aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT hanguosheng aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT anhvov aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT wuyaoqun informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT yuzuguo informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT tangrunbin informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT hanguosheng informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction
AT anhvov informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction