Cargando…
An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction
Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568955/ https://www.ncbi.nlm.nih.gov/pubmed/34745231 http://dx.doi.org/10.3389/fgene.2021.766496 |
_version_ | 1784594543587360768 |
---|---|
author | Wu, Yao-Qun Yu, Zu-Guo Tang, Run-Bin Han, Guo-Sheng Anh, Vo V. |
author_facet | Wu, Yao-Qun Yu, Zu-Guo Tang, Run-Bin Han, Guo-Sheng Anh, Vo V. |
author_sort | Wu, Yao-Qun |
collection | PubMed |
description | Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioinformatics. Here we propose a new alignment-free method for phylogenetic tree reconstruction based on whole genome sequences. A key component is a measure called information-entropy position-weighted k-mer relative measure (IEPWRMkmer), which combines the position-weighted measure of k-mers proposed by our group and the information entropy of frequency of k-mers. The Manhattan distance is used to calculate the pairwise distance between species. Finally, we use the Neighbor-Joining method to construct the phylogenetic tree. To evaluate the performance of this method, we perform phylogenetic analysis on two datasets used by other researchers. The results demonstrate that the IEPWRMkmer method is efficient and reliable. The source codes of our method are provided at https://github.com/ wuyaoqun37/IEPWRMkmer. |
format | Online Article Text |
id | pubmed-8568955 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-85689552021-11-06 An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction Wu, Yao-Qun Yu, Zu-Guo Tang, Run-Bin Han, Guo-Sheng Anh, Vo V. Front Genet Genetics Alignment methods have faced disadvantages in sequence comparison and phylogeny reconstruction due to their high computational costs in handling time and space complexity. On the other hand, alignment-free methods incur low computational costs and have recently gained popularity in the field of bioinformatics. Here we propose a new alignment-free method for phylogenetic tree reconstruction based on whole genome sequences. A key component is a measure called information-entropy position-weighted k-mer relative measure (IEPWRMkmer), which combines the position-weighted measure of k-mers proposed by our group and the information entropy of frequency of k-mers. The Manhattan distance is used to calculate the pairwise distance between species. Finally, we use the Neighbor-Joining method to construct the phylogenetic tree. To evaluate the performance of this method, we perform phylogenetic analysis on two datasets used by other researchers. The results demonstrate that the IEPWRMkmer method is efficient and reliable. The source codes of our method are provided at https://github.com/ wuyaoqun37/IEPWRMkmer. Frontiers Media S.A. 2021-10-22 /pmc/articles/PMC8568955/ /pubmed/34745231 http://dx.doi.org/10.3389/fgene.2021.766496 Text en Copyright © 2021 Wu, Yu, Tang, Han and Anh. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wu, Yao-Qun Yu, Zu-Guo Tang, Run-Bin Han, Guo-Sheng Anh, Vo V. An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction |
title | An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction |
title_full | An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction |
title_fullStr | An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction |
title_full_unstemmed | An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction |
title_short | An Information-Entropy Position-Weighted K-Mer Relative Measure for Whole Genome Phylogeny Reconstruction |
title_sort | information-entropy position-weighted k-mer relative measure for whole genome phylogeny reconstruction |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568955/ https://www.ncbi.nlm.nih.gov/pubmed/34745231 http://dx.doi.org/10.3389/fgene.2021.766496 |
work_keys_str_mv | AT wuyaoqun aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT yuzuguo aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT tangrunbin aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT hanguosheng aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT anhvov aninformationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT wuyaoqun informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT yuzuguo informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT tangrunbin informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT hanguosheng informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction AT anhvov informationentropypositionweightedkmerrelativemeasureforwholegenomephylogenyreconstruction |