Cargando…

Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses

The SARS-CoV-2 is the third coronavirus in addition to SARS-CoV and MERS-CoV that causes severe respiratory syndrome in humans. All of them likely crossed the interspecific barrier between animals and humans and are of zoonotic origin, respectively. The origin and evolution of viruses and their phyl...

Descripción completa

Detalles Bibliográficos
Autores principales: Kirichenko, Anastasiya D., Poroshina, Anastasiya A., Sherbakov, Dmitry Yu., Sadovsky, Michael G., Krutovsky, Konstantin V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8903263/
https://www.ncbi.nlm.nih.gov/pubmed/35259178
http://dx.doi.org/10.1371/journal.pone.0264640
_version_ 1784664722567593984
author Kirichenko, Anastasiya D.
Poroshina, Anastasiya A.
Sherbakov, Dmitry Yu.
Sadovsky, Michael G.
Krutovsky, Konstantin V.
author_facet Kirichenko, Anastasiya D.
Poroshina, Anastasiya A.
Sherbakov, Dmitry Yu.
Sadovsky, Michael G.
Krutovsky, Konstantin V.
author_sort Kirichenko, Anastasiya D.
collection PubMed
description The SARS-CoV-2 is the third coronavirus in addition to SARS-CoV and MERS-CoV that causes severe respiratory syndrome in humans. All of them likely crossed the interspecific barrier between animals and humans and are of zoonotic origin, respectively. The origin and evolution of viruses and their phylogenetic relationships are of great importance for study of their pathogenicity and development of antiviral drugs and vaccines. The main objective of the presented study was to compare two methods for identifying relationships between coronavirus genomes: phylogenetic one based on the whole genome alignment followed by molecular phylogenetic tree inference and alignment-free clustering of triplet frequencies, respectively, using 69 coronavirus genomes selected from two public databases. Both approaches resulted in well-resolved robust classifications. In general, the clusters identified by the first approach were in good agreement with the classes identified by the second using K-means and the elastic map method, but not always, which still needs to be explained. Both approaches demonstrated also a significant divergence of genomes on a taxonomic level, but there was less correspondence between genomes regarding the types of diseases they caused, which may be due to the individual characteristics of the host. This research showed that alignment-free methods are efficient in combination with alignment-based methods. They have a significant advantage in computational complexity and provide valuable additional alternative information on the genomes relationships.
format Online
Article
Text
id pubmed-8903263
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-89032632022-03-09 Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses Kirichenko, Anastasiya D. Poroshina, Anastasiya A. Sherbakov, Dmitry Yu. Sadovsky, Michael G. Krutovsky, Konstantin V. PLoS One Research Article The SARS-CoV-2 is the third coronavirus in addition to SARS-CoV and MERS-CoV that causes severe respiratory syndrome in humans. All of them likely crossed the interspecific barrier between animals and humans and are of zoonotic origin, respectively. The origin and evolution of viruses and their phylogenetic relationships are of great importance for study of their pathogenicity and development of antiviral drugs and vaccines. The main objective of the presented study was to compare two methods for identifying relationships between coronavirus genomes: phylogenetic one based on the whole genome alignment followed by molecular phylogenetic tree inference and alignment-free clustering of triplet frequencies, respectively, using 69 coronavirus genomes selected from two public databases. Both approaches resulted in well-resolved robust classifications. In general, the clusters identified by the first approach were in good agreement with the classes identified by the second using K-means and the elastic map method, but not always, which still needs to be explained. Both approaches demonstrated also a significant divergence of genomes on a taxonomic level, but there was less correspondence between genomes regarding the types of diseases they caused, which may be due to the individual characteristics of the host. This research showed that alignment-free methods are efficient in combination with alignment-based methods. They have a significant advantage in computational complexity and provide valuable additional alternative information on the genomes relationships. Public Library of Science 2022-03-08 /pmc/articles/PMC8903263/ /pubmed/35259178 http://dx.doi.org/10.1371/journal.pone.0264640 Text en © 2022 Kirichenko et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Kirichenko, Anastasiya D.
Poroshina, Anastasiya A.
Sherbakov, Dmitry Yu.
Sadovsky, Michael G.
Krutovsky, Konstantin V.
Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
title Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
title_full Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
title_fullStr Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
title_full_unstemmed Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
title_short Comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
title_sort comparative analysis of alignment-free genome clustering and whole genome alignment-based phylogenomic relationship of coronaviruses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8903263/
https://www.ncbi.nlm.nih.gov/pubmed/35259178
http://dx.doi.org/10.1371/journal.pone.0264640
work_keys_str_mv AT kirichenkoanastasiyad comparativeanalysisofalignmentfreegenomeclusteringandwholegenomealignmentbasedphylogenomicrelationshipofcoronaviruses
AT poroshinaanastasiyaa comparativeanalysisofalignmentfreegenomeclusteringandwholegenomealignmentbasedphylogenomicrelationshipofcoronaviruses
AT sherbakovdmitryyu comparativeanalysisofalignmentfreegenomeclusteringandwholegenomealignmentbasedphylogenomicrelationshipofcoronaviruses
AT sadovskymichaelg comparativeanalysisofalignmentfreegenomeclusteringandwholegenomealignmentbasedphylogenomicrelationshipofcoronaviruses
AT krutovskykonstantinv comparativeanalysisofalignmentfreegenomeclusteringandwholegenomealignmentbasedphylogenomicrelationshipofcoronaviruses