Cargando…

On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs

The aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks...

Descripción completa

Detalles Bibliográficos
Autores principales: Husev, Mykola, Rovenchak, Andrij
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7887720/
https://www.ncbi.nlm.nih.gov/pubmed/33613787
http://dx.doi.org/10.1007/s12304-021-09403-5
_version_ 1783652039424737280
author Husev, Mykola
Rovenchak, Andrij
author_facet Husev, Mykola
Rovenchak, Andrij
author_sort Husev, Mykola
collection PubMed
description The aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks of equal length (four nucleotides) are considered. In the second approach, the whole RNA genome is divided into parts by adenine or the most frequent nucleotide as a “space”. Rank–frequency distributions are studied in both cases. The defined nucleotide sequences are signs comparable to a certain extent to syllables or words as seen from the nature of their rank–frequency distributions. Within the first approach, the Pólya and the negative hypergeometric distribution yield the best fit. For the distributions obtained within the second approach, we have calculated a set of parameters, including entropy, mean sequence length, and its dispersion. The calculated parameters became the basis for the classification of viruses. We observed that proximity of viruses on planes spanned on various pairs of parameters corresponds to related species. In certain cases, such a proximity is observed for unrelated species as well calling thus for the expansion of the set of parameters used in the classification. We also observed that the fifth most frequent nucleotide sequences obtained within the second approach are of different nature in case of human coronaviruses (different nucleotides for MERS, SARS-CoV, and SARS-CoV-2 versus identical nucleotides for four other coronaviruses). We expect that our findings will be useful as a supplementary tool in the classification of diseases caused by RNA viruses with respect to severity and contagiousness.
format Online
Article
Text
id pubmed-7887720
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-78877202021-02-17 On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs Husev, Mykola Rovenchak, Andrij Biosemiotics Original Research The aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks of equal length (four nucleotides) are considered. In the second approach, the whole RNA genome is divided into parts by adenine or the most frequent nucleotide as a “space”. Rank–frequency distributions are studied in both cases. The defined nucleotide sequences are signs comparable to a certain extent to syllables or words as seen from the nature of their rank–frequency distributions. Within the first approach, the Pólya and the negative hypergeometric distribution yield the best fit. For the distributions obtained within the second approach, we have calculated a set of parameters, including entropy, mean sequence length, and its dispersion. The calculated parameters became the basis for the classification of viruses. We observed that proximity of viruses on planes spanned on various pairs of parameters corresponds to related species. In certain cases, such a proximity is observed for unrelated species as well calling thus for the expansion of the set of parameters used in the classification. We also observed that the fifth most frequent nucleotide sequences obtained within the second approach are of different nature in case of human coronaviruses (different nucleotides for MERS, SARS-CoV, and SARS-CoV-2 versus identical nucleotides for four other coronaviruses). We expect that our findings will be useful as a supplementary tool in the classification of diseases caused by RNA viruses with respect to severity and contagiousness. Springer Netherlands 2021-02-17 2021 /pmc/articles/PMC7887720/ /pubmed/33613787 http://dx.doi.org/10.1007/s12304-021-09403-5 Text en © The Author(s), under exclusive licence to Springer Nature B.V. part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research
Husev, Mykola
Rovenchak, Andrij
On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs
title On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs
title_full On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs
title_fullStr On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs
title_full_unstemmed On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs
title_short On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs
title_sort on the verge of life: distribution of nucleotide sequences in viral rnas
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7887720/
https://www.ncbi.nlm.nih.gov/pubmed/33613787
http://dx.doi.org/10.1007/s12304-021-09403-5
work_keys_str_mv AT husevmykola onthevergeoflifedistributionofnucleotidesequencesinviralrnas
AT rovenchakandrij onthevergeoflifedistributionofnucleotidesequencesinviralrnas