Small-Sample Estimation of the Mutational Support and Distribution of SARS-CoV-2
We consider the problem of determining the mutational support and distribution of the SARS-CoV-2 viral genome in the small-sample regime. The mutational support refers to the unknown number of sites that may eventually mutate in the SARS-CoV-2 genome while mutational distribution refers to the distr...
Format: | Online Article Text |
---|---|
Language: | English |
Published: | IEEE 2022 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10009811/ https://www.ncbi.nlm.nih.gov/pubmed/35385386 http://dx.doi.org/10.1109/TCBB.2022.3165395 |
_version_ | 1784906061437730816 |
---|---|
collection | PubMed |
description | We consider the problem of determining the mutational support and distribution of the SARS-CoV-2 viral genome in the small-sample regime. The mutational support refers to the unknown number of sites that may eventually mutate in the SARS-CoV-2 genome while mutational distribution refers to the distribution of point mutations in the viral genome across a population. The mutational support may be used to assess the virulence of the virus and guide primer selection for real-time RT-PCR testing. Estimating the distribution of mutations in the genome of different subpopulations while accounting for the unseen may also aid in discovering new variants. To estimate the mutational support in the small-sample regime, we use GISAID sequencing data and our state-of-the-art polynomial estimation techniques based on new weighted and regularized Chebyshev approximation methods. For distribution estimation, we adapt the well-known Good-Turing estimator. Our analysis reveals several findings: First, the mutational supports exhibit significant differences in the ORF6 and ORF7a regions (older versus younger patients), ORF1b and ORF10 regions (females versus males) and in almost all ORFs (Asia/Europe/North America). Second, even though the N region of SARS-CoV-2 has a predicted [Formula: see text][Formula: see text] mutational support, mutations fall outside of the primer regions recommended by the CDC. |
format | Online Article Text |
id | pubmed-10009811 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | IEEE |
record_format | MEDLINE/PubMed |
spelling | pubmed-10009811 2023-03-20 Small-Sample Estimation of the Mutational Support and Distribution of SARS-CoV-2 IEEE/ACM Trans Comput Biol Bioinform Article We consider the problem of determining the mutational support and distribution of the SARS-CoV-2 viral genome in the small-sample regime. The mutational support refers to the unknown number of sites that may eventually mutate in the SARS-CoV-2 genome while mutational distribution refers to the distribution of point mutations in the viral genome across a population. The mutational support may be used to assess the virulence of the virus and guide primer selection for real-time RT-PCR testing. Estimating the distribution of mutations in the genome of different subpopulations while accounting for the unseen may also aid in discovering new variants. To estimate the mutational support in the small-sample regime, we use GISAID sequencing data and our state-of-the-art polynomial estimation techniques based on new weighted and regularized Chebyshev approximation methods. For distribution estimation, we adapt the well-known Good-Turing estimator. Our analysis reveals several findings: First, the mutational supports exhibit significant differences in the ORF6 and ORF7a regions (older versus younger patients), ORF1b and ORF10 regions (females versus males) and in almost all ORFs (Asia/Europe/North America). Second, even though the N region of SARS-CoV-2 has a predicted [Formula: see text][Formula: see text] mutational support, mutations fall outside of the primer regions recommended by the CDC. IEEE 2022-04-06 /pmc/articles/PMC10009811/ /pubmed/35385386 http://dx.doi.org/10.1109/TCBB.2022.3165395 Text en https://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Small-Sample Estimation of the Mutational Support and Distribution of SARS-CoV-2 |
title | Small-Sample Estimation of the Mutational Support and Distribution of SARS-CoV-2 |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10009811/ https://www.ncbi.nlm.nih.gov/pubmed/35385386 http://dx.doi.org/10.1109/TCBB.2022.3165395 |
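
The description above mentions adapting the Good-Turing estimator for distribution estimation. As a rough illustration only, the following sketch computes the textbook Good-Turing estimate of the missing mass (the total probability of items not yet observed), which is the number of items seen exactly once divided by the sample size. This is the classical estimator, not the authors' adaptation; the function name and the toy list of site indices are assumptions made for this example and are not taken from the article.

```python
from collections import Counter


def good_turing_missing_mass(observations):
    """Classical Good-Turing missing-mass estimate: N1 / n.

    N1 is the number of distinct items observed exactly once and
    n is the total number of observations. Textbook form only;
    not the adapted estimator used in the article.
    """
    counts = Counter(observations)
    n = sum(counts.values())
    if n == 0:
        raise ValueError("need at least one observation")
    singletons = sum(1 for c in counts.values() if c == 1)
    return singletons / n


# Hypothetical toy sample: indices of genome sites at which a point
# mutation was observed (illustrative values, not data from the article).
sites = [10, 20, 30, 40, 40, 10, 50, 60]
print(good_turing_missing_mass(sites))
```

In this toy sample, four of the eight observations are sites seen only once, so the estimate suggests that roughly half of the probability mass lies on sites not yet observed. A sample in which every site repeats would give an estimate of zero.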