Cargando…
A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction
Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/), between December 2019...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8282786/ https://www.ncbi.nlm.nih.gov/pubmed/34267263 http://dx.doi.org/10.1038/s41598-021-93757-w |
_version_ | 1783723063751213056 |
---|---|
author | Ekpenyong, Moses Effiong Edoho, Mercy Ernest Inyang, Udoinyang Godwin Uzoka, Faith-Michael Ekaidem, Itemobong Samuel Moses, Anietie Effiong Emeje, Martins Ochubiojo Tatfeng, Youtchou Mirabeau Udo, Ifiok James Anwana, EnoAbasi Deborah Etim, Oboso Edem Geoffery, Joseph Ikim Dan, Emmanuel Ambrose |
author_facet | Ekpenyong, Moses Effiong Edoho, Mercy Ernest Inyang, Udoinyang Godwin Uzoka, Faith-Michael Ekaidem, Itemobong Samuel Moses, Anietie Effiong Emeje, Martins Ochubiojo Tatfeng, Youtchou Mirabeau Udo, Ifiok James Anwana, EnoAbasi Deborah Etim, Oboso Edem Geoffery, Joseph Ikim Dan, Emmanuel Ambrose |
author_sort | Ekpenyong, Moses Effiong |
collection | PubMed |
description | Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/), between December 2019 and January 15, 2021, a total of 8864 human SARS-CoV-2 complete genome sequences processed by gender, across 6 continents (88 countries) of the world, Antarctica exempt, were analyzed. We hypothesized that data speak for itself and can discern true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis performed using a hybrid of biotechnology and machine learning methods corroborate the emergence of inter- and intra- SARS-CoV-2 sub-strains transmission and sustain an increase in sub-strains within the various continents, with nucleotide mutations dynamically varying between individuals in close association with the virus as it adapts to its host/environment. Interestingly, some viral sub-strain patterns progressively transformed into new sub-strain clusters indicating varying amino acid, and strong nucleotide association derived from same lineage. A novel cognitive approach to knowledge mining helped the discovery of transmission routes and seamless contact tracing protocol. Our classification results were better than state-of-the-art methods, indicating a more robust system for predicting emerging or new viral sub-strain(s). The results therefore offer explanations for the growing concerns about the virus and its next wave(s). A future direction of this work is a defuzzification of confusable pattern clusters for precise intra-country SARS-CoV-2 sub-strains analytics. |
format | Online Article Text |
id | pubmed-8282786 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-82827862021-07-19 A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction Ekpenyong, Moses Effiong Edoho, Mercy Ernest Inyang, Udoinyang Godwin Uzoka, Faith-Michael Ekaidem, Itemobong Samuel Moses, Anietie Effiong Emeje, Martins Ochubiojo Tatfeng, Youtchou Mirabeau Udo, Ifiok James Anwana, EnoAbasi Deborah Etim, Oboso Edem Geoffery, Joseph Ikim Dan, Emmanuel Ambrose Sci Rep Article Whereas accelerated attention beclouded early stages of the coronavirus spread, knowledge of actual pathogenicity and origin of possible sub-strains remained unclear. By harvesting the Global initiative on Sharing All Influenza Data (GISAID) database (https://www.gisaid.org/), between December 2019 and January 15, 2021, a total of 8864 human SARS-CoV-2 complete genome sequences processed by gender, across 6 continents (88 countries) of the world, Antarctica exempt, were analyzed. We hypothesized that data speak for itself and can discern true and explainable patterns of the disease. Identical genome diversity and pattern correlates analysis performed using a hybrid of biotechnology and machine learning methods corroborate the emergence of inter- and intra- SARS-CoV-2 sub-strains transmission and sustain an increase in sub-strains within the various continents, with nucleotide mutations dynamically varying between individuals in close association with the virus as it adapts to its host/environment. Interestingly, some viral sub-strain patterns progressively transformed into new sub-strain clusters indicating varying amino acid, and strong nucleotide association derived from same lineage. A novel cognitive approach to knowledge mining helped the discovery of transmission routes and seamless contact tracing protocol. Our classification results were better than state-of-the-art methods, indicating a more robust system for predicting emerging or new viral sub-strain(s). The results therefore offer explanations for the growing concerns about the virus and its next wave(s). A future direction of this work is a defuzzification of confusable pattern clusters for precise intra-country SARS-CoV-2 sub-strains analytics. Nature Publishing Group UK 2021-07-15 /pmc/articles/PMC8282786/ /pubmed/34267263 http://dx.doi.org/10.1038/s41598-021-93757-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Ekpenyong, Moses Effiong Edoho, Mercy Ernest Inyang, Udoinyang Godwin Uzoka, Faith-Michael Ekaidem, Itemobong Samuel Moses, Anietie Effiong Emeje, Martins Ochubiojo Tatfeng, Youtchou Mirabeau Udo, Ifiok James Anwana, EnoAbasi Deborah Etim, Oboso Edem Geoffery, Joseph Ikim Dan, Emmanuel Ambrose A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction |
title | A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction |
title_full | A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction |
title_fullStr | A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction |
title_full_unstemmed | A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction |
title_short | A hybrid computational framework for intelligent inter-continent SARS-CoV-2 sub-strains characterization and prediction |
title_sort | hybrid computational framework for intelligent inter-continent sars-cov-2 sub-strains characterization and prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8282786/ https://www.ncbi.nlm.nih.gov/pubmed/34267263 http://dx.doi.org/10.1038/s41598-021-93757-w |
work_keys_str_mv | AT ekpenyongmoseseffiong ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT edohomercyernest ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT inyangudoinyanggodwin ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT uzokafaithmichael ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT ekaidemitemobongsamuel ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT mosesanietieeffiong ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT emejemartinsochubiojo ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT tatfengyoutchoumirabeau ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT udoifiokjames ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT anwanaenoabasideborah ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT etimobosoedem ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT geofferyjosephikim ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT danemmanuelambrose ahybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT ekpenyongmoseseffiong hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT edohomercyernest hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT inyangudoinyanggodwin hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT uzokafaithmichael hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT ekaidemitemobongsamuel hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT mosesanietieeffiong hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT emejemartinsochubiojo hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT tatfengyoutchoumirabeau hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT udoifiokjames hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT anwanaenoabasideborah hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT etimobosoedem hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT geofferyjosephikim hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction AT danemmanuelambrose hybridcomputationalframeworkforintelligentintercontinentsarscov2substrainscharacterizationandprediction |