Cargando…

Using artificial intelligence techniques for COVID-19 genome analysis

The genome of the novel coronavirus (COVID-19) disease was first sequenced in January 2020, approximately a month after its emergence in Wuhan, capital of Hubei province, China. COVID-19 genome sequencing is critical to understanding the virus behavior, its origin, how fast it mutates, and for the d...

Descripción completa

Detalles Bibliográficos
Autores principales: Nawaz, M. Saqib, Fournier-Viger, Philippe, Shojaee, Abbas, Fujita, Hamido
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7888282/
https://www.ncbi.nlm.nih.gov/pubmed/34764587
http://dx.doi.org/10.1007/s10489-021-02193-w
_version_ 1783652130580594688
author Nawaz, M. Saqib
Fournier-Viger, Philippe
Shojaee, Abbas
Fujita, Hamido
author_facet Nawaz, M. Saqib
Fournier-Viger, Philippe
Shojaee, Abbas
Fujita, Hamido
author_sort Nawaz, M. Saqib
collection PubMed
description The genome of the novel coronavirus (COVID-19) disease was first sequenced in January 2020, approximately a month after its emergence in Wuhan, capital of Hubei province, China. COVID-19 genome sequencing is critical to understanding the virus behavior, its origin, how fast it mutates, and for the development of drugs/vaccines and effective preventive strategies. This paper investigates the use of artificial intelligence techniques to learn interesting information from COVID-19 genome sequences. Sequential pattern mining (SPM) is first applied on a computer-understandable corpus of COVID-19 genome sequences to see if interesting hidden patterns can be found, which reveal frequent patterns of nucleotide bases and their relationships with each other. Second, sequence prediction models are applied to the corpus to evaluate if nucleotide base(s) can be predicted from previous ones. Third, for mutation analysis in genome sequences, an algorithm is designed to find the locations in the genome sequences where the nucleotide bases are changed and to calculate the mutation rate. Obtained results suggest that SPM and mutation analysis techniques can reveal interesting information and patterns in COVID-19 genome sequences to examine the evolution and variations in COVID-19 strains respectively.
format Online
Article
Text
id pubmed-7888282
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-78882822021-02-17 Using artificial intelligence techniques for COVID-19 genome analysis Nawaz, M. Saqib Fournier-Viger, Philippe Shojaee, Abbas Fujita, Hamido Appl Intell (Dordr) Article The genome of the novel coronavirus (COVID-19) disease was first sequenced in January 2020, approximately a month after its emergence in Wuhan, capital of Hubei province, China. COVID-19 genome sequencing is critical to understanding the virus behavior, its origin, how fast it mutates, and for the development of drugs/vaccines and effective preventive strategies. This paper investigates the use of artificial intelligence techniques to learn interesting information from COVID-19 genome sequences. Sequential pattern mining (SPM) is first applied on a computer-understandable corpus of COVID-19 genome sequences to see if interesting hidden patterns can be found, which reveal frequent patterns of nucleotide bases and their relationships with each other. Second, sequence prediction models are applied to the corpus to evaluate if nucleotide base(s) can be predicted from previous ones. Third, for mutation analysis in genome sequences, an algorithm is designed to find the locations in the genome sequences where the nucleotide bases are changed and to calculate the mutation rate. Obtained results suggest that SPM and mutation analysis techniques can reveal interesting information and patterns in COVID-19 genome sequences to examine the evolution and variations in COVID-19 strains respectively. Springer US 2021-02-17 2021 /pmc/articles/PMC7888282/ /pubmed/34764587 http://dx.doi.org/10.1007/s10489-021-02193-w Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC part of Springer Nature 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Nawaz, M. Saqib
Fournier-Viger, Philippe
Shojaee, Abbas
Fujita, Hamido
Using artificial intelligence techniques for COVID-19 genome analysis
title Using artificial intelligence techniques for COVID-19 genome analysis
title_full Using artificial intelligence techniques for COVID-19 genome analysis
title_fullStr Using artificial intelligence techniques for COVID-19 genome analysis
title_full_unstemmed Using artificial intelligence techniques for COVID-19 genome analysis
title_short Using artificial intelligence techniques for COVID-19 genome analysis
title_sort using artificial intelligence techniques for covid-19 genome analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7888282/
https://www.ncbi.nlm.nih.gov/pubmed/34764587
http://dx.doi.org/10.1007/s10489-021-02193-w
work_keys_str_mv AT nawazmsaqib usingartificialintelligencetechniquesforcovid19genomeanalysis
AT fourniervigerphilippe usingartificialintelligencetechniquesforcovid19genomeanalysis
AT shojaeeabbas usingartificialintelligencetechniquesforcovid19genomeanalysis
AT fujitahamido usingartificialintelligencetechniquesforcovid19genomeanalysis