Cargando…

Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses

Recent pandemic of COVID-19 (Coronavirus) caused by severe acute respiratory syndrome Coronavirus 2 (SARS-CoV-2) has been growing lethally with unusual speed. It has infected millions of people and continues a mortifying influence on the global population’s health and well-being. In this situation,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmed, Imran, Jeon, Gwanggil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Nature Singapore 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8342660/
https://www.ncbi.nlm.nih.gov/pubmed/34357528
http://dx.doi.org/10.1007/s12539-021-00465-0
_version_ 1783734115304996864
author Ahmed, Imran
Jeon, Gwanggil
author_facet Ahmed, Imran
Jeon, Gwanggil
author_sort Ahmed, Imran
collection PubMed
description Recent pandemic of COVID-19 (Coronavirus) caused by severe acute respiratory syndrome Coronavirus 2 (SARS-CoV-2) has been growing lethally with unusual speed. It has infected millions of people and continues a mortifying influence on the global population’s health and well-being. In this situation, genome sequence analysis and advanced artificial intelligence techniques may help researchers and medical experts to understand the genetic variants of COVID-19 or SARS-CoV-2. Genome sequence analysis of COVID-19 is crucial to understand the virus’s origin, behavior, and structure, which might help produce/develop vaccines, antiviral drugs, and efficient preventive strategies. This paper introduces an artificial intelligence based system to perform genome sequence analysis of COVID-19 and alike viruses, e.g., SARS, middle east respiratory syndrome, and Ebola. The system helps to get important information from the genome sequences of different viruses. We perform comparative data analysis by extracting basic information of COVID-19 and other genome sequences, including information of nucleotides composition and their frequency, tri-nucleotide compositions, count of amino acids, alignment between genome sequences, and their DNA similarity information. We use different visualization methods to analyze these viruses’ genome sequences and, finally, apply machine learning based classifier support vector machine to classify different genome sequences. The data set of different virus genome sequences are obtained from an online publicly accessible data center repository. The system achieves good classification results with an accuracy of 97% for COVID-19, 96%, SARS, and 95% for MERS and Ebola genome sequences, respectively. GRAPHIC ABSTRACT: [Image: see text]
format Online
Article
Text
id pubmed-8342660
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Nature Singapore
record_format MEDLINE/PubMed
spelling pubmed-83426602021-08-06 Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses Ahmed, Imran Jeon, Gwanggil Interdiscip Sci Original Research Article Recent pandemic of COVID-19 (Coronavirus) caused by severe acute respiratory syndrome Coronavirus 2 (SARS-CoV-2) has been growing lethally with unusual speed. It has infected millions of people and continues a mortifying influence on the global population’s health and well-being. In this situation, genome sequence analysis and advanced artificial intelligence techniques may help researchers and medical experts to understand the genetic variants of COVID-19 or SARS-CoV-2. Genome sequence analysis of COVID-19 is crucial to understand the virus’s origin, behavior, and structure, which might help produce/develop vaccines, antiviral drugs, and efficient preventive strategies. This paper introduces an artificial intelligence based system to perform genome sequence analysis of COVID-19 and alike viruses, e.g., SARS, middle east respiratory syndrome, and Ebola. The system helps to get important information from the genome sequences of different viruses. We perform comparative data analysis by extracting basic information of COVID-19 and other genome sequences, including information of nucleotides composition and their frequency, tri-nucleotide compositions, count of amino acids, alignment between genome sequences, and their DNA similarity information. We use different visualization methods to analyze these viruses’ genome sequences and, finally, apply machine learning based classifier support vector machine to classify different genome sequences. The data set of different virus genome sequences are obtained from an online publicly accessible data center repository. The system achieves good classification results with an accuracy of 97% for COVID-19, 96%, SARS, and 95% for MERS and Ebola genome sequences, respectively. GRAPHIC ABSTRACT: [Image: see text] Springer Nature Singapore 2021-08-06 2022 /pmc/articles/PMC8342660/ /pubmed/34357528 http://dx.doi.org/10.1007/s12539-021-00465-0 Text en © International Association of Scientists in the Interdisciplinary Areas 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research Article
Ahmed, Imran
Jeon, Gwanggil
Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
title Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
title_full Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
title_fullStr Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
title_full_unstemmed Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
title_short Enabling Artificial Intelligence for Genome Sequence Analysis of COVID-19 and Alike Viruses
title_sort enabling artificial intelligence for genome sequence analysis of covid-19 and alike viruses
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8342660/
https://www.ncbi.nlm.nih.gov/pubmed/34357528
http://dx.doi.org/10.1007/s12539-021-00465-0
work_keys_str_mv AT ahmedimran enablingartificialintelligenceforgenomesequenceanalysisofcovid19andalikeviruses
AT jeongwanggil enablingartificialintelligenceforgenomesequenceanalysisofcovid19andalikeviruses