Cargando…

Determining human-coronavirus protein-protein interaction using machine intelligence

The Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) virus spread the novel CoronaVirus −19 (nCoV-19) pandemic, resulting in millions of fatalities globally. Recent research demonstrated that the Protein-Protein Interaction (PPI) between SARS-CoV-2 and human proteins is accountable for v...

Descripción completa

Detalles Bibliográficos
Autores principales: Chakraborty, Arijit, Mitra, Sajal, Bhattacharjee, Mainak, De, Debashis, Pal, Anindya J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The Authors. Published by Elsevier B.V. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077817/
https://www.ncbi.nlm.nih.gov/pubmed/37056696
http://dx.doi.org/10.1016/j.medntd.2023.100228
_version_ 1785020387905503232
author Chakraborty, Arijit
Mitra, Sajal
Bhattacharjee, Mainak
De, Debashis
Pal, Anindya J.
author_facet Chakraborty, Arijit
Mitra, Sajal
Bhattacharjee, Mainak
De, Debashis
Pal, Anindya J.
author_sort Chakraborty, Arijit
collection PubMed
description The Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) virus spread the novel CoronaVirus −19 (nCoV-19) pandemic, resulting in millions of fatalities globally. Recent research demonstrated that the Protein-Protein Interaction (PPI) between SARS-CoV-2 and human proteins is accountable for viral pathogenesis. However, many of these PPIs are poorly understood and unexplored, necessitating a more in-depth investigation to find latent yet critical interactions. This article elucidates the host-viral PPI through Machine Learning (ML) lenses and validates the biological significance of the same using web-based tools. ML classifiers are designed based on comprehensive datasets with five sequence-based features of human proteins, namely Amino Acid Composition, Pseudo Amino Acid Composition, Conjoint Triad, Dipeptide Composition, and Normalized Auto Correlation. A majority voting rule-based ensemble method composed of the Random Forest Model (RFM), AdaBoost, and Bagging technique is proposed that delivers encouraging statistical performance compared to other models employed in this work. The proposed ensemble model predicted a total of 111 possible SARS-CoV-2 human target proteins with a high likelihood factor ≥70%, validated by utilizing Gene Ontology (GO) and KEGG pathway enrichment analysis. Consequently, this research can aid in a deeper understanding of the molecular mechanisms underlying viral pathogenesis and provide clues for developing more efficient anti-COVID medications.
format Online
Article
Text
id pubmed-10077817
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher The Authors. Published by Elsevier B.V.
record_format MEDLINE/PubMed
spelling pubmed-100778172023-04-06 Determining human-coronavirus protein-protein interaction using machine intelligence Chakraborty, Arijit Mitra, Sajal Bhattacharjee, Mainak De, Debashis Pal, Anindya J. Med Nov Technol Devices Article The Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) virus spread the novel CoronaVirus −19 (nCoV-19) pandemic, resulting in millions of fatalities globally. Recent research demonstrated that the Protein-Protein Interaction (PPI) between SARS-CoV-2 and human proteins is accountable for viral pathogenesis. However, many of these PPIs are poorly understood and unexplored, necessitating a more in-depth investigation to find latent yet critical interactions. This article elucidates the host-viral PPI through Machine Learning (ML) lenses and validates the biological significance of the same using web-based tools. ML classifiers are designed based on comprehensive datasets with five sequence-based features of human proteins, namely Amino Acid Composition, Pseudo Amino Acid Composition, Conjoint Triad, Dipeptide Composition, and Normalized Auto Correlation. A majority voting rule-based ensemble method composed of the Random Forest Model (RFM), AdaBoost, and Bagging technique is proposed that delivers encouraging statistical performance compared to other models employed in this work. The proposed ensemble model predicted a total of 111 possible SARS-CoV-2 human target proteins with a high likelihood factor ≥70%, validated by utilizing Gene Ontology (GO) and KEGG pathway enrichment analysis. Consequently, this research can aid in a deeper understanding of the molecular mechanisms underlying viral pathogenesis and provide clues for developing more efficient anti-COVID medications. The Authors. Published by Elsevier B.V. 2023-06 2023-04-06 /pmc/articles/PMC10077817/ /pubmed/37056696 http://dx.doi.org/10.1016/j.medntd.2023.100228 Text en © 2023 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Chakraborty, Arijit
Mitra, Sajal
Bhattacharjee, Mainak
De, Debashis
Pal, Anindya J.
Determining human-coronavirus protein-protein interaction using machine intelligence
title Determining human-coronavirus protein-protein interaction using machine intelligence
title_full Determining human-coronavirus protein-protein interaction using machine intelligence
title_fullStr Determining human-coronavirus protein-protein interaction using machine intelligence
title_full_unstemmed Determining human-coronavirus protein-protein interaction using machine intelligence
title_short Determining human-coronavirus protein-protein interaction using machine intelligence
title_sort determining human-coronavirus protein-protein interaction using machine intelligence
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077817/
https://www.ncbi.nlm.nih.gov/pubmed/37056696
http://dx.doi.org/10.1016/j.medntd.2023.100228
work_keys_str_mv AT chakrabortyarijit determininghumancoronavirusproteinproteininteractionusingmachineintelligence
AT mitrasajal determininghumancoronavirusproteinproteininteractionusingmachineintelligence
AT bhattacharjeemainak determininghumancoronavirusproteinproteininteractionusingmachineintelligence
AT dedebashis determininghumancoronavirusproteinproteininteractionusingmachineintelligence
AT palanindyaj determininghumancoronavirusproteinproteininteractionusingmachineintelligence