Cargando…
Determining human-coronavirus protein-protein interaction using machine intelligence
The Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) virus spread the novel CoronaVirus −19 (nCoV-19) pandemic, resulting in millions of fatalities globally. Recent research demonstrated that the Protein-Protein Interaction (PPI) between SARS-CoV-2 and human proteins is accountable for v...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The Authors. Published by Elsevier B.V.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077817/ https://www.ncbi.nlm.nih.gov/pubmed/37056696 http://dx.doi.org/10.1016/j.medntd.2023.100228 |
_version_ | 1785020387905503232 |
---|---|
author | Chakraborty, Arijit Mitra, Sajal Bhattacharjee, Mainak De, Debashis Pal, Anindya J. |
author_facet | Chakraborty, Arijit Mitra, Sajal Bhattacharjee, Mainak De, Debashis Pal, Anindya J. |
author_sort | Chakraborty, Arijit |
collection | PubMed |
description | The Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) virus spread the novel CoronaVirus −19 (nCoV-19) pandemic, resulting in millions of fatalities globally. Recent research demonstrated that the Protein-Protein Interaction (PPI) between SARS-CoV-2 and human proteins is accountable for viral pathogenesis. However, many of these PPIs are poorly understood and unexplored, necessitating a more in-depth investigation to find latent yet critical interactions. This article elucidates the host-viral PPI through Machine Learning (ML) lenses and validates the biological significance of the same using web-based tools. ML classifiers are designed based on comprehensive datasets with five sequence-based features of human proteins, namely Amino Acid Composition, Pseudo Amino Acid Composition, Conjoint Triad, Dipeptide Composition, and Normalized Auto Correlation. A majority voting rule-based ensemble method composed of the Random Forest Model (RFM), AdaBoost, and Bagging technique is proposed that delivers encouraging statistical performance compared to other models employed in this work. The proposed ensemble model predicted a total of 111 possible SARS-CoV-2 human target proteins with a high likelihood factor ≥70%, validated by utilizing Gene Ontology (GO) and KEGG pathway enrichment analysis. Consequently, this research can aid in a deeper understanding of the molecular mechanisms underlying viral pathogenesis and provide clues for developing more efficient anti-COVID medications. |
format | Online Article Text |
id | pubmed-10077817 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | The Authors. Published by Elsevier B.V. |
record_format | MEDLINE/PubMed |
spelling | pubmed-100778172023-04-06 Determining human-coronavirus protein-protein interaction using machine intelligence Chakraborty, Arijit Mitra, Sajal Bhattacharjee, Mainak De, Debashis Pal, Anindya J. Med Nov Technol Devices Article The Severe Acute Respiratory Syndrome CoronaVirus 2 (SARS-CoV-2) virus spread the novel CoronaVirus −19 (nCoV-19) pandemic, resulting in millions of fatalities globally. Recent research demonstrated that the Protein-Protein Interaction (PPI) between SARS-CoV-2 and human proteins is accountable for viral pathogenesis. However, many of these PPIs are poorly understood and unexplored, necessitating a more in-depth investigation to find latent yet critical interactions. This article elucidates the host-viral PPI through Machine Learning (ML) lenses and validates the biological significance of the same using web-based tools. ML classifiers are designed based on comprehensive datasets with five sequence-based features of human proteins, namely Amino Acid Composition, Pseudo Amino Acid Composition, Conjoint Triad, Dipeptide Composition, and Normalized Auto Correlation. A majority voting rule-based ensemble method composed of the Random Forest Model (RFM), AdaBoost, and Bagging technique is proposed that delivers encouraging statistical performance compared to other models employed in this work. The proposed ensemble model predicted a total of 111 possible SARS-CoV-2 human target proteins with a high likelihood factor ≥70%, validated by utilizing Gene Ontology (GO) and KEGG pathway enrichment analysis. Consequently, this research can aid in a deeper understanding of the molecular mechanisms underlying viral pathogenesis and provide clues for developing more efficient anti-COVID medications. The Authors. Published by Elsevier B.V. 2023-06 2023-04-06 /pmc/articles/PMC10077817/ /pubmed/37056696 http://dx.doi.org/10.1016/j.medntd.2023.100228 Text en © 2023 The Authors Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Chakraborty, Arijit Mitra, Sajal Bhattacharjee, Mainak De, Debashis Pal, Anindya J. Determining human-coronavirus protein-protein interaction using machine intelligence |
title | Determining human-coronavirus protein-protein interaction using machine intelligence |
title_full | Determining human-coronavirus protein-protein interaction using machine intelligence |
title_fullStr | Determining human-coronavirus protein-protein interaction using machine intelligence |
title_full_unstemmed | Determining human-coronavirus protein-protein interaction using machine intelligence |
title_short | Determining human-coronavirus protein-protein interaction using machine intelligence |
title_sort | determining human-coronavirus protein-protein interaction using machine intelligence |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10077817/ https://www.ncbi.nlm.nih.gov/pubmed/37056696 http://dx.doi.org/10.1016/j.medntd.2023.100228 |
work_keys_str_mv | AT chakrabortyarijit determininghumancoronavirusproteinproteininteractionusingmachineintelligence AT mitrasajal determininghumancoronavirusproteinproteininteractionusingmachineintelligence AT bhattacharjeemainak determininghumancoronavirusproteinproteininteractionusingmachineintelligence AT dedebashis determininghumancoronavirusproteinproteininteractionusingmachineintelligence AT palanindyaj determininghumancoronavirusproteinproteininteractionusingmachineintelligence |