Cargando…
Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
Predicting the physical or functional associations through protein-protein interactions (PPIs) represents an integral approach for inferring novel protein functions and discovering new drug targets during repositioning analysis. Recent advances in high-throughput data generation and multi-omics tech...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581021/ https://www.ncbi.nlm.nih.gov/pubmed/36303787 http://dx.doi.org/10.3389/fbinf.2021.731345 |
_version_ | 1784812524125814784 |
---|---|
author | Martins, Yasmmin Côrtes Ziviani, Artur Nicolás, Marisa Fabiana de Vasconcelos, Ana Tereza Ribeiro |
author_facet | Martins, Yasmmin Côrtes Ziviani, Artur Nicolás, Marisa Fabiana de Vasconcelos, Ana Tereza Ribeiro |
author_sort | Martins, Yasmmin Côrtes |
collection | PubMed |
description | Predicting the physical or functional associations through protein-protein interactions (PPIs) represents an integral approach for inferring novel protein functions and discovering new drug targets during repositioning analysis. Recent advances in high-throughput data generation and multi-omics techniques have enabled large-scale PPI predictions, thus promoting several computational methods based on different levels of biological evidence. However, integrating multiple results and strategies to optimize, extract interaction features automatically and scale up the entire PPI prediction process is still challenging. Most procedures do not offer an in-silico validation process to evaluate the predicted PPIs. In this context, this paper presents the PredPrIn scientific workflow that enables PPI prediction based on multiple lines of evidence, including the structure, sequence, and functional annotation categories, by combining boosting and stacking machine learning techniques. We also present a pipeline (PPIVPro) for the validation process based on cellular co-localization filtering and a focused search of PPI evidence on scientific publications. Thus, our combined approach provides means to extensive scale training or prediction of new PPIs and a strategy to evaluate the prediction quality. PredPrIn and PPIVPro are publicly available at https://github.com/YasCoMa/predprin and https://github.com/YasCoMa/ppi_validation_process. |
format | Online Article Text |
id | pubmed-9581021 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95810212022-10-26 Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy Martins, Yasmmin Côrtes Ziviani, Artur Nicolás, Marisa Fabiana de Vasconcelos, Ana Tereza Ribeiro Front Bioinform Bioinformatics Predicting the physical or functional associations through protein-protein interactions (PPIs) represents an integral approach for inferring novel protein functions and discovering new drug targets during repositioning analysis. Recent advances in high-throughput data generation and multi-omics techniques have enabled large-scale PPI predictions, thus promoting several computational methods based on different levels of biological evidence. However, integrating multiple results and strategies to optimize, extract interaction features automatically and scale up the entire PPI prediction process is still challenging. Most procedures do not offer an in-silico validation process to evaluate the predicted PPIs. In this context, this paper presents the PredPrIn scientific workflow that enables PPI prediction based on multiple lines of evidence, including the structure, sequence, and functional annotation categories, by combining boosting and stacking machine learning techniques. We also present a pipeline (PPIVPro) for the validation process based on cellular co-localization filtering and a focused search of PPI evidence on scientific publications. Thus, our combined approach provides means to extensive scale training or prediction of new PPIs and a strategy to evaluate the prediction quality. PredPrIn and PPIVPro are publicly available at https://github.com/YasCoMa/predprin and https://github.com/YasCoMa/ppi_validation_process. Frontiers Media S.A. 2021-09-06 /pmc/articles/PMC9581021/ /pubmed/36303787 http://dx.doi.org/10.3389/fbinf.2021.731345 Text en Copyright © 2021 Martins, Ziviani, Nicolás and de Vasconcelos. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Bioinformatics Martins, Yasmmin Côrtes Ziviani, Artur Nicolás, Marisa Fabiana de Vasconcelos, Ana Tereza Ribeiro Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy |
title | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy |
title_full | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy |
title_fullStr | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy |
title_full_unstemmed | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy |
title_short | Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy |
title_sort | large-scale protein interactions prediction by multiple evidence analysis associated with an in-silico curation strategy |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581021/ https://www.ncbi.nlm.nih.gov/pubmed/36303787 http://dx.doi.org/10.3389/fbinf.2021.731345 |
work_keys_str_mv | AT martinsyasmmincortes largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy AT zivianiartur largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy AT nicolasmarisafabiana largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy AT devasconcelosanaterezaribeiro largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy |