Cargando…

Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy

Predicting the physical or functional associations through protein-protein interactions (PPIs) represents an integral approach for inferring novel protein functions and discovering new drug targets during repositioning analysis. Recent advances in high-throughput data generation and multi-omics tech...

Descripción completa

Detalles Bibliográficos
Autores principales: Martins, Yasmmin Côrtes, Ziviani, Artur, Nicolás, Marisa Fabiana, de Vasconcelos, Ana Tereza Ribeiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581021/
https://www.ncbi.nlm.nih.gov/pubmed/36303787
http://dx.doi.org/10.3389/fbinf.2021.731345
_version_ 1784812524125814784
author Martins, Yasmmin Côrtes
Ziviani, Artur
Nicolás, Marisa Fabiana
de Vasconcelos, Ana Tereza Ribeiro
author_facet Martins, Yasmmin Côrtes
Ziviani, Artur
Nicolás, Marisa Fabiana
de Vasconcelos, Ana Tereza Ribeiro
author_sort Martins, Yasmmin Côrtes
collection PubMed
description Predicting the physical or functional associations through protein-protein interactions (PPIs) represents an integral approach for inferring novel protein functions and discovering new drug targets during repositioning analysis. Recent advances in high-throughput data generation and multi-omics techniques have enabled large-scale PPI predictions, thus promoting several computational methods based on different levels of biological evidence. However, integrating multiple results and strategies to optimize, extract interaction features automatically and scale up the entire PPI prediction process is still challenging. Most procedures do not offer an in-silico validation process to evaluate the predicted PPIs. In this context, this paper presents the PredPrIn scientific workflow that enables PPI prediction based on multiple lines of evidence, including the structure, sequence, and functional annotation categories, by combining boosting and stacking machine learning techniques. We also present a pipeline (PPIVPro) for the validation process based on cellular co-localization filtering and a focused search of PPI evidence on scientific publications. Thus, our combined approach provides means to extensive scale training or prediction of new PPIs and a strategy to evaluate the prediction quality. PredPrIn and PPIVPro are publicly available at https://github.com/YasCoMa/predprin and https://github.com/YasCoMa/ppi_validation_process.
format Online
Article
Text
id pubmed-9581021
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95810212022-10-26 Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy Martins, Yasmmin Côrtes Ziviani, Artur Nicolás, Marisa Fabiana de Vasconcelos, Ana Tereza Ribeiro Front Bioinform Bioinformatics Predicting the physical or functional associations through protein-protein interactions (PPIs) represents an integral approach for inferring novel protein functions and discovering new drug targets during repositioning analysis. Recent advances in high-throughput data generation and multi-omics techniques have enabled large-scale PPI predictions, thus promoting several computational methods based on different levels of biological evidence. However, integrating multiple results and strategies to optimize, extract interaction features automatically and scale up the entire PPI prediction process is still challenging. Most procedures do not offer an in-silico validation process to evaluate the predicted PPIs. In this context, this paper presents the PredPrIn scientific workflow that enables PPI prediction based on multiple lines of evidence, including the structure, sequence, and functional annotation categories, by combining boosting and stacking machine learning techniques. We also present a pipeline (PPIVPro) for the validation process based on cellular co-localization filtering and a focused search of PPI evidence on scientific publications. Thus, our combined approach provides means to extensive scale training or prediction of new PPIs and a strategy to evaluate the prediction quality. PredPrIn and PPIVPro are publicly available at https://github.com/YasCoMa/predprin and https://github.com/YasCoMa/ppi_validation_process. Frontiers Media S.A. 2021-09-06 /pmc/articles/PMC9581021/ /pubmed/36303787 http://dx.doi.org/10.3389/fbinf.2021.731345 Text en Copyright © 2021 Martins, Ziviani, Nicolás and de Vasconcelos. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioinformatics
Martins, Yasmmin Côrtes
Ziviani, Artur
Nicolás, Marisa Fabiana
de Vasconcelos, Ana Tereza Ribeiro
Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
title Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
title_full Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
title_fullStr Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
title_full_unstemmed Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
title_short Large-Scale Protein Interactions Prediction by Multiple Evidence Analysis Associated With an In-Silico Curation Strategy
title_sort large-scale protein interactions prediction by multiple evidence analysis associated with an in-silico curation strategy
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581021/
https://www.ncbi.nlm.nih.gov/pubmed/36303787
http://dx.doi.org/10.3389/fbinf.2021.731345
work_keys_str_mv AT martinsyasmmincortes largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy
AT zivianiartur largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy
AT nicolasmarisafabiana largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy
AT devasconcelosanaterezaribeiro largescaleproteininteractionspredictionbymultipleevidenceanalysisassociatedwithaninsilicocurationstrategy