Molecular complex detection in protein interaction networks through reinforcement learning

BACKGROUND: Proteins often assemble into higher-order complexes to perform their biological functions. Such protein–protein interactions (PPI) are often experimentally measured for pairs of proteins and summarized in a weighted PPI network, to which community detection algorithms can be applied to d...

Bibliographic Details
Main Authors:	Palukuri, Meghana V., Patil, Ridhi S., Marcotte, Edward M.
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2023
Subjects:	Research
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394916/ https://www.ncbi.nlm.nih.gov/pubmed/37532987 http://dx.doi.org/10.1186/s12859-023-05425-7

_version_	1785083478084157440
author	Palukuri, Meghana V.; Patil, Ridhi S.; Marcotte, Edward M.
author_facet	Palukuri, Meghana V.; Patil, Ridhi S.; Marcotte, Edward M.
author_sort	Palukuri, Meghana V.
collection	PubMed
description	BACKGROUND: Proteins often assemble into higher-order complexes to perform their biological functions. Such protein–protein interactions (PPIs) are often experimentally measured for pairs of proteins and summarized in a weighted PPI network, to which community detection algorithms can be applied to define the various higher-order protein complexes. Current methods include unsupervised and supervised approaches, often assuming that protein complexes manifest only as dense subgraphs. Supervised approaches focus on learning which subgraphs correspond to complexes rather than on how to find them in a network, a search that is currently handled with heuristics. However, learning to walk trajectories on a network to identify protein complexes leads naturally to a reinforcement learning (RL) approach, a strategy not extensively explored for community detection. Here, we develop and evaluate a reinforcement learning pipeline for community detection on weighted protein–protein interaction networks to detect new protein complexes. The algorithm is trained to calculate the value of different subgraphs encountered while walking on the network to reconstruct known complexes. A distributed prediction algorithm then scales the RL pipeline to search for novel protein complexes on large PPI networks. RESULTS: The reinforcement learning pipeline is applied to a human PPI network consisting of 8k proteins and 60k PPIs, which results in 1,157 protein complexes. The method demonstrated competitive accuracy with improved speed compared to previous algorithms. We highlight complexes involving proteins such as C4orf19, C18orf21, and KIAA1522, which are currently minimally characterized. Additionally, the results suggest that TMC04 is a putative additional subunit of the KICSTOR complex and confirm the involvement of C15orf41 in a higher-order complex with HIRA, CDAN1, and ASF1A, as supported by 3D structural modeling. CONCLUSIONS: Reinforcement learning offers several distinct advantages for community detection, including scalability and knowledge of the walk trajectories defining those communities. Applied to currently available human protein interaction networks, this method had comparable accuracy with other algorithms and notable savings in computational time, and in turn led to clear predictions of protein function and interactions for several uncharacterized human proteins. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-023-05425-7.
format	Online Article Text
id	pubmed-10394916
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-10394916 2023-08-03 Molecular complex detection in protein interaction networks through reinforcement learning Palukuri, Meghana V. Patil, Ridhi S. Marcotte, Edward M. BMC Bioinformatics Research BioMed Central 2023-08-02 /pmc/articles/PMC10394916/ /pubmed/37532987 http://dx.doi.org/10.1186/s12859-023-05425-7 Text en © The Author(s) 2023. Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Palukuri, Meghana V. Patil, Ridhi S. Marcotte, Edward M. Molecular complex detection in protein interaction networks through reinforcement learning
title	Molecular complex detection in protein interaction networks through reinforcement learning
title_sort	molecular complex detection in protein interaction networks through reinforcement learning
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394916/ https://www.ncbi.nlm.nih.gov/pubmed/37532987 http://dx.doi.org/10.1186/s12859-023-05425-7
work_keys_str_mv	AT palukurimeghanav molecularcomplexdetectioninproteininteractionnetworksthroughreinforcementlearning AT patilridhis molecularcomplexdetectioninproteininteractionnetworksthroughreinforcementlearning AT marcotteedwardm molecularcomplexdetectioninproteininteractionnetworksthroughreinforcementlearning
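
The description above outlines the idea of learning the value of subgraphs met while walking on a weighted PPI network, then using those values to grow candidate complexes. The following is a minimal sketch of that general idea only, not the authors' pipeline. Everything in it is an assumption made for illustration: the toy network (proteins A to F), the state definition (weighted density in coarse buckets), the +1/-1 rewards, the tabular TD-style update, and the function names.

```python
import random

# Hypothetical toy network: two dense triangles joined by one weak edge.
EDGES = {
    ("A", "B"): 0.9, ("A", "C"): 0.8, ("B", "C"): 0.85,
    ("C", "D"): 0.2,
    ("D", "E"): 0.9, ("D", "F"): 0.8, ("E", "F"): 0.95,
}

def build_graph(edges):
    """Turn an edge-weight dict into a symmetric adjacency dict."""
    graph = {}
    for (u, v), w in edges.items():
        graph.setdefault(u, {})[v] = w
        graph.setdefault(v, {})[u] = w
    return graph

def density_state(graph, nodes):
    """State of a subgraph: its weighted density, bucketed into 0..3 (assumed)."""
    n = len(nodes)
    if n < 2:
        return 0
    total = sum(graph[u].get(v, 0.0) for u in nodes for v in nodes if u < v)
    density = total / (n * (n - 1) / 2)
    return min(int(density * 4), 3)

def frontier(graph, nodes):
    """Neighbors of the current subgraph that are not yet part of it."""
    return sorted({v for u in nodes for v in graph[u]} - set(nodes))

def train(graph, known_complexes, episodes=200, alpha=0.1, gamma=0.5, seed=0):
    """Learn a value per state from random walks seeded inside known complexes."""
    rng = random.Random(seed)
    value = {}
    for _ in range(episodes):
        target = rng.choice(known_complexes)
        nodes = {rng.choice(sorted(target))}
        trajectory = []  # (state entered, reward received on entering it)
        while True:
            candidates = frontier(graph, nodes)
            if not candidates:
                break
            pick = rng.choice(candidates)  # purely exploratory walk
            nodes.add(pick)
            reward = 1.0 if pick in target else -1.0
            trajectory.append((density_state(graph, nodes), reward))
            if reward < 0:  # stepping outside the known complex ends the walk
                break
        # TD-style backup, applied backwards along the finished walk.
        next_value = 0.0
        for state, reward in reversed(trajectory):
            old = value.get(state, 0.0)
            value[state] = old + alpha * (reward + gamma * next_value - old)
            next_value = value[state]
    return value

def predict(graph, value, seed_node):
    """Grow a candidate complex greedily while the best next state has positive value."""
    nodes = {seed_node}
    while True:
        candidates = frontier(graph, nodes)
        if not candidates:
            break
        scored = [(value.get(density_state(graph, nodes | {v}), 0.0), v) for v in candidates]
        best_value = max(score for score, _ in scored)
        if best_value <= 0.0:  # unseen or penalized states stop the walk
            break
        # Ties are broken by taking the alphabetically first candidate.
        nodes.add(next(v for score, v in scored if score == best_value))
    return nodes

graph = build_graph(EDGES)
value = train(graph, [{"A", "B", "C"}])      # train on one "known" complex
print(sorted(predict(graph, value, "E")))    # grows a complex around E
```

In this toy setting the values learned from the A-B-C triangle transfer to the D-E-F triangle because both share the same density state, which is the point of valuing subgraphs by their features rather than by their identity.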
