Cargando…

Assignment of structural domains in proteins using diffusion kernels on graphs

Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on th...

Descripción completa

Detalles Bibliográficos
Autores principales: Taheri-Ledari, Mohammad, Zandieh, Amirali, Shariatpanahi, Seyed Peyman, Eslahchi, Changiz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461149/
https://www.ncbi.nlm.nih.gov/pubmed/36076174
http://dx.doi.org/10.1186/s12859-022-04902-9
_version_ 1784786915674816512
author Taheri-Ledari, Mohammad
Zandieh, Amirali
Shariatpanahi, Seyed Peyman
Eslahchi, Changiz
author_facet Taheri-Ledari, Mohammad
Zandieh, Amirali
Shariatpanahi, Seyed Peyman
Eslahchi, Changiz
author_sort Taheri-Ledari, Mohammad
collection PubMed
description Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on the rise. While majority of the previous efforts for decomposition of 3D structures are centered on the developing clustering algorithms, employing enhanced measures of proximity between the amino acids has remained rather uncharted. If there exists a kernel function that in its reproducing kernel Hilbert space, structural domains of proteins become well separated, then protein structures can be parsed into domains without the need to use a complex clustering algorithm. Inspired by this idea, we developed a protein domain decomposition method based on diffusion kernels on protein graphs. We examined all combinations of four graph node kernels and two clustering algorithms to investigate their capability to decompose protein structures. The proposed method is tested on five of the most commonly used benchmark datasets for protein domain assignment plus a comprehensive non-redundant dataset. The results show a competitive performance of the method utilizing one of the diffusion kernels compared to four of the best automatic methods. Our method is also able to offer alternative partitionings for the same structure which is in line with the subjective definition of protein domain. With a competitive accuracy and balanced performance for the simple and complex structures despite relying on a relatively naive criterion to choose optimal decomposition, the proposed method revealed that diffusion kernels on graphs in particular, and kernel functions in general are promising measures to facilitate parsing proteins into domains and performing different structural analysis on proteins. The size and interconnectedness of the protein graphs make them promising targets for diffusion kernels as measures of affinity between amino acids. The versatility of our method allows the implementation of future kernels with higher performance. The source code of the proposed method is accessible at https://github.com/taherimo/kludo. Also, the proposed method is available as a web application from https://cbph.ir/tools/kludo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04902-9.
format Online
Article
Text
id pubmed-9461149
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94611492022-09-10 Assignment of structural domains in proteins using diffusion kernels on graphs Taheri-Ledari, Mohammad Zandieh, Amirali Shariatpanahi, Seyed Peyman Eslahchi, Changiz BMC Bioinformatics Research Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on the rise. While majority of the previous efforts for decomposition of 3D structures are centered on the developing clustering algorithms, employing enhanced measures of proximity between the amino acids has remained rather uncharted. If there exists a kernel function that in its reproducing kernel Hilbert space, structural domains of proteins become well separated, then protein structures can be parsed into domains without the need to use a complex clustering algorithm. Inspired by this idea, we developed a protein domain decomposition method based on diffusion kernels on protein graphs. We examined all combinations of four graph node kernels and two clustering algorithms to investigate their capability to decompose protein structures. The proposed method is tested on five of the most commonly used benchmark datasets for protein domain assignment plus a comprehensive non-redundant dataset. The results show a competitive performance of the method utilizing one of the diffusion kernels compared to four of the best automatic methods. Our method is also able to offer alternative partitionings for the same structure which is in line with the subjective definition of protein domain. With a competitive accuracy and balanced performance for the simple and complex structures despite relying on a relatively naive criterion to choose optimal decomposition, the proposed method revealed that diffusion kernels on graphs in particular, and kernel functions in general are promising measures to facilitate parsing proteins into domains and performing different structural analysis on proteins. The size and interconnectedness of the protein graphs make them promising targets for diffusion kernels as measures of affinity between amino acids. The versatility of our method allows the implementation of future kernels with higher performance. The source code of the proposed method is accessible at https://github.com/taherimo/kludo. Also, the proposed method is available as a web application from https://cbph.ir/tools/kludo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04902-9. BioMed Central 2022-09-08 /pmc/articles/PMC9461149/ /pubmed/36076174 http://dx.doi.org/10.1186/s12859-022-04902-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Taheri-Ledari, Mohammad
Zandieh, Amirali
Shariatpanahi, Seyed Peyman
Eslahchi, Changiz
Assignment of structural domains in proteins using diffusion kernels on graphs
title Assignment of structural domains in proteins using diffusion kernels on graphs
title_full Assignment of structural domains in proteins using diffusion kernels on graphs
title_fullStr Assignment of structural domains in proteins using diffusion kernels on graphs
title_full_unstemmed Assignment of structural domains in proteins using diffusion kernels on graphs
title_short Assignment of structural domains in proteins using diffusion kernels on graphs
title_sort assignment of structural domains in proteins using diffusion kernels on graphs
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461149/
https://www.ncbi.nlm.nih.gov/pubmed/36076174
http://dx.doi.org/10.1186/s12859-022-04902-9
work_keys_str_mv AT taheriledarimohammad assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs
AT zandiehamirali assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs
AT shariatpanahiseyedpeyman assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs
AT eslahchichangiz assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs