Cargando…
Assignment of structural domains in proteins using diffusion kernels on graphs
Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on th...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461149/ https://www.ncbi.nlm.nih.gov/pubmed/36076174 http://dx.doi.org/10.1186/s12859-022-04902-9 |
_version_ | 1784786915674816512 |
---|---|
author | Taheri-Ledari, Mohammad Zandieh, Amirali Shariatpanahi, Seyed Peyman Eslahchi, Changiz |
author_facet | Taheri-Ledari, Mohammad Zandieh, Amirali Shariatpanahi, Seyed Peyman Eslahchi, Changiz |
author_sort | Taheri-Ledari, Mohammad |
collection | PubMed |
description | Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on the rise. While majority of the previous efforts for decomposition of 3D structures are centered on the developing clustering algorithms, employing enhanced measures of proximity between the amino acids has remained rather uncharted. If there exists a kernel function that in its reproducing kernel Hilbert space, structural domains of proteins become well separated, then protein structures can be parsed into domains without the need to use a complex clustering algorithm. Inspired by this idea, we developed a protein domain decomposition method based on diffusion kernels on protein graphs. We examined all combinations of four graph node kernels and two clustering algorithms to investigate their capability to decompose protein structures. The proposed method is tested on five of the most commonly used benchmark datasets for protein domain assignment plus a comprehensive non-redundant dataset. The results show a competitive performance of the method utilizing one of the diffusion kernels compared to four of the best automatic methods. Our method is also able to offer alternative partitionings for the same structure which is in line with the subjective definition of protein domain. With a competitive accuracy and balanced performance for the simple and complex structures despite relying on a relatively naive criterion to choose optimal decomposition, the proposed method revealed that diffusion kernels on graphs in particular, and kernel functions in general are promising measures to facilitate parsing proteins into domains and performing different structural analysis on proteins. The size and interconnectedness of the protein graphs make them promising targets for diffusion kernels as measures of affinity between amino acids. The versatility of our method allows the implementation of future kernels with higher performance. The source code of the proposed method is accessible at https://github.com/taherimo/kludo. Also, the proposed method is available as a web application from https://cbph.ir/tools/kludo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04902-9. |
format | Online Article Text |
id | pubmed-9461149 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94611492022-09-10 Assignment of structural domains in proteins using diffusion kernels on graphs Taheri-Ledari, Mohammad Zandieh, Amirali Shariatpanahi, Seyed Peyman Eslahchi, Changiz BMC Bioinformatics Research Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on the rise. While majority of the previous efforts for decomposition of 3D structures are centered on the developing clustering algorithms, employing enhanced measures of proximity between the amino acids has remained rather uncharted. If there exists a kernel function that in its reproducing kernel Hilbert space, structural domains of proteins become well separated, then protein structures can be parsed into domains without the need to use a complex clustering algorithm. Inspired by this idea, we developed a protein domain decomposition method based on diffusion kernels on protein graphs. We examined all combinations of four graph node kernels and two clustering algorithms to investigate their capability to decompose protein structures. The proposed method is tested on five of the most commonly used benchmark datasets for protein domain assignment plus a comprehensive non-redundant dataset. The results show a competitive performance of the method utilizing one of the diffusion kernels compared to four of the best automatic methods. Our method is also able to offer alternative partitionings for the same structure which is in line with the subjective definition of protein domain. With a competitive accuracy and balanced performance for the simple and complex structures despite relying on a relatively naive criterion to choose optimal decomposition, the proposed method revealed that diffusion kernels on graphs in particular, and kernel functions in general are promising measures to facilitate parsing proteins into domains and performing different structural analysis on proteins. The size and interconnectedness of the protein graphs make them promising targets for diffusion kernels as measures of affinity between amino acids. The versatility of our method allows the implementation of future kernels with higher performance. The source code of the proposed method is accessible at https://github.com/taherimo/kludo. Also, the proposed method is available as a web application from https://cbph.ir/tools/kludo. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04902-9. BioMed Central 2022-09-08 /pmc/articles/PMC9461149/ /pubmed/36076174 http://dx.doi.org/10.1186/s12859-022-04902-9 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Taheri-Ledari, Mohammad Zandieh, Amirali Shariatpanahi, Seyed Peyman Eslahchi, Changiz Assignment of structural domains in proteins using diffusion kernels on graphs |
title | Assignment of structural domains in proteins using diffusion kernels on graphs |
title_full | Assignment of structural domains in proteins using diffusion kernels on graphs |
title_fullStr | Assignment of structural domains in proteins using diffusion kernels on graphs |
title_full_unstemmed | Assignment of structural domains in proteins using diffusion kernels on graphs |
title_short | Assignment of structural domains in proteins using diffusion kernels on graphs |
title_sort | assignment of structural domains in proteins using diffusion kernels on graphs |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9461149/ https://www.ncbi.nlm.nih.gov/pubmed/36076174 http://dx.doi.org/10.1186/s12859-022-04902-9 |
work_keys_str_mv | AT taheriledarimohammad assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs AT zandiehamirali assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs AT shariatpanahiseyedpeyman assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs AT eslahchichangiz assignmentofstructuraldomainsinproteinsusingdiffusionkernelsongraphs |