Cargando…

Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations

Multiple kernel learning is a paradigm which employs a properly constructed chain of kernel functions able to simultaneously analyse different data or different representations of the same data. In this paper, we propose an hybrid classification system based on a linear combination of multiple kerne...

Descripción completa

Detalles Bibliográficos
Autores principales: Martino, Alessio, De Santis, Enrico, Giuliani, Alessandro, Rizzi, Antonello
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517365/
https://www.ncbi.nlm.nih.gov/pubmed/33286565
http://dx.doi.org/10.3390/e22070794
_version_ 1783587213522501632
author Martino, Alessio
De Santis, Enrico
Giuliani, Alessandro
Rizzi, Antonello
author_facet Martino, Alessio
De Santis, Enrico
Giuliani, Alessandro
Rizzi, Antonello
author_sort Martino, Alessio
collection PubMed
description Multiple kernel learning is a paradigm which employs a properly constructed chain of kernel functions able to simultaneously analyse different data or different representations of the same data. In this paper, we propose an hybrid classification system based on a linear combination of multiple kernels defined over multiple dissimilarity spaces. The core of the training procedure is the joint optimisation of kernel weights and representatives selection in the dissimilarity spaces. This equips the system with a two-fold knowledge discovery phase: by analysing the weights, it is possible to check which representations are more suitable for solving the classification problem, whereas the pivotal patterns selected as representatives can give further insights on the modelled system, possibly with the help of field-experts. The proposed classification system is tested on real proteomic data in order to predict proteins’ functional role starting from their folded structure: specifically, a set of eight representations are drawn from the graph-based protein folded description. The proposed multiple kernel-based system has also been benchmarked against a clustering-based classification system also able to exploit multiple dissimilarities simultaneously. Computational results show remarkable classification capabilities and the knowledge discovery analysis is in line with current biological knowledge, suggesting the reliability of the proposed system.
format Online
Article
Text
id pubmed-7517365
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75173652020-11-09 Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations Martino, Alessio De Santis, Enrico Giuliani, Alessandro Rizzi, Antonello Entropy (Basel) Article Multiple kernel learning is a paradigm which employs a properly constructed chain of kernel functions able to simultaneously analyse different data or different representations of the same data. In this paper, we propose an hybrid classification system based on a linear combination of multiple kernels defined over multiple dissimilarity spaces. The core of the training procedure is the joint optimisation of kernel weights and representatives selection in the dissimilarity spaces. This equips the system with a two-fold knowledge discovery phase: by analysing the weights, it is possible to check which representations are more suitable for solving the classification problem, whereas the pivotal patterns selected as representatives can give further insights on the modelled system, possibly with the help of field-experts. The proposed classification system is tested on real proteomic data in order to predict proteins’ functional role starting from their folded structure: specifically, a set of eight representations are drawn from the graph-based protein folded description. The proposed multiple kernel-based system has also been benchmarked against a clustering-based classification system also able to exploit multiple dissimilarities simultaneously. Computational results show remarkable classification capabilities and the knowledge discovery analysis is in line with current biological knowledge, suggesting the reliability of the proposed system. MDPI 2020-07-21 /pmc/articles/PMC7517365/ /pubmed/33286565 http://dx.doi.org/10.3390/e22070794 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Martino, Alessio
De Santis, Enrico
Giuliani, Alessandro
Rizzi, Antonello
Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations
title Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations
title_full Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations
title_fullStr Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations
title_full_unstemmed Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations
title_short Modelling and Recognition of Protein Contact Networks by Multiple Kernel Learning and Dissimilarity Representations
title_sort modelling and recognition of protein contact networks by multiple kernel learning and dissimilarity representations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7517365/
https://www.ncbi.nlm.nih.gov/pubmed/33286565
http://dx.doi.org/10.3390/e22070794
work_keys_str_mv AT martinoalessio modellingandrecognitionofproteincontactnetworksbymultiplekernellearninganddissimilarityrepresentations
AT desantisenrico modellingandrecognitionofproteincontactnetworksbymultiplekernellearninganddissimilarityrepresentations
AT giulianialessandro modellingandrecognitionofproteincontactnetworksbymultiplekernellearninganddissimilarityrepresentations
AT rizziantonello modellingandrecognitionofproteincontactnetworksbymultiplekernellearninganddissimilarityrepresentations