Cargando…

Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia

BACKGROUND: Although the etiology of chronic lymphocytic leukemia (CLL), the most common type of adult leukemia, is still unclear, strong evidence implicates antigen involvement in disease ontogeny and evolution. Primary and 3D structure analysis has been utilised in order to discover indications of...

Descripción completa

Detalles Bibliográficos
Autores principales: Polychronidou, Eleftheria, Kalamaras, Ilias, Agathangelidis, Andreas, Sutton, Lesley-Ann, Yan, Xiao-Jie, Bikos, Vasilis, Vardi, Anna, Mochament, Konstantinos, Chiorazzi, Nicholas, Belessi, Chrysoula, Rosenquist, Richard, Ghia, Paolo, Stamatopoulos, Kostas, Vlamos, Panayiotis, Chailyan, Anna, Overby, Nanna, Marcatili, Paolo, Hatzidimitriou, Anastasia, Tzovaras, Dimitrios
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245605/
https://www.ncbi.nlm.nih.gov/pubmed/30453883
http://dx.doi.org/10.1186/s12859-018-2381-1
_version_ 1783372272483958784
author Polychronidou, Eleftheria
Kalamaras, Ilias
Agathangelidis, Andreas
Sutton, Lesley-Ann
Yan, Xiao-Jie
Bikos, Vasilis
Vardi, Anna
Mochament, Konstantinos
Chiorazzi, Nicholas
Belessi, Chrysoula
Rosenquist, Richard
Ghia, Paolo
Stamatopoulos, Kostas
Vlamos, Panayiotis
Chailyan, Anna
Overby, Nanna
Marcatili, Paolo
Hatzidimitriou, Anastasia
Tzovaras, Dimitrios
author_facet Polychronidou, Eleftheria
Kalamaras, Ilias
Agathangelidis, Andreas
Sutton, Lesley-Ann
Yan, Xiao-Jie
Bikos, Vasilis
Vardi, Anna
Mochament, Konstantinos
Chiorazzi, Nicholas
Belessi, Chrysoula
Rosenquist, Richard
Ghia, Paolo
Stamatopoulos, Kostas
Vlamos, Panayiotis
Chailyan, Anna
Overby, Nanna
Marcatili, Paolo
Hatzidimitriou, Anastasia
Tzovaras, Dimitrios
author_sort Polychronidou, Eleftheria
collection PubMed
description BACKGROUND: Although the etiology of chronic lymphocytic leukemia (CLL), the most common type of adult leukemia, is still unclear, strong evidence implicates antigen involvement in disease ontogeny and evolution. Primary and 3D structure analysis has been utilised in order to discover indications of antigenic pressure. The latter has been mostly based on the 3D models of the clonotypic B cell receptor immunoglobulin (BcR IG) amino acid sequences. Therefore, their accuracy is directly dependent on the quality of the model construction algorithms and the specific methods used to compare the ensuing models. Thus far, reliable and robust methods that can group the IG 3D models based on their structural characteristics are missing. RESULTS: Here we propose a novel method for clustering a set of proteins based on their 3D structure focusing on 3D structures of BcR IG from a large series of patients with CLL. The method combines techniques from the areas of bioinformatics, 3D object recognition and machine learning. The clustering procedure is based on the extraction of 3D descriptors, encoding various properties of the local and global geometrical structure of the proteins. The descriptors are extracted from aligned pairs of proteins. A combination of individual 3D descriptors is also used as an additional method. The comparison of the automatically generated clusters to manual annotation by experts shows an increased accuracy when using the 3D descriptors compared to plain bioinformatics-based comparison. The accuracy is increased even more when using the combination of 3D descriptors. CONCLUSIONS: The experimental results verify that the use of 3D descriptors commonly used for 3D object recognition can be effectively applied to distinguishing structural differences of proteins. The proposed approach can be applied to provide hints for the existence of structural groups in a large set of unannotated BcR IG protein files in both CLL and, by logical extension, other contexts where it is relevant to characterize BcR IG structural similarity. The method does not present any limitations in application and can be extended to other types of proteins.
format Online
Article
Text
id pubmed-6245605
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-62456052018-11-26 Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia Polychronidou, Eleftheria Kalamaras, Ilias Agathangelidis, Andreas Sutton, Lesley-Ann Yan, Xiao-Jie Bikos, Vasilis Vardi, Anna Mochament, Konstantinos Chiorazzi, Nicholas Belessi, Chrysoula Rosenquist, Richard Ghia, Paolo Stamatopoulos, Kostas Vlamos, Panayiotis Chailyan, Anna Overby, Nanna Marcatili, Paolo Hatzidimitriou, Anastasia Tzovaras, Dimitrios BMC Bioinformatics Methodology BACKGROUND: Although the etiology of chronic lymphocytic leukemia (CLL), the most common type of adult leukemia, is still unclear, strong evidence implicates antigen involvement in disease ontogeny and evolution. Primary and 3D structure analysis has been utilised in order to discover indications of antigenic pressure. The latter has been mostly based on the 3D models of the clonotypic B cell receptor immunoglobulin (BcR IG) amino acid sequences. Therefore, their accuracy is directly dependent on the quality of the model construction algorithms and the specific methods used to compare the ensuing models. Thus far, reliable and robust methods that can group the IG 3D models based on their structural characteristics are missing. RESULTS: Here we propose a novel method for clustering a set of proteins based on their 3D structure focusing on 3D structures of BcR IG from a large series of patients with CLL. The method combines techniques from the areas of bioinformatics, 3D object recognition and machine learning. The clustering procedure is based on the extraction of 3D descriptors, encoding various properties of the local and global geometrical structure of the proteins. The descriptors are extracted from aligned pairs of proteins. A combination of individual 3D descriptors is also used as an additional method. The comparison of the automatically generated clusters to manual annotation by experts shows an increased accuracy when using the 3D descriptors compared to plain bioinformatics-based comparison. The accuracy is increased even more when using the combination of 3D descriptors. CONCLUSIONS: The experimental results verify that the use of 3D descriptors commonly used for 3D object recognition can be effectively applied to distinguishing structural differences of proteins. The proposed approach can be applied to provide hints for the existence of structural groups in a large set of unannotated BcR IG protein files in both CLL and, by logical extension, other contexts where it is relevant to characterize BcR IG structural similarity. The method does not present any limitations in application and can be extended to other types of proteins. BioMed Central 2018-11-20 /pmc/articles/PMC6245605/ /pubmed/30453883 http://dx.doi.org/10.1186/s12859-018-2381-1 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Polychronidou, Eleftheria
Kalamaras, Ilias
Agathangelidis, Andreas
Sutton, Lesley-Ann
Yan, Xiao-Jie
Bikos, Vasilis
Vardi, Anna
Mochament, Konstantinos
Chiorazzi, Nicholas
Belessi, Chrysoula
Rosenquist, Richard
Ghia, Paolo
Stamatopoulos, Kostas
Vlamos, Panayiotis
Chailyan, Anna
Overby, Nanna
Marcatili, Paolo
Hatzidimitriou, Anastasia
Tzovaras, Dimitrios
Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
title Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
title_full Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
title_fullStr Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
title_full_unstemmed Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
title_short Automated shape-based clustering of 3D immunoglobulin protein structures in chronic lymphocytic leukemia
title_sort automated shape-based clustering of 3d immunoglobulin protein structures in chronic lymphocytic leukemia
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6245605/
https://www.ncbi.nlm.nih.gov/pubmed/30453883
http://dx.doi.org/10.1186/s12859-018-2381-1
work_keys_str_mv AT polychronidoueleftheria automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT kalamarasilias automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT agathangelidisandreas automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT suttonlesleyann automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT yanxiaojie automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT bikosvasilis automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT vardianna automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT mochamentkonstantinos automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT chiorazzinicholas automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT belessichrysoula automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT rosenquistrichard automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT ghiapaolo automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT stamatopouloskostas automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT vlamospanayiotis automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT chailyananna automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT overbynanna automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT marcatilipaolo automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT hatzidimitriouanastasia automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia
AT tzovarasdimitrios automatedshapebasedclusteringof3dimmunoglobulinproteinstructuresinchroniclymphocyticleukemia