Cargando…

Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC

Protein Structure Comparison (PSC) is a well developed field of computational proteomics with active interest from the research community, since it is widely used in structural biology and drug discovery. With new PSC methods continuously emerging and no clear method of choice, Multi-Criteria Protei...

Descripción completa

Detalles Bibliográficos
Autores principales: Sharma, Anuj, Manolakos, Elias S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192565/
https://www.ncbi.nlm.nih.gov/pubmed/30332415
http://dx.doi.org/10.1371/journal.pone.0204587
_version_ 1783363921613160448
author Sharma, Anuj
Manolakos, Elias S.
author_facet Sharma, Anuj
Manolakos, Elias S.
author_sort Sharma, Anuj
collection PubMed
description Protein Structure Comparison (PSC) is a well developed field of computational proteomics with active interest from the research community, since it is widely used in structural biology and drug discovery. With new PSC methods continuously emerging and no clear method of choice, Multi-Criteria Protein Structure Comparison (MCPSC) is commonly employed to combine methods and generate consensus structural similarity scores. We present pyMCPSC, a Python based utility we developed to allow users to perform MCPSC efficiently, by exploiting the parallelism afforded by the multi-core CPUs of today’s desktop computers. We show how pyMCPSC facilitates the analysis of similarities in protein domain datasets and how it can be extended to incorporate new PSC methods as they are becoming available. We exemplify the power of pyMCPSC using a case study based on the Proteus_300 dataset. Results generated using pyMCPSC show that MCPSC scores form a reliable basis for identifying the true classification of a domain, as evidenced both by the ROC analysis as well as the Nearest-Neighbor analysis. Structure similarity based “Phylogenetic Trees” representation generated by pyMCPSC provide insight into functional grouping within the dataset of domains. Furthermore, scatter plots generated by pyMCPSC show the existence of strong correlation between protein domains belonging to SCOP Class C and loose correlation between those of SCOP Class D. Such analyses and corresponding visualizations help users quickly gain insights about their datasets. The source code of pyMCPSC is available under the GPLv3.0 license through a GitHub repository (https://github.com/xulesc/pymcpsc).
format Online
Article
Text
id pubmed-6192565
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61925652018-11-05 Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC Sharma, Anuj Manolakos, Elias S. PLoS One Research Article Protein Structure Comparison (PSC) is a well developed field of computational proteomics with active interest from the research community, since it is widely used in structural biology and drug discovery. With new PSC methods continuously emerging and no clear method of choice, Multi-Criteria Protein Structure Comparison (MCPSC) is commonly employed to combine methods and generate consensus structural similarity scores. We present pyMCPSC, a Python based utility we developed to allow users to perform MCPSC efficiently, by exploiting the parallelism afforded by the multi-core CPUs of today’s desktop computers. We show how pyMCPSC facilitates the analysis of similarities in protein domain datasets and how it can be extended to incorporate new PSC methods as they are becoming available. We exemplify the power of pyMCPSC using a case study based on the Proteus_300 dataset. Results generated using pyMCPSC show that MCPSC scores form a reliable basis for identifying the true classification of a domain, as evidenced both by the ROC analysis as well as the Nearest-Neighbor analysis. Structure similarity based “Phylogenetic Trees” representation generated by pyMCPSC provide insight into functional grouping within the dataset of domains. Furthermore, scatter plots generated by pyMCPSC show the existence of strong correlation between protein domains belonging to SCOP Class C and loose correlation between those of SCOP Class D. Such analyses and corresponding visualizations help users quickly gain insights about their datasets. The source code of pyMCPSC is available under the GPLv3.0 license through a GitHub repository (https://github.com/xulesc/pymcpsc). Public Library of Science 2018-10-17 /pmc/articles/PMC6192565/ /pubmed/30332415 http://dx.doi.org/10.1371/journal.pone.0204587 Text en © 2018 Sharma, Manolakos http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Sharma, Anuj
Manolakos, Elias S.
Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
title Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
title_full Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
title_fullStr Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
title_full_unstemmed Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
title_short Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
title_sort multi-criteria protein structure comparison and structural similarities analysis using pymcpsc
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192565/
https://www.ncbi.nlm.nih.gov/pubmed/30332415
http://dx.doi.org/10.1371/journal.pone.0204587
work_keys_str_mv AT sharmaanuj multicriteriaproteinstructurecomparisonandstructuralsimilaritiesanalysisusingpymcpsc
AT manolakoseliass multicriteriaproteinstructurecomparisonandstructuralsimilaritiesanalysisusingpymcpsc