Cargando…
Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC
Protein Structure Comparison (PSC) is a well developed field of computational proteomics with active interest from the research community, since it is widely used in structural biology and drug discovery. With new PSC methods continuously emerging and no clear method of choice, Multi-Criteria Protei...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192565/ https://www.ncbi.nlm.nih.gov/pubmed/30332415 http://dx.doi.org/10.1371/journal.pone.0204587 |
_version_ | 1783363921613160448 |
---|---|
author | Sharma, Anuj Manolakos, Elias S. |
author_facet | Sharma, Anuj Manolakos, Elias S. |
author_sort | Sharma, Anuj |
collection | PubMed |
description | Protein Structure Comparison (PSC) is a well developed field of computational proteomics with active interest from the research community, since it is widely used in structural biology and drug discovery. With new PSC methods continuously emerging and no clear method of choice, Multi-Criteria Protein Structure Comparison (MCPSC) is commonly employed to combine methods and generate consensus structural similarity scores. We present pyMCPSC, a Python based utility we developed to allow users to perform MCPSC efficiently, by exploiting the parallelism afforded by the multi-core CPUs of today’s desktop computers. We show how pyMCPSC facilitates the analysis of similarities in protein domain datasets and how it can be extended to incorporate new PSC methods as they are becoming available. We exemplify the power of pyMCPSC using a case study based on the Proteus_300 dataset. Results generated using pyMCPSC show that MCPSC scores form a reliable basis for identifying the true classification of a domain, as evidenced both by the ROC analysis as well as the Nearest-Neighbor analysis. Structure similarity based “Phylogenetic Trees” representation generated by pyMCPSC provide insight into functional grouping within the dataset of domains. Furthermore, scatter plots generated by pyMCPSC show the existence of strong correlation between protein domains belonging to SCOP Class C and loose correlation between those of SCOP Class D. Such analyses and corresponding visualizations help users quickly gain insights about their datasets. The source code of pyMCPSC is available under the GPLv3.0 license through a GitHub repository (https://github.com/xulesc/pymcpsc). |
format | Online Article Text |
id | pubmed-6192565 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-61925652018-11-05 Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC Sharma, Anuj Manolakos, Elias S. PLoS One Research Article Protein Structure Comparison (PSC) is a well developed field of computational proteomics with active interest from the research community, since it is widely used in structural biology and drug discovery. With new PSC methods continuously emerging and no clear method of choice, Multi-Criteria Protein Structure Comparison (MCPSC) is commonly employed to combine methods and generate consensus structural similarity scores. We present pyMCPSC, a Python based utility we developed to allow users to perform MCPSC efficiently, by exploiting the parallelism afforded by the multi-core CPUs of today’s desktop computers. We show how pyMCPSC facilitates the analysis of similarities in protein domain datasets and how it can be extended to incorporate new PSC methods as they are becoming available. We exemplify the power of pyMCPSC using a case study based on the Proteus_300 dataset. Results generated using pyMCPSC show that MCPSC scores form a reliable basis for identifying the true classification of a domain, as evidenced both by the ROC analysis as well as the Nearest-Neighbor analysis. Structure similarity based “Phylogenetic Trees” representation generated by pyMCPSC provide insight into functional grouping within the dataset of domains. Furthermore, scatter plots generated by pyMCPSC show the existence of strong correlation between protein domains belonging to SCOP Class C and loose correlation between those of SCOP Class D. Such analyses and corresponding visualizations help users quickly gain insights about their datasets. The source code of pyMCPSC is available under the GPLv3.0 license through a GitHub repository (https://github.com/xulesc/pymcpsc). Public Library of Science 2018-10-17 /pmc/articles/PMC6192565/ /pubmed/30332415 http://dx.doi.org/10.1371/journal.pone.0204587 Text en © 2018 Sharma, Manolakos http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Sharma, Anuj Manolakos, Elias S. Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC |
title | Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC |
title_full | Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC |
title_fullStr | Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC |
title_full_unstemmed | Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC |
title_short | Multi-criteria protein structure comparison and structural similarities analysis using pyMCPSC |
title_sort | multi-criteria protein structure comparison and structural similarities analysis using pymcpsc |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6192565/ https://www.ncbi.nlm.nih.gov/pubmed/30332415 http://dx.doi.org/10.1371/journal.pone.0204587 |
work_keys_str_mv | AT sharmaanuj multicriteriaproteinstructurecomparisonandstructuralsimilaritiesanalysisusingpymcpsc AT manolakoseliass multicriteriaproteinstructurecomparisonandstructuralsimilaritiesanalysisusingpymcpsc |