Cargando…

PCP-ML: Protein characterization package for machine learning

BACKGROUND: Machine Learning (ML) has a number of demonstrated applications in protein prediction tasks such as protein structure prediction. To speed further development of machine learning based tools and their release to the community, we have developed a package which characterizes several aspec...

Descripción completa

Detalles Bibliográficos
Autores principales: Eickholt, Jesse, Wang, Zheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4246511/
https://www.ncbi.nlm.nih.gov/pubmed/25406415
http://dx.doi.org/10.1186/1756-0500-7-810
_version_ 1782346527781421056
author Eickholt, Jesse
Wang, Zheng
author_facet Eickholt, Jesse
Wang, Zheng
author_sort Eickholt, Jesse
collection PubMed
description BACKGROUND: Machine Learning (ML) has a number of demonstrated applications in protein prediction tasks such as protein structure prediction. To speed further development of machine learning based tools and their release to the community, we have developed a package which characterizes several aspects of a protein commonly used for protein prediction tasks with machine learning. FINDINGS: A number of software libraries and modules exist for handling protein related data. The package we present in this work, PCP-ML, is unique in its small footprint and emphasis on machine learning. Its primary focus is on characterizing various aspects of a protein through sets of numerical data. The generated data can then be used with machine learning tools and/or techniques. PCP-ML is very flexible in how the generated data is formatted and as a result is compatible with a variety of existing machine learning packages. Given its small size, it can be directly packaged and distributed with community developed tools for protein prediction tasks. CONCLUSIONS: Source code and example programs are available under a BSD license at http://mlid.cps.cmich.edu/eickh1jl/tools/PCPML/. The package is implemented in C++ and accessible as a Python module. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1756-0500-7-810) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4246511
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42465112014-11-29 PCP-ML: Protein characterization package for machine learning Eickholt, Jesse Wang, Zheng BMC Res Notes Technical Note BACKGROUND: Machine Learning (ML) has a number of demonstrated applications in protein prediction tasks such as protein structure prediction. To speed further development of machine learning based tools and their release to the community, we have developed a package which characterizes several aspects of a protein commonly used for protein prediction tasks with machine learning. FINDINGS: A number of software libraries and modules exist for handling protein related data. The package we present in this work, PCP-ML, is unique in its small footprint and emphasis on machine learning. Its primary focus is on characterizing various aspects of a protein through sets of numerical data. The generated data can then be used with machine learning tools and/or techniques. PCP-ML is very flexible in how the generated data is formatted and as a result is compatible with a variety of existing machine learning packages. Given its small size, it can be directly packaged and distributed with community developed tools for protein prediction tasks. CONCLUSIONS: Source code and example programs are available under a BSD license at http://mlid.cps.cmich.edu/eickh1jl/tools/PCPML/. The package is implemented in C++ and accessible as a Python module. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1756-0500-7-810) contains supplementary material, which is available to authorized users. BioMed Central 2014-11-18 /pmc/articles/PMC4246511/ /pubmed/25406415 http://dx.doi.org/10.1186/1756-0500-7-810 Text en © Eickholt and Wang; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Note
Eickholt, Jesse
Wang, Zheng
PCP-ML: Protein characterization package for machine learning
title PCP-ML: Protein characterization package for machine learning
title_full PCP-ML: Protein characterization package for machine learning
title_fullStr PCP-ML: Protein characterization package for machine learning
title_full_unstemmed PCP-ML: Protein characterization package for machine learning
title_short PCP-ML: Protein characterization package for machine learning
title_sort pcp-ml: protein characterization package for machine learning
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4246511/
https://www.ncbi.nlm.nih.gov/pubmed/25406415
http://dx.doi.org/10.1186/1756-0500-7-810
work_keys_str_mv AT eickholtjesse pcpmlproteincharacterizationpackageformachinelearning
AT wangzheng pcpmlproteincharacterizationpackageformachinelearning