Cargando…
NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families
Nuclear receptor proteins (NRP) are transcription factor that regulate many vital cellular processes in animal cells. NRPs form a super-family of phylogenetically related proteins and divided into different sub-families on the basis of ligand characteristics and their functions. In the post-genomic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381360/ https://www.ncbi.nlm.nih.gov/pubmed/25351274 http://dx.doi.org/10.1038/srep06810 |
_version_ | 1782519925370257408 |
---|---|
author | Kumar, Ravindra Kumari, Bandana Srivastava, Abhishikha Kumar, Manish |
author_facet | Kumar, Ravindra Kumari, Bandana Srivastava, Abhishikha Kumar, Manish |
author_sort | Kumar, Ravindra |
collection | PubMed |
description | Nuclear receptor proteins (NRP) are transcription factor that regulate many vital cellular processes in animal cells. NRPs form a super-family of phylogenetically related proteins and divided into different sub-families on the basis of ligand characteristics and their functions. In the post-genomic era, when new proteins are being added to the database in a high-throughput mode, it becomes imperative to identify new NRPs using information from amino acid sequence alone. In this study we report a SVM based two level prediction systems, NRfamPred, using dipeptide composition of proteins as input. At the 1st level, NRfamPred screens whether the query protein is NRP or non-NRP; if the query protein belongs to NRP class, prediction moves to 2nd level and predicts the sub-family. Using leave-one-out cross-validation, we were able to achieve an overall accuracy of 97.88% at the 1st level and an overall accuracy of 98.11% at the 2nd level with dipeptide composition. Benchmarking on independent datasets showed that NRfamPred had comparable accuracy to other existing methods, developed on the same dataset. Our method predicted the existence of 76 NRPs in the human proteome, out of which 14 are novel NRPs. NRfamPred also predicted the sub-families of these 14 NRPs. |
format | Online Article Text |
id | pubmed-5381360 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-53813602017-04-11 NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families Kumar, Ravindra Kumari, Bandana Srivastava, Abhishikha Kumar, Manish Sci Rep Article Nuclear receptor proteins (NRP) are transcription factor that regulate many vital cellular processes in animal cells. NRPs form a super-family of phylogenetically related proteins and divided into different sub-families on the basis of ligand characteristics and their functions. In the post-genomic era, when new proteins are being added to the database in a high-throughput mode, it becomes imperative to identify new NRPs using information from amino acid sequence alone. In this study we report a SVM based two level prediction systems, NRfamPred, using dipeptide composition of proteins as input. At the 1st level, NRfamPred screens whether the query protein is NRP or non-NRP; if the query protein belongs to NRP class, prediction moves to 2nd level and predicts the sub-family. Using leave-one-out cross-validation, we were able to achieve an overall accuracy of 97.88% at the 1st level and an overall accuracy of 98.11% at the 2nd level with dipeptide composition. Benchmarking on independent datasets showed that NRfamPred had comparable accuracy to other existing methods, developed on the same dataset. Our method predicted the existence of 76 NRPs in the human proteome, out of which 14 are novel NRPs. NRfamPred also predicted the sub-families of these 14 NRPs. Nature Publishing Group 2014-10-29 /pmc/articles/PMC5381360/ /pubmed/25351274 http://dx.doi.org/10.1038/srep06810 Text en Copyright © 2014, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ |
spellingShingle | Article Kumar, Ravindra Kumari, Bandana Srivastava, Abhishikha Kumar, Manish NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
title | NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
title_full | NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
title_fullStr | NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
title_full_unstemmed | NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
title_short | NRfamPred: A proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
title_sort | nrfampred: a proteome-scale two level method for prediction of nuclear receptor proteins and their sub-families |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5381360/ https://www.ncbi.nlm.nih.gov/pubmed/25351274 http://dx.doi.org/10.1038/srep06810 |
work_keys_str_mv | AT kumarravindra nrfampredaproteomescaletwolevelmethodforpredictionofnuclearreceptorproteinsandtheirsubfamilies AT kumaribandana nrfampredaproteomescaletwolevelmethodforpredictionofnuclearreceptorproteinsandtheirsubfamilies AT srivastavaabhishikha nrfampredaproteomescaletwolevelmethodforpredictionofnuclearreceptorproteinsandtheirsubfamilies AT kumarmanish nrfampredaproteomescaletwolevelmethodforpredictionofnuclearreceptorproteinsandtheirsubfamilies |