Cargando…

HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network

AIMS: To develop a tool that can annotate subcellular localization of human proteins. BACKGROUND: With the progression of high throughput human proteomics projects, an enormous amount of protein sequence data has been discovered in the recent past. All these raw sequence data require precise mapping...

Descripción completa

Detalles Bibliográficos
Autores principales: Semwal, Rahul, Varadwaj, Pritish Kumar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Bentham Science Publishers 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7604748/
https://www.ncbi.nlm.nih.gov/pubmed/33214771
http://dx.doi.org/10.2174/1389202921999200528160534
_version_ 1783604193471234048
author Semwal, Rahul
Varadwaj, Pritish Kumar
author_facet Semwal, Rahul
Varadwaj, Pritish Kumar
author_sort Semwal, Rahul
collection PubMed
description AIMS: To develop a tool that can annotate subcellular localization of human proteins. BACKGROUND: With the progression of high throughput human proteomics projects, an enormous amount of protein sequence data has been discovered in the recent past. All these raw sequence data require precise mapping and annotation for their respective biological role and functional attributes. The functional characteristics of protein molecules are highly dependent on the subcellular localization/compartment. Therefore, a fully automated and reliable protein subcellular localization prediction system would be very useful for current proteomic research. OBJECTIVE: To develop a machine learning-based predictive model that can annotate the subcellular localization of human proteins with high accuracy and precision. METHODS: In this study, we used the PSI-CD-HIT homology criterion and utilized the sequence-based features of protein sequences to develop a powerful subcellular localization predictive model. The dataset used to train the HumDLoc model was extracted from a reliable data source, Uniprot knowledge base, which helps the model to generalize on the unseen dataset. RESULTS: The proposed model, HumDLoc, was compared with two of the most widely used techniques: CELLO and DeepLoc, and other machine learning-based tools. The result demonstrated promising predictive performance of HumDLoc model based on various machine learning parameters such as accuracy (≥97.00%), precision (≥0.86), recall (≥0.89), MCC score (≥0.86), ROC curve (0.98 square unit), and precision-recall curve (0.93 square unit). CONCLUSION: In conclusion, HumDLoc was able to outperform several alternative tools for correctly predicting subcellular localization of human proteins. The HumDLoc has been hosted as a web-based tool at https://bioserver.iiita.ac.in/HumDLoc/.
format Online
Article
Text
id pubmed-7604748
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Bentham Science Publishers
record_format MEDLINE/PubMed
spelling pubmed-76047482021-05-01 HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network Semwal, Rahul Varadwaj, Pritish Kumar Curr Genomics Article AIMS: To develop a tool that can annotate subcellular localization of human proteins. BACKGROUND: With the progression of high throughput human proteomics projects, an enormous amount of protein sequence data has been discovered in the recent past. All these raw sequence data require precise mapping and annotation for their respective biological role and functional attributes. The functional characteristics of protein molecules are highly dependent on the subcellular localization/compartment. Therefore, a fully automated and reliable protein subcellular localization prediction system would be very useful for current proteomic research. OBJECTIVE: To develop a machine learning-based predictive model that can annotate the subcellular localization of human proteins with high accuracy and precision. METHODS: In this study, we used the PSI-CD-HIT homology criterion and utilized the sequence-based features of protein sequences to develop a powerful subcellular localization predictive model. The dataset used to train the HumDLoc model was extracted from a reliable data source, Uniprot knowledge base, which helps the model to generalize on the unseen dataset. RESULTS: The proposed model, HumDLoc, was compared with two of the most widely used techniques: CELLO and DeepLoc, and other machine learning-based tools. The result demonstrated promising predictive performance of HumDLoc model based on various machine learning parameters such as accuracy (≥97.00%), precision (≥0.86), recall (≥0.89), MCC score (≥0.86), ROC curve (0.98 square unit), and precision-recall curve (0.93 square unit). CONCLUSION: In conclusion, HumDLoc was able to outperform several alternative tools for correctly predicting subcellular localization of human proteins. The HumDLoc has been hosted as a web-based tool at https://bioserver.iiita.ac.in/HumDLoc/. Bentham Science Publishers 2020-11 2020-11 /pmc/articles/PMC7604748/ /pubmed/33214771 http://dx.doi.org/10.2174/1389202921999200528160534 Text en © 2020 Bentham Science Publishers https://creativecommons.org/licenses/by-nc/4.0/legalcode This is an open access article licensed under the terms of the Creative Commons Attribution-Non-Commercial 4.0 International Public License (CC BY-NC 4.0) (https://creativecommons.org/licenses/by-nc/4.0/legalcode), which permits unrestricted, non-commercial use, distribution and reproduction in any medium, provided the work is properly cited.
spellingShingle Article
Semwal, Rahul
Varadwaj, Pritish Kumar
HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network
title HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network
title_full HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network
title_fullStr HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network
title_full_unstemmed HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network
title_short HumDLoc: Human Protein Subcellular Localization Prediction Using Deep Neural Network
title_sort humdloc: human protein subcellular localization prediction using deep neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7604748/
https://www.ncbi.nlm.nih.gov/pubmed/33214771
http://dx.doi.org/10.2174/1389202921999200528160534
work_keys_str_mv AT semwalrahul humdlochumanproteinsubcellularlocalizationpredictionusingdeepneuralnetwork
AT varadwajpritishkumar humdlochumanproteinsubcellularlocalizationpredictionusingdeepneuralnetwork