Cargando…

Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks

BACKGROUND: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intra...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoseini, Adele Sadat Haghighat, Mirzarezaee, Mitra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Institute of Genetic Engineering and Biotechnology 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697825/
https://www.ncbi.nlm.nih.gov/pubmed/31457027
http://dx.doi.org/10.15171/ijb.1933
_version_ 1783444436459454464
author Hoseini, Adele Sadat Haghighat
Mirzarezaee, Mitra
author_facet Hoseini, Adele Sadat Haghighat
Mirzarezaee, Mitra
author_sort Hoseini, Adele Sadat Haghighat
collection PubMed
description BACKGROUND: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from protein sequences. In contrast, protein interactions have been less investigated. OBJECTIVES: As protein interactions usually occur in the same or adjacent places, using this feature to find the location would be efficient and impressive. This study did not aim at increasing the total accuracy of the conducted research. The study has focused on the features of the proteins’ interaction and their employment which lead to a higher accuracy. MATERIALS AND METHODS: In this study, we have examined the protein interaction network as one of the features for prediction of the protein localization and its effects on the prediction results. In this regards, we have gathered some of the most common features including Amino Acid Composition, Dipeptide Compositions, Pseudo Amino Acid Compositions (PseAAC), Position Specific Scoring Matrix (PSSM), Functional Domain, Gene Ontology information, and the Pair-wise sequence alignment. The results of the classification are compared to the ones using protein interactions. For achieving this goal different machine learning algorithms were tested. RESULTS: The best-obtained results of using single feature set obtained using SVM classifier for PseAAC feature. The accuracy of combining all features with PPI data, using the Decision Tree and Random Forest classifiers, was 82.49% and 83.35%, respectively. In another experiment, using just protein interaction data with the different cutting points resulted in obtaining an accuracy of 93.035% for the protein location prediction. CONCLUSION: In total, it was shown that protein(s) interaction has a significant impact on the prediction of the mitochondrial proteins’ location. This feature can separately distinguish the locations well. Using this feature the accuracy of the results is raised up to 5%.
format Online
Article
Text
id pubmed-6697825
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher National Institute of Genetic Engineering and Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-66978252019-08-27 Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks Hoseini, Adele Sadat Haghighat Mirzarezaee, Mitra Iran J Biotechnol Research Article BACKGROUND: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from protein sequences. In contrast, protein interactions have been less investigated. OBJECTIVES: As protein interactions usually occur in the same or adjacent places, using this feature to find the location would be efficient and impressive. This study did not aim at increasing the total accuracy of the conducted research. The study has focused on the features of the proteins’ interaction and their employment which lead to a higher accuracy. MATERIALS AND METHODS: In this study, we have examined the protein interaction network as one of the features for prediction of the protein localization and its effects on the prediction results. In this regards, we have gathered some of the most common features including Amino Acid Composition, Dipeptide Compositions, Pseudo Amino Acid Compositions (PseAAC), Position Specific Scoring Matrix (PSSM), Functional Domain, Gene Ontology information, and the Pair-wise sequence alignment. The results of the classification are compared to the ones using protein interactions. For achieving this goal different machine learning algorithms were tested. RESULTS: The best-obtained results of using single feature set obtained using SVM classifier for PseAAC feature. The accuracy of combining all features with PPI data, using the Decision Tree and Random Forest classifiers, was 82.49% and 83.35%, respectively. In another experiment, using just protein interaction data with the different cutting points resulted in obtaining an accuracy of 93.035% for the protein location prediction. CONCLUSION: In total, it was shown that protein(s) interaction has a significant impact on the prediction of the mitochondrial proteins’ location. This feature can separately distinguish the locations well. Using this feature the accuracy of the results is raised up to 5%. National Institute of Genetic Engineering and Biotechnology 2018-08-11 /pmc/articles/PMC6697825/ /pubmed/31457027 http://dx.doi.org/10.15171/ijb.1933 Text en Copyright © 2017 The Author(s); Published by National Institute of Genetic Engineering and Biotechnology. http://creativecommons.org/licenses/by-nc/4.0/ This is an open access article, distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/) which permits others to copy and redistribute material just in noncommercial usages, provided the original work is properly cited.
spellingShingle Research Article
Hoseini, Adele Sadat Haghighat
Mirzarezaee, Mitra
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
title Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
title_full Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
title_fullStr Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
title_full_unstemmed Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
title_short Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
title_sort prediction of protein sub-mitochondria locations using protein interaction networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697825/
https://www.ncbi.nlm.nih.gov/pubmed/31457027
http://dx.doi.org/10.15171/ijb.1933
work_keys_str_mv AT hoseiniadelesadathaghighat predictionofproteinsubmitochondrialocationsusingproteininteractionnetworks
AT mirzarezaeemitra predictionofproteinsubmitochondrialocationsusingproteininteractionnetworks