Cargando…
Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks
BACKGROUND: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intra...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
National Institute of Genetic Engineering and Biotechnology
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697825/ https://www.ncbi.nlm.nih.gov/pubmed/31457027 http://dx.doi.org/10.15171/ijb.1933 |
_version_ | 1783444436459454464 |
---|---|
author | Hoseini, Adele Sadat Haghighat Mirzarezaee, Mitra |
author_facet | Hoseini, Adele Sadat Haghighat Mirzarezaee, Mitra |
author_sort | Hoseini, Adele Sadat Haghighat |
collection | PubMed |
description | BACKGROUND: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from protein sequences. In contrast, protein interactions have been less investigated. OBJECTIVES: As protein interactions usually occur in the same or adjacent places, using this feature to find the location would be efficient and impressive. This study did not aim at increasing the total accuracy of the conducted research. The study has focused on the features of the proteins’ interaction and their employment which lead to a higher accuracy. MATERIALS AND METHODS: In this study, we have examined the protein interaction network as one of the features for prediction of the protein localization and its effects on the prediction results. In this regards, we have gathered some of the most common features including Amino Acid Composition, Dipeptide Compositions, Pseudo Amino Acid Compositions (PseAAC), Position Specific Scoring Matrix (PSSM), Functional Domain, Gene Ontology information, and the Pair-wise sequence alignment. The results of the classification are compared to the ones using protein interactions. For achieving this goal different machine learning algorithms were tested. RESULTS: The best-obtained results of using single feature set obtained using SVM classifier for PseAAC feature. The accuracy of combining all features with PPI data, using the Decision Tree and Random Forest classifiers, was 82.49% and 83.35%, respectively. In another experiment, using just protein interaction data with the different cutting points resulted in obtaining an accuracy of 93.035% for the protein location prediction. CONCLUSION: In total, it was shown that protein(s) interaction has a significant impact on the prediction of the mitochondrial proteins’ location. This feature can separately distinguish the locations well. Using this feature the accuracy of the results is raised up to 5%. |
format | Online Article Text |
id | pubmed-6697825 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | National Institute of Genetic Engineering and Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-66978252019-08-27 Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks Hoseini, Adele Sadat Haghighat Mirzarezaee, Mitra Iran J Biotechnol Research Article BACKGROUND: Prediction of the protein localization is among the most important issues in the bioinformatics that is used for the prediction of the proteins in the cells and organelles such as mitochondria. In this study, several machine learning algorithms are applied for the prediction of the intracellular protein locations. These algorithms use the features extracted from protein sequences. In contrast, protein interactions have been less investigated. OBJECTIVES: As protein interactions usually occur in the same or adjacent places, using this feature to find the location would be efficient and impressive. This study did not aim at increasing the total accuracy of the conducted research. The study has focused on the features of the proteins’ interaction and their employment which lead to a higher accuracy. MATERIALS AND METHODS: In this study, we have examined the protein interaction network as one of the features for prediction of the protein localization and its effects on the prediction results. In this regards, we have gathered some of the most common features including Amino Acid Composition, Dipeptide Compositions, Pseudo Amino Acid Compositions (PseAAC), Position Specific Scoring Matrix (PSSM), Functional Domain, Gene Ontology information, and the Pair-wise sequence alignment. The results of the classification are compared to the ones using protein interactions. For achieving this goal different machine learning algorithms were tested. RESULTS: The best-obtained results of using single feature set obtained using SVM classifier for PseAAC feature. The accuracy of combining all features with PPI data, using the Decision Tree and Random Forest classifiers, was 82.49% and 83.35%, respectively. In another experiment, using just protein interaction data with the different cutting points resulted in obtaining an accuracy of 93.035% for the protein location prediction. CONCLUSION: In total, it was shown that protein(s) interaction has a significant impact on the prediction of the mitochondrial proteins’ location. This feature can separately distinguish the locations well. Using this feature the accuracy of the results is raised up to 5%. National Institute of Genetic Engineering and Biotechnology 2018-08-11 /pmc/articles/PMC6697825/ /pubmed/31457027 http://dx.doi.org/10.15171/ijb.1933 Text en Copyright © 2017 The Author(s); Published by National Institute of Genetic Engineering and Biotechnology. http://creativecommons.org/licenses/by-nc/4.0/ This is an open access article, distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International License (http://creativecommons.org/licenses/by-nc/4.0/) which permits others to copy and redistribute material just in noncommercial usages, provided the original work is properly cited. |
spellingShingle | Research Article Hoseini, Adele Sadat Haghighat Mirzarezaee, Mitra Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks |
title | Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks |
title_full | Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks |
title_fullStr | Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks |
title_full_unstemmed | Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks |
title_short | Prediction of Protein Sub-Mitochondria Locations Using Protein Interaction Networks |
title_sort | prediction of protein sub-mitochondria locations using protein interaction networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6697825/ https://www.ncbi.nlm.nih.gov/pubmed/31457027 http://dx.doi.org/10.15171/ijb.1933 |
work_keys_str_mv | AT hoseiniadelesadathaghighat predictionofproteinsubmitochondrialocationsusingproteininteractionnetworks AT mirzarezaeemitra predictionofproteinsubmitochondrialocationsusingproteininteractionnetworks |