Cargando…

A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments

Transmembrane helix (TMH) topology prediction is becoming a focal problem in bioinformatics because the structure of TM proteins is difficult to determine using experimental methods. Therefore, methods that can computationally predict the topology of helical membrane proteins are highly desirable. I...

Descripción completa

Detalles Bibliográficos
Autores principales: Zaki, Nazar, Bouktif, Salah, Lazarova-Molnar, Sanja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3144211/
https://www.ncbi.nlm.nih.gov/pubmed/21814556
http://dx.doi.org/10.1371/journal.pone.0021821
_version_ 1782208977721884672
author Zaki, Nazar
Bouktif, Salah
Lazarova-Molnar, Sanja
author_facet Zaki, Nazar
Bouktif, Salah
Lazarova-Molnar, Sanja
author_sort Zaki, Nazar
collection PubMed
description Transmembrane helix (TMH) topology prediction is becoming a focal problem in bioinformatics because the structure of TM proteins is difficult to determine using experimental methods. Therefore, methods that can computationally predict the topology of helical membrane proteins are highly desirable. In this paper we introduce TMHindex, a method for detecting TMH segments using only the amino acid sequence information. Each amino acid in a protein sequence is represented by a Compositional Index, which is deduced from a combination of the difference in amino acid occurrences in TMH and non-TMH segments in training protein sequences and the amino acid composition information. Furthermore, a genetic algorithm was employed to find the optimal threshold value for the separation of TMH segments from non-TMH segments. The method successfully predicted 376 out of the 378 TMH segments in a dataset consisting of 70 test protein sequences. The sensitivity and specificity for classifying each amino acid in every protein sequence in the dataset was 0.901 and 0.865, respectively. To assess the generality of TMHindex, we also tested the approach on another standard 73-protein 3D helix dataset. TMHindex correctly predicted 91.8% of proteins based on TM segments. The level of the accuracy achieved using TMHindex in comparison to other recent approaches for predicting the topology of TM proteins is a strong argument in favor of our proposed method. Availability: The datasets, software together with supplementary materials are available at: http://faculty.uaeu.ac.ae/nzaki/TMHindex.htm.
format Online
Article
Text
id pubmed-3144211
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31442112011-08-03 A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments Zaki, Nazar Bouktif, Salah Lazarova-Molnar, Sanja PLoS One Research Article Transmembrane helix (TMH) topology prediction is becoming a focal problem in bioinformatics because the structure of TM proteins is difficult to determine using experimental methods. Therefore, methods that can computationally predict the topology of helical membrane proteins are highly desirable. In this paper we introduce TMHindex, a method for detecting TMH segments using only the amino acid sequence information. Each amino acid in a protein sequence is represented by a Compositional Index, which is deduced from a combination of the difference in amino acid occurrences in TMH and non-TMH segments in training protein sequences and the amino acid composition information. Furthermore, a genetic algorithm was employed to find the optimal threshold value for the separation of TMH segments from non-TMH segments. The method successfully predicted 376 out of the 378 TMH segments in a dataset consisting of 70 test protein sequences. The sensitivity and specificity for classifying each amino acid in every protein sequence in the dataset was 0.901 and 0.865, respectively. To assess the generality of TMHindex, we also tested the approach on another standard 73-protein 3D helix dataset. TMHindex correctly predicted 91.8% of proteins based on TM segments. The level of the accuracy achieved using TMHindex in comparison to other recent approaches for predicting the topology of TM proteins is a strong argument in favor of our proposed method. Availability: The datasets, software together with supplementary materials are available at: http://faculty.uaeu.ac.ae/nzaki/TMHindex.htm. Public Library of Science 2011-07-26 /pmc/articles/PMC3144211/ /pubmed/21814556 http://dx.doi.org/10.1371/journal.pone.0021821 Text en Zaki et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zaki, Nazar
Bouktif, Salah
Lazarova-Molnar, Sanja
A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments
title A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments
title_full A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments
title_fullStr A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments
title_full_unstemmed A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments
title_short A Combination of Compositional Index and Genetic Algorithm for Predicting Transmembrane Helical Segments
title_sort combination of compositional index and genetic algorithm for predicting transmembrane helical segments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3144211/
https://www.ncbi.nlm.nih.gov/pubmed/21814556
http://dx.doi.org/10.1371/journal.pone.0021821
work_keys_str_mv AT zakinazar acombinationofcompositionalindexandgeneticalgorithmforpredictingtransmembranehelicalsegments
AT bouktifsalah acombinationofcompositionalindexandgeneticalgorithmforpredictingtransmembranehelicalsegments
AT lazarovamolnarsanja acombinationofcompositionalindexandgeneticalgorithmforpredictingtransmembranehelicalsegments
AT zakinazar combinationofcompositionalindexandgeneticalgorithmforpredictingtransmembranehelicalsegments
AT bouktifsalah combinationofcompositionalindexandgeneticalgorithmforpredictingtransmembranehelicalsegments
AT lazarovamolnarsanja combinationofcompositionalindexandgeneticalgorithmforpredictingtransmembranehelicalsegments