Cargando…

Relationship between insertion/deletion (indel) frequency of proteins and essentiality

BACKGROUND: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein an...

Descripción completa

Detalles Bibliográficos
Autores principales: Chan, Simon K, Hsing, Michael, Hormozdiari, Fereydoun, Cherkasov, Artem
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1925122/
https://www.ncbi.nlm.nih.gov/pubmed/17598914
http://dx.doi.org/10.1186/1471-2105-8-227
_version_ 1782134241030569984
author Chan, Simon K
Hsing, Michael
Hormozdiari, Fereydoun
Cherkasov, Artem
author_facet Chan, Simon K
Hsing, Michael
Hormozdiari, Fereydoun
Cherkasov, Artem
author_sort Chan, Simon K
collection PubMed
description BACKGROUND: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein and human proteins to allow for selective targeting. In one example, an indel difference was targeted via large scale in-silico screening. This resulted in selective antibodies and small compounds which were capable of binding to the deletion-bearing essential pathogen protein without any cross-reactivity to the highly similar human protein. The objective of the current study was to investigate whether indels were found more frequently in essential than non-essential proteins. RESULTS: We have investigated three species, Bacillus subtilis, Escherichia coli, and Saccharomyces cerevisiae, for which high-quality protein essentiality data is available. Using these data, we demonstrated with t-test calculations that the mean indel frequencies in essential proteins were greater than that of non-essential proteins in the three proteomes. The abundance of indels in both types of proteins was also shown to be accurately modeled by the Weibull distribution. However, Receiver Operator Characteristic (ROC) curves showed that indel frequencies alone could not be used as a marker to accurately discriminate between essential and non-essential proteins in the three proteomes. Finally, we analyzed the protein interaction data available for S. cerevisiae and observed that indel-bearing proteins were involved in more interactions and had greater betweenness values within Protein Interaction Networks (PINs). CONCLUSION: Overall, our findings demonstrated that indels were not randomly distributed across the studied proteomes and were likely to occur more often in essential proteins and those that were highly connected, indicating a possible role of sequence insertions and deletions in the regulation and modification of protein-protein interactions. Such observations will provide new insights into indel-based drug design using bioinformatics and cheminformatics tools.
format Text
id pubmed-1925122
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19251222007-07-20 Relationship between insertion/deletion (indel) frequency of proteins and essentiality Chan, Simon K Hsing, Michael Hormozdiari, Fereydoun Cherkasov, Artem BMC Bioinformatics Research Article BACKGROUND: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein and human proteins to allow for selective targeting. In one example, an indel difference was targeted via large scale in-silico screening. This resulted in selective antibodies and small compounds which were capable of binding to the deletion-bearing essential pathogen protein without any cross-reactivity to the highly similar human protein. The objective of the current study was to investigate whether indels were found more frequently in essential than non-essential proteins. RESULTS: We have investigated three species, Bacillus subtilis, Escherichia coli, and Saccharomyces cerevisiae, for which high-quality protein essentiality data is available. Using these data, we demonstrated with t-test calculations that the mean indel frequencies in essential proteins were greater than that of non-essential proteins in the three proteomes. The abundance of indels in both types of proteins was also shown to be accurately modeled by the Weibull distribution. However, Receiver Operator Characteristic (ROC) curves showed that indel frequencies alone could not be used as a marker to accurately discriminate between essential and non-essential proteins in the three proteomes. Finally, we analyzed the protein interaction data available for S. cerevisiae and observed that indel-bearing proteins were involved in more interactions and had greater betweenness values within Protein Interaction Networks (PINs). CONCLUSION: Overall, our findings demonstrated that indels were not randomly distributed across the studied proteomes and were likely to occur more often in essential proteins and those that were highly connected, indicating a possible role of sequence insertions and deletions in the regulation and modification of protein-protein interactions. Such observations will provide new insights into indel-based drug design using bioinformatics and cheminformatics tools. BioMed Central 2007-06-28 /pmc/articles/PMC1925122/ /pubmed/17598914 http://dx.doi.org/10.1186/1471-2105-8-227 Text en Copyright © 2007 Chan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chan, Simon K
Hsing, Michael
Hormozdiari, Fereydoun
Cherkasov, Artem
Relationship between insertion/deletion (indel) frequency of proteins and essentiality
title Relationship between insertion/deletion (indel) frequency of proteins and essentiality
title_full Relationship between insertion/deletion (indel) frequency of proteins and essentiality
title_fullStr Relationship between insertion/deletion (indel) frequency of proteins and essentiality
title_full_unstemmed Relationship between insertion/deletion (indel) frequency of proteins and essentiality
title_short Relationship between insertion/deletion (indel) frequency of proteins and essentiality
title_sort relationship between insertion/deletion (indel) frequency of proteins and essentiality
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1925122/
https://www.ncbi.nlm.nih.gov/pubmed/17598914
http://dx.doi.org/10.1186/1471-2105-8-227
work_keys_str_mv AT chansimonk relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality
AT hsingmichael relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality
AT hormozdiarifereydoun relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality
AT cherkasovartem relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality