Cargando…
Relationship between insertion/deletion (indel) frequency of proteins and essentiality
BACKGROUND: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein an...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1925122/ https://www.ncbi.nlm.nih.gov/pubmed/17598914 http://dx.doi.org/10.1186/1471-2105-8-227 |
_version_ | 1782134241030569984 |
---|---|
author | Chan, Simon K Hsing, Michael Hormozdiari, Fereydoun Cherkasov, Artem |
author_facet | Chan, Simon K Hsing, Michael Hormozdiari, Fereydoun Cherkasov, Artem |
author_sort | Chan, Simon K |
collection | PubMed |
description | BACKGROUND: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein and human proteins to allow for selective targeting. In one example, an indel difference was targeted via large scale in-silico screening. This resulted in selective antibodies and small compounds which were capable of binding to the deletion-bearing essential pathogen protein without any cross-reactivity to the highly similar human protein. The objective of the current study was to investigate whether indels were found more frequently in essential than non-essential proteins. RESULTS: We have investigated three species, Bacillus subtilis, Escherichia coli, and Saccharomyces cerevisiae, for which high-quality protein essentiality data is available. Using these data, we demonstrated with t-test calculations that the mean indel frequencies in essential proteins were greater than that of non-essential proteins in the three proteomes. The abundance of indels in both types of proteins was also shown to be accurately modeled by the Weibull distribution. However, Receiver Operator Characteristic (ROC) curves showed that indel frequencies alone could not be used as a marker to accurately discriminate between essential and non-essential proteins in the three proteomes. Finally, we analyzed the protein interaction data available for S. cerevisiae and observed that indel-bearing proteins were involved in more interactions and had greater betweenness values within Protein Interaction Networks (PINs). CONCLUSION: Overall, our findings demonstrated that indels were not randomly distributed across the studied proteomes and were likely to occur more often in essential proteins and those that were highly connected, indicating a possible role of sequence insertions and deletions in the regulation and modification of protein-protein interactions. Such observations will provide new insights into indel-based drug design using bioinformatics and cheminformatics tools. |
format | Text |
id | pubmed-1925122 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-19251222007-07-20 Relationship between insertion/deletion (indel) frequency of proteins and essentiality Chan, Simon K Hsing, Michael Hormozdiari, Fereydoun Cherkasov, Artem BMC Bioinformatics Research Article BACKGROUND: In a previous study, we demonstrated that some essential proteins from pathogenic organisms contained sizable insertions/deletions (indels) when aligned to human proteins of high sequence similarity. Such indels may provide sufficient spatial differences between the pathogenic protein and human proteins to allow for selective targeting. In one example, an indel difference was targeted via large scale in-silico screening. This resulted in selective antibodies and small compounds which were capable of binding to the deletion-bearing essential pathogen protein without any cross-reactivity to the highly similar human protein. The objective of the current study was to investigate whether indels were found more frequently in essential than non-essential proteins. RESULTS: We have investigated three species, Bacillus subtilis, Escherichia coli, and Saccharomyces cerevisiae, for which high-quality protein essentiality data is available. Using these data, we demonstrated with t-test calculations that the mean indel frequencies in essential proteins were greater than that of non-essential proteins in the three proteomes. The abundance of indels in both types of proteins was also shown to be accurately modeled by the Weibull distribution. However, Receiver Operator Characteristic (ROC) curves showed that indel frequencies alone could not be used as a marker to accurately discriminate between essential and non-essential proteins in the three proteomes. Finally, we analyzed the protein interaction data available for S. cerevisiae and observed that indel-bearing proteins were involved in more interactions and had greater betweenness values within Protein Interaction Networks (PINs). CONCLUSION: Overall, our findings demonstrated that indels were not randomly distributed across the studied proteomes and were likely to occur more often in essential proteins and those that were highly connected, indicating a possible role of sequence insertions and deletions in the regulation and modification of protein-protein interactions. Such observations will provide new insights into indel-based drug design using bioinformatics and cheminformatics tools. BioMed Central 2007-06-28 /pmc/articles/PMC1925122/ /pubmed/17598914 http://dx.doi.org/10.1186/1471-2105-8-227 Text en Copyright © 2007 Chan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chan, Simon K Hsing, Michael Hormozdiari, Fereydoun Cherkasov, Artem Relationship between insertion/deletion (indel) frequency of proteins and essentiality |
title | Relationship between insertion/deletion (indel) frequency of proteins and essentiality |
title_full | Relationship between insertion/deletion (indel) frequency of proteins and essentiality |
title_fullStr | Relationship between insertion/deletion (indel) frequency of proteins and essentiality |
title_full_unstemmed | Relationship between insertion/deletion (indel) frequency of proteins and essentiality |
title_short | Relationship between insertion/deletion (indel) frequency of proteins and essentiality |
title_sort | relationship between insertion/deletion (indel) frequency of proteins and essentiality |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1925122/ https://www.ncbi.nlm.nih.gov/pubmed/17598914 http://dx.doi.org/10.1186/1471-2105-8-227 |
work_keys_str_mv | AT chansimonk relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality AT hsingmichael relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality AT hormozdiarifereydoun relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality AT cherkasovartem relationshipbetweeninsertiondeletionindelfrequencyofproteinsandessentiality |