Cargando…

An approach to large scale identification of non-obvious structural similarities between proteins

BACKGROUND: A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between protei...

Descripción completa

Detalles Bibliográficos
Autores principales: Cherkasov, Artem, Jones, Steven JM
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2004
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC434491/
https://www.ncbi.nlm.nih.gov/pubmed/15147578
http://dx.doi.org/10.1186/1471-2105-5-61
_version_ 1782121516283985920
author Cherkasov, Artem
Jones, Steven JM
author_facet Cherkasov, Artem
Jones, Steven JM
author_sort Cherkasov, Artem
collection PubMed
description BACKGROUND: A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. RESULTS: The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. CONCLUSION: Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence.
format Text
id pubmed-434491
institution National Center for Biotechnology Information
language English
publishDate 2004
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4344912004-06-25 An approach to large scale identification of non-obvious structural similarities between proteins Cherkasov, Artem Jones, Steven JM BMC Bioinformatics Research Article BACKGROUND: A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. RESULTS: The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. CONCLUSION: Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence. BioMed Central 2004-05-17 /pmc/articles/PMC434491/ /pubmed/15147578 http://dx.doi.org/10.1186/1471-2105-5-61 Text en Copyright © 2004 Cherkasov and Jones; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research Article
Cherkasov, Artem
Jones, Steven JM
An approach to large scale identification of non-obvious structural similarities between proteins
title An approach to large scale identification of non-obvious structural similarities between proteins
title_full An approach to large scale identification of non-obvious structural similarities between proteins
title_fullStr An approach to large scale identification of non-obvious structural similarities between proteins
title_full_unstemmed An approach to large scale identification of non-obvious structural similarities between proteins
title_short An approach to large scale identification of non-obvious structural similarities between proteins
title_sort approach to large scale identification of non-obvious structural similarities between proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC434491/
https://www.ncbi.nlm.nih.gov/pubmed/15147578
http://dx.doi.org/10.1186/1471-2105-5-61
work_keys_str_mv AT cherkasovartem anapproachtolargescaleidentificationofnonobviousstructuralsimilaritiesbetweenproteins
AT jonesstevenjm anapproachtolargescaleidentificationofnonobviousstructuralsimilaritiesbetweenproteins
AT cherkasovartem approachtolargescaleidentificationofnonobviousstructuralsimilaritiesbetweenproteins
AT jonesstevenjm approachtolargescaleidentificationofnonobviousstructuralsimilaritiesbetweenproteins