Cargando…

XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences

BACKGROUND: Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed. RESULTS: To addres...

Descripción completa

Detalles Bibliográficos
Autores principales: Newman, Aaron M, Cooper, James B
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2233649/
https://www.ncbi.nlm.nih.gov/pubmed/17931424
http://dx.doi.org/10.1186/1471-2105-8-382
_version_ 1782150275115515904
author Newman, Aaron M
Cooper, James B
author_facet Newman, Aaron M
Cooper, James B
author_sort Newman, Aaron M
collection PubMed
description BACKGROUND: Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed. RESULTS: To address limitations of current repeat identification methods, and to provide an efficient and flexible algorithm for the detection and analysis of TRs in protein sequences, we designed and implemented a new computational method called XSTREAM. Running time tests confirm the practicality of XSTREAM for analyses of multi-genome datasets. Each of the key capabilities of XSTREAM (e.g., merging, nesting, long-period detection, and TR architecture modeling) are demonstrated using anecdotal examples, and the utility of XSTREAM for identifying TR proteins was validated using data from a recently published paper. CONCLUSION: We show that XSTREAM is a practical and valuable tool for TR detection in protein and nucleotide sequences at the multi-genome scale, and an effective tool for modeling TR domains with diverse architectures and varied levels of degeneracy. Because of these useful features, XSTREAM has significant potential for the discovery of naturally-evolved modular proteins with applications for engineering novel biostructural and biomimetic materials, and identifying new vaccine and diagnostic targets.
format Text
id pubmed-2233649
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22336492008-02-07 XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences Newman, Aaron M Cooper, James B BMC Bioinformatics Software BACKGROUND: Biological sequence repeats arranged in tandem patterns are widespread in DNA and proteins. While many software tools have been designed to detect DNA tandem repeats (TRs), useful algorithms for identifying protein TRs with varied levels of degeneracy are still needed. RESULTS: To address limitations of current repeat identification methods, and to provide an efficient and flexible algorithm for the detection and analysis of TRs in protein sequences, we designed and implemented a new computational method called XSTREAM. Running time tests confirm the practicality of XSTREAM for analyses of multi-genome datasets. Each of the key capabilities of XSTREAM (e.g., merging, nesting, long-period detection, and TR architecture modeling) are demonstrated using anecdotal examples, and the utility of XSTREAM for identifying TR proteins was validated using data from a recently published paper. CONCLUSION: We show that XSTREAM is a practical and valuable tool for TR detection in protein and nucleotide sequences at the multi-genome scale, and an effective tool for modeling TR domains with diverse architectures and varied levels of degeneracy. Because of these useful features, XSTREAM has significant potential for the discovery of naturally-evolved modular proteins with applications for engineering novel biostructural and biomimetic materials, and identifying new vaccine and diagnostic targets. BioMed Central 2007-10-11 /pmc/articles/PMC2233649/ /pubmed/17931424 http://dx.doi.org/10.1186/1471-2105-8-382 Text en Copyright © 2007 Newman and Cooper; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Newman, Aaron M
Cooper, James B
XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_full XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_fullStr XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_full_unstemmed XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_short XSTREAM: A practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
title_sort xstream: a practical algorithm for identification and architecture modeling of tandem repeats in protein sequences
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2233649/
https://www.ncbi.nlm.nih.gov/pubmed/17931424
http://dx.doi.org/10.1186/1471-2105-8-382
work_keys_str_mv AT newmanaaronm xstreamapracticalalgorithmforidentificationandarchitecturemodelingoftandemrepeatsinproteinsequences
AT cooperjamesb xstreamapracticalalgorithmforidentificationandarchitecturemodelingoftandemrepeatsinproteinsequences