Cargando…
Structural homology guided alignment of cysteine rich proteins
BACKGROUND: Cysteine rich protein families are notoriously difficult to align due to low sequence identity and frequent insertions and deletions. RESULTS: Here we present an alignment method that ensures homologous cysteines align by assigning a unique 10 amino acid barcode to those identified as st...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4709342/ https://www.ncbi.nlm.nih.gov/pubmed/26788439 http://dx.doi.org/10.1186/s40064-015-1609-z |
_version_ | 1782409622975414272 |
---|---|
author | Shafee, Thomas M. A. Robinson, Andrew J. van der Weerden, Nicole Anderson, Marilyn A. |
author_facet | Shafee, Thomas M. A. Robinson, Andrew J. van der Weerden, Nicole Anderson, Marilyn A. |
author_sort | Shafee, Thomas M. A. |
collection | PubMed |
description | BACKGROUND: Cysteine rich protein families are notoriously difficult to align due to low sequence identity and frequent insertions and deletions. RESULTS: Here we present an alignment method that ensures homologous cysteines align by assigning a unique 10 amino acid barcode to those identified as structurally homologous by the DALI webserver. The free inter-cysteine regions of the barcoded sequences can then be aligned using any standard algorithm. Finally the barcodes are replaced with the original columns to yield an alignment which requires the minimum of manual refinement. CONCLUSIONS: Using structural homology information to constrain sequence alignments allows the alignment of highly divergent, repetitive sequences that are poorly dealt with by existing algorithms. Tools are provided to perform this method online using the CysBar web-tool (http://CysBar.science.latrobe.edu.au) and offline (python script available from http://github.com/ts404/CysBar). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40064-015-1609-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4709342 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-47093422016-01-19 Structural homology guided alignment of cysteine rich proteins Shafee, Thomas M. A. Robinson, Andrew J. van der Weerden, Nicole Anderson, Marilyn A. Springerplus Software BACKGROUND: Cysteine rich protein families are notoriously difficult to align due to low sequence identity and frequent insertions and deletions. RESULTS: Here we present an alignment method that ensures homologous cysteines align by assigning a unique 10 amino acid barcode to those identified as structurally homologous by the DALI webserver. The free inter-cysteine regions of the barcoded sequences can then be aligned using any standard algorithm. Finally the barcodes are replaced with the original columns to yield an alignment which requires the minimum of manual refinement. CONCLUSIONS: Using structural homology information to constrain sequence alignments allows the alignment of highly divergent, repetitive sequences that are poorly dealt with by existing algorithms. Tools are provided to perform this method online using the CysBar web-tool (http://CysBar.science.latrobe.edu.au) and offline (python script available from http://github.com/ts404/CysBar). ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s40064-015-1609-z) contains supplementary material, which is available to authorized users. Springer International Publishing 2016-01-12 /pmc/articles/PMC4709342/ /pubmed/26788439 http://dx.doi.org/10.1186/s40064-015-1609-z Text en © Shafee et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Software Shafee, Thomas M. A. Robinson, Andrew J. van der Weerden, Nicole Anderson, Marilyn A. Structural homology guided alignment of cysteine rich proteins |
title | Structural homology guided alignment of cysteine rich proteins |
title_full | Structural homology guided alignment of cysteine rich proteins |
title_fullStr | Structural homology guided alignment of cysteine rich proteins |
title_full_unstemmed | Structural homology guided alignment of cysteine rich proteins |
title_short | Structural homology guided alignment of cysteine rich proteins |
title_sort | structural homology guided alignment of cysteine rich proteins |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4709342/ https://www.ncbi.nlm.nih.gov/pubmed/26788439 http://dx.doi.org/10.1186/s40064-015-1609-z |
work_keys_str_mv | AT shafeethomasma structuralhomologyguidedalignmentofcysteinerichproteins AT robinsonandrewj structuralhomologyguidedalignmentofcysteinerichproteins AT vanderweerdennicole structuralhomologyguidedalignmentofcysteinerichproteins AT andersonmarilyna structuralhomologyguidedalignmentofcysteinerichproteins |