Cargando…

Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee

BACKGROUND: Transmembrane proteins (TMPs) constitute about 20~30% of all protein coding genes. The relative lack of experimental structure has so far made it hard to develop specific alignment methods and the current state of the art (PRALINE™) only manages to recapitulate 50% of the positions in th...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Jia-Ming, Di Tommaso, Paolo, Taly, Jean-François, Notredame, Cedric
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3303701/
https://www.ncbi.nlm.nih.gov/pubmed/22536955
http://dx.doi.org/10.1186/1471-2105-13-S4-S1
_version_ 1782226773539291136
author Chang, Jia-Ming
Di Tommaso, Paolo
Taly, Jean-François
Notredame, Cedric
author_facet Chang, Jia-Ming
Di Tommaso, Paolo
Taly, Jean-François
Notredame, Cedric
author_sort Chang, Jia-Ming
collection PubMed
description BACKGROUND: Transmembrane proteins (TMPs) constitute about 20~30% of all protein coding genes. The relative lack of experimental structure has so far made it hard to develop specific alignment methods and the current state of the art (PRALINE™) only manages to recapitulate 50% of the positions in the reference alignments available from the BAliBASE2-ref7. METHODS: We show how homology extension can be adapted and combined with a consistency based approach in order to significantly improve the multiple sequence alignment of alpha-helical TMPs. TM-Coffee is a special mode of PSI-Coffee able to efficiently align TMPs, while using a reduced reference database for homology extension. RESULTS: Our benchmarking on BAliBASE2-ref7 alpha-helical TMPs shows a significant improvement over the most accurate methods such as MSAProbs, Kalign, PROMALS, MAFFT, ProbCons and PRALINE™. We also estimated the influence of the database used for homology extension and show that highly non-redundant UniRef databases can be used to obtain similar results at a significantly reduced computational cost over full protein databases. TM-Coffee is part of the T-Coffee package, a web server is also available from http://tcoffee.crg.cat/tmcoffee and a freeware open source code can be downloaded from http://www.tcoffee.org/Packages/Stable/Latest.
format Online
Article
Text
id pubmed-3303701
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33037012012-03-15 Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee Chang, Jia-Ming Di Tommaso, Paolo Taly, Jean-François Notredame, Cedric BMC Bioinformatics Research BACKGROUND: Transmembrane proteins (TMPs) constitute about 20~30% of all protein coding genes. The relative lack of experimental structure has so far made it hard to develop specific alignment methods and the current state of the art (PRALINE™) only manages to recapitulate 50% of the positions in the reference alignments available from the BAliBASE2-ref7. METHODS: We show how homology extension can be adapted and combined with a consistency based approach in order to significantly improve the multiple sequence alignment of alpha-helical TMPs. TM-Coffee is a special mode of PSI-Coffee able to efficiently align TMPs, while using a reduced reference database for homology extension. RESULTS: Our benchmarking on BAliBASE2-ref7 alpha-helical TMPs shows a significant improvement over the most accurate methods such as MSAProbs, Kalign, PROMALS, MAFFT, ProbCons and PRALINE™. We also estimated the influence of the database used for homology extension and show that highly non-redundant UniRef databases can be used to obtain similar results at a significantly reduced computational cost over full protein databases. TM-Coffee is part of the T-Coffee package, a web server is also available from http://tcoffee.crg.cat/tmcoffee and a freeware open source code can be downloaded from http://www.tcoffee.org/Packages/Stable/Latest. BioMed Central 2012-03-28 /pmc/articles/PMC3303701/ /pubmed/22536955 http://dx.doi.org/10.1186/1471-2105-13-S4-S1 Text en Copyright ©2012 Chang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Chang, Jia-Ming
Di Tommaso, Paolo
Taly, Jean-François
Notredame, Cedric
Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
title Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
title_full Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
title_fullStr Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
title_full_unstemmed Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
title_short Accurate multiple sequence alignment of transmembrane proteins with PSI-Coffee
title_sort accurate multiple sequence alignment of transmembrane proteins with psi-coffee
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3303701/
https://www.ncbi.nlm.nih.gov/pubmed/22536955
http://dx.doi.org/10.1186/1471-2105-13-S4-S1
work_keys_str_mv AT changjiaming accuratemultiplesequencealignmentoftransmembraneproteinswithpsicoffee
AT ditommasopaolo accuratemultiplesequencealignmentoftransmembraneproteinswithpsicoffee
AT talyjeanfrancois accuratemultiplesequencealignmentoftransmembraneproteinswithpsicoffee
AT notredamecedric accuratemultiplesequencealignmentoftransmembraneproteinswithpsicoffee