Cargando…

Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone

The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequen...

Descripción completa

Detalles Bibliográficos
Autores principales: Trigg, Jason, Gutwin, Karl, Keating, Amy E., Berger, Bonnie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3162000/
https://www.ncbi.nlm.nih.gov/pubmed/21901122
http://dx.doi.org/10.1371/journal.pone.0023519
_version_ 1782210775823155200
author Trigg, Jason
Gutwin, Karl
Keating, Amy E.
Berger, Bonnie
author_facet Trigg, Jason
Gutwin, Karl
Keating, Amy E.
Berger, Bonnie
author_sort Trigg, Jason
collection PubMed
description The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequences. Multicoil2 combines the pairwise correlations of the previous Multicoil method with the flexibility of Hidden Markov Models (HMMs) in a Markov Random Field (MRF). The resulting algorithm integrates sequence features, including pairwise interactions, through multinomial logistic regression to devise an optimized scoring function for distinguishing dimer, trimer and non-coiled-coil oligomerization states; this scoring function is used to produce Markov Random Field potentials that incorporate pairwise correlations localized in sequence. Multicoil2 significantly improves both coiled-coil detection and dimer versus trimer state prediction over the original Multicoil algorithm retrained on a newly-constructed database of coiled-coil sequences. The new database, comprised of 2,105 sequences containing 124,088 residues, includes reliable structural annotations based on experimental data in the literature. Notably, the enhanced performance of Multicoil2 is evident when tested in stringent leave-family-out cross-validation on the new database, reflecting expected performance on challenging new prediction targets that have minimal sequence similarity to known coiled-coil families. The Multicoil2 program and training database are available for download from http://multicoil2.csail.mit.edu.
format Online
Article
Text
id pubmed-3162000
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31620002011-09-07 Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone Trigg, Jason Gutwin, Karl Keating, Amy E. Berger, Bonnie PLoS One Research Article The alpha-helical coiled coil can adopt a variety of topologies, among the most common of which are parallel and antiparallel dimers and trimers. We present Multicoil2, an algorithm that predicts both the location and oligomerization state (two versus three helices) of coiled coils in protein sequences. Multicoil2 combines the pairwise correlations of the previous Multicoil method with the flexibility of Hidden Markov Models (HMMs) in a Markov Random Field (MRF). The resulting algorithm integrates sequence features, including pairwise interactions, through multinomial logistic regression to devise an optimized scoring function for distinguishing dimer, trimer and non-coiled-coil oligomerization states; this scoring function is used to produce Markov Random Field potentials that incorporate pairwise correlations localized in sequence. Multicoil2 significantly improves both coiled-coil detection and dimer versus trimer state prediction over the original Multicoil algorithm retrained on a newly-constructed database of coiled-coil sequences. The new database, comprised of 2,105 sequences containing 124,088 residues, includes reliable structural annotations based on experimental data in the literature. Notably, the enhanced performance of Multicoil2 is evident when tested in stringent leave-family-out cross-validation on the new database, reflecting expected performance on challenging new prediction targets that have minimal sequence similarity to known coiled-coil families. The Multicoil2 program and training database are available for download from http://multicoil2.csail.mit.edu. Public Library of Science 2011-08-25 /pmc/articles/PMC3162000/ /pubmed/21901122 http://dx.doi.org/10.1371/journal.pone.0023519 Text en Trigg et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Trigg, Jason
Gutwin, Karl
Keating, Amy E.
Berger, Bonnie
Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone
title Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone
title_full Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone
title_fullStr Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone
title_full_unstemmed Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone
title_short Multicoil2: Predicting Coiled Coils and Their Oligomerization States from Sequence in the Twilight Zone
title_sort multicoil2: predicting coiled coils and their oligomerization states from sequence in the twilight zone
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3162000/
https://www.ncbi.nlm.nih.gov/pubmed/21901122
http://dx.doi.org/10.1371/journal.pone.0023519
work_keys_str_mv AT triggjason multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone
AT gutwinkarl multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone
AT keatingamye multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone
AT bergerbonnie multicoil2predictingcoiledcoilsandtheiroligomerizationstatesfromsequenceinthetwilightzone