Cargando…
Accurate contact predictions using covariation techniques and machine learning
Here we present the results of residue–residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effective sequences, our server achieved an average top‐L/...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5042084/ https://www.ncbi.nlm.nih.gov/pubmed/26205532 http://dx.doi.org/10.1002/prot.24863 |
_version_ | 1782456545668235264 |
---|---|
author | Kosciolek, Tomasz Jones, David T. |
author_facet | Kosciolek, Tomasz Jones, David T. |
author_sort | Kosciolek, Tomasz |
collection | PubMed |
description | Here we present the results of residue–residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effective sequences, our server achieved an average top‐L/5 long‐range contact precision of 27%. MetaPSICOV method bases on a combination of classical contact prediction features, enhanced with three distinct covariation methods embedded in a two‐stage neural network predictor. Some unique features of our approach are (1) the tuning between the classical and covariation features depending on the depth of the input alignment and (2) a hybrid approach to generate deepest possible multiple‐sequence alignments by combining jackHMMer and HHblits. We discuss the CONSIP2 pipeline, our results and show that where the method underperformed, the major factor was relying on a fixed set of parameters for the initial sequence alignments and not attempting to perform domain splitting as a preprocessing step. Proteins 2016; 84(Suppl 1):145–151. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. |
format | Online Article Text |
id | pubmed-5042084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-50420842016-10-03 Accurate contact predictions using covariation techniques and machine learning Kosciolek, Tomasz Jones, David T. Proteins Articles Here we present the results of residue–residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effective sequences, our server achieved an average top‐L/5 long‐range contact precision of 27%. MetaPSICOV method bases on a combination of classical contact prediction features, enhanced with three distinct covariation methods embedded in a two‐stage neural network predictor. Some unique features of our approach are (1) the tuning between the classical and covariation features depending on the depth of the input alignment and (2) a hybrid approach to generate deepest possible multiple‐sequence alignments by combining jackHMMer and HHblits. We discuss the CONSIP2 pipeline, our results and show that where the method underperformed, the major factor was relying on a fixed set of parameters for the initial sequence alignments and not attempting to perform domain splitting as a preprocessing step. Proteins 2016; 84(Suppl 1):145–151. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. John Wiley and Sons Inc. 2015-08-14 2016-09 /pmc/articles/PMC5042084/ /pubmed/26205532 http://dx.doi.org/10.1002/prot.24863 Text en © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Kosciolek, Tomasz Jones, David T. Accurate contact predictions using covariation techniques and machine learning |
title | Accurate contact predictions using covariation techniques and machine learning |
title_full | Accurate contact predictions using covariation techniques and machine learning |
title_fullStr | Accurate contact predictions using covariation techniques and machine learning |
title_full_unstemmed | Accurate contact predictions using covariation techniques and machine learning |
title_short | Accurate contact predictions using covariation techniques and machine learning |
title_sort | accurate contact predictions using covariation techniques and machine learning |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5042084/ https://www.ncbi.nlm.nih.gov/pubmed/26205532 http://dx.doi.org/10.1002/prot.24863 |
work_keys_str_mv | AT kosciolektomasz accuratecontactpredictionsusingcovariationtechniquesandmachinelearning AT jonesdavidt accuratecontactpredictionsusingcovariationtechniquesandmachinelearning |