Cargando…

Accurate contact predictions using covariation techniques and machine learning

Here we present the results of residue–residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effective sequences, our server achieved an average top‐L/...

Descripción completa

Detalles Bibliográficos
Autores principales: Kosciolek, Tomasz, Jones, David T.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5042084/
https://www.ncbi.nlm.nih.gov/pubmed/26205532
http://dx.doi.org/10.1002/prot.24863
_version_ 1782456545668235264
author Kosciolek, Tomasz
Jones, David T.
author_facet Kosciolek, Tomasz
Jones, David T.
author_sort Kosciolek, Tomasz
collection PubMed
description Here we present the results of residue–residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effective sequences, our server achieved an average top‐L/5 long‐range contact precision of 27%. MetaPSICOV method bases on a combination of classical contact prediction features, enhanced with three distinct covariation methods embedded in a two‐stage neural network predictor. Some unique features of our approach are (1) the tuning between the classical and covariation features depending on the depth of the input alignment and (2) a hybrid approach to generate deepest possible multiple‐sequence alignments by combining jackHMMer and HHblits. We discuss the CONSIP2 pipeline, our results and show that where the method underperformed, the major factor was relying on a fixed set of parameters for the initial sequence alignments and not attempting to perform domain splitting as a preprocessing step. Proteins 2016; 84(Suppl 1):145–151. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc.
format Online
Article
Text
id pubmed-5042084
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-50420842016-10-03 Accurate contact predictions using covariation techniques and machine learning Kosciolek, Tomasz Jones, David T. Proteins Articles Here we present the results of residue–residue contact predictions achieved in CASP11 by the CONSIP2 server, which is based around our MetaPSICOV contact prediction method. On a set of 40 target domains with a median family size of around 40 effective sequences, our server achieved an average top‐L/5 long‐range contact precision of 27%. MetaPSICOV method bases on a combination of classical contact prediction features, enhanced with three distinct covariation methods embedded in a two‐stage neural network predictor. Some unique features of our approach are (1) the tuning between the classical and covariation features depending on the depth of the input alignment and (2) a hybrid approach to generate deepest possible multiple‐sequence alignments by combining jackHMMer and HHblits. We discuss the CONSIP2 pipeline, our results and show that where the method underperformed, the major factor was relying on a fixed set of parameters for the initial sequence alignments and not attempting to perform domain splitting as a preprocessing step. Proteins 2016; 84(Suppl 1):145–151. © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. John Wiley and Sons Inc. 2015-08-14 2016-09 /pmc/articles/PMC5042084/ /pubmed/26205532 http://dx.doi.org/10.1002/prot.24863 Text en © 2015 The Authors. Proteins: Structure, Function, and Bioinformatics Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Kosciolek, Tomasz
Jones, David T.
Accurate contact predictions using covariation techniques and machine learning
title Accurate contact predictions using covariation techniques and machine learning
title_full Accurate contact predictions using covariation techniques and machine learning
title_fullStr Accurate contact predictions using covariation techniques and machine learning
title_full_unstemmed Accurate contact predictions using covariation techniques and machine learning
title_short Accurate contact predictions using covariation techniques and machine learning
title_sort accurate contact predictions using covariation techniques and machine learning
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5042084/
https://www.ncbi.nlm.nih.gov/pubmed/26205532
http://dx.doi.org/10.1002/prot.24863
work_keys_str_mv AT kosciolektomasz accuratecontactpredictionsusingcovariationtechniquesandmachinelearning
AT jonesdavidt accuratecontactpredictionsusingcovariationtechniquesandmachinelearning