Cargando…

Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence

BACKGROUND: Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider thei...

Descripción completa

Detalles Bibliográficos
Autores principales: de la Campa, Elena Álvarez, Padilla, Natàlia, de la Cruz, Xavier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558188/
https://www.ncbi.nlm.nih.gov/pubmed/28812538
http://dx.doi.org/10.1186/s12864-017-3914-0
_version_ 1783257352174043136
author de la Campa, Elena Álvarez
Padilla, Natàlia
de la Cruz, Xavier
author_facet de la Campa, Elena Álvarez
Padilla, Natàlia
de la Cruz, Xavier
author_sort de la Campa, Elena Álvarez
collection PubMed
description BACKGROUND: Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider their results as supporting evidence in medical decision processes. Application of this simple rule certainly decreases the error rate of in silico pathogenicity assignments. However, when predictors disagree this rule results in the rejection of potentially valuable information for a number of variants. In this work, we focus on these variants of the protein sequence and develop specific predictors to help improve the success rate of their annotation. RESULTS: We have used a set of 59,442 protein sequence variants (15,723 pathological and 43,719 neutral) from 228 proteins to identify those cases for which pathogenicity predictors disagree. We have repeated this process for all the possible combinations of five known methods (SIFT, PolyPhen-2, PON-P2, CADD and MutationTaster2). For each resulting subset we have trained a specific pathogenicity predictor. We find that these specific predictors are able to discriminate between neutral and pathogenic variants, with a success rate different from random. They tend to outperform the constitutive methods but this trend decreases as the performance of the constitutive predictor improves (e.g. with PON-P2 and PolyPhen-2). We also find that specific methods outperform standard consensus methods (Condel and CAROL). CONCLUSION: Focusing development efforts on the case of variants for which known methods disagree we may obtain pathogenicity predictors with improved performances. Although we have not yet reached the success rate that allows the use of this computational evidence in a clinical setting, the simplicity of the approach indicates that more advanced methods may reach this goal in a close future. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3914-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5558188
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55581882017-08-16 Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence de la Campa, Elena Álvarez Padilla, Natàlia de la Cruz, Xavier BMC Genomics Research BACKGROUND: Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider their results as supporting evidence in medical decision processes. Application of this simple rule certainly decreases the error rate of in silico pathogenicity assignments. However, when predictors disagree this rule results in the rejection of potentially valuable information for a number of variants. In this work, we focus on these variants of the protein sequence and develop specific predictors to help improve the success rate of their annotation. RESULTS: We have used a set of 59,442 protein sequence variants (15,723 pathological and 43,719 neutral) from 228 proteins to identify those cases for which pathogenicity predictors disagree. We have repeated this process for all the possible combinations of five known methods (SIFT, PolyPhen-2, PON-P2, CADD and MutationTaster2). For each resulting subset we have trained a specific pathogenicity predictor. We find that these specific predictors are able to discriminate between neutral and pathogenic variants, with a success rate different from random. They tend to outperform the constitutive methods but this trend decreases as the performance of the constitutive predictor improves (e.g. with PON-P2 and PolyPhen-2). We also find that specific methods outperform standard consensus methods (Condel and CAROL). CONCLUSION: Focusing development efforts on the case of variants for which known methods disagree we may obtain pathogenicity predictors with improved performances. Although we have not yet reached the success rate that allows the use of this computational evidence in a clinical setting, the simplicity of the approach indicates that more advanced methods may reach this goal in a close future. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3914-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-11 /pmc/articles/PMC5558188/ /pubmed/28812538 http://dx.doi.org/10.1186/s12864-017-3914-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
de la Campa, Elena Álvarez
Padilla, Natàlia
de la Cruz, Xavier
Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
title Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
title_full Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
title_fullStr Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
title_full_unstemmed Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
title_short Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
title_sort development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558188/
https://www.ncbi.nlm.nih.gov/pubmed/28812538
http://dx.doi.org/10.1186/s12864-017-3914-0
work_keys_str_mv AT delacampaelenaalvarez developmentofpathogenicitypredictorsspecificforvariantsthatdonotcomplywithclinicalguidelinesfortheuseofcomputationalevidence
AT padillanatalia developmentofpathogenicitypredictorsspecificforvariantsthatdonotcomplywithclinicalguidelinesfortheuseofcomputationalevidence
AT delacruzxavier developmentofpathogenicitypredictorsspecificforvariantsthatdonotcomplywithclinicalguidelinesfortheuseofcomputationalevidence