Cargando…
Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence
BACKGROUND: Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider thei...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558188/ https://www.ncbi.nlm.nih.gov/pubmed/28812538 http://dx.doi.org/10.1186/s12864-017-3914-0 |
_version_ | 1783257352174043136 |
---|---|
author | de la Campa, Elena Álvarez Padilla, Natàlia de la Cruz, Xavier |
author_facet | de la Campa, Elena Álvarez Padilla, Natàlia de la Cruz, Xavier |
author_sort | de la Campa, Elena Álvarez |
collection | PubMed |
description | BACKGROUND: Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider their results as supporting evidence in medical decision processes. Application of this simple rule certainly decreases the error rate of in silico pathogenicity assignments. However, when predictors disagree this rule results in the rejection of potentially valuable information for a number of variants. In this work, we focus on these variants of the protein sequence and develop specific predictors to help improve the success rate of their annotation. RESULTS: We have used a set of 59,442 protein sequence variants (15,723 pathological and 43,719 neutral) from 228 proteins to identify those cases for which pathogenicity predictors disagree. We have repeated this process for all the possible combinations of five known methods (SIFT, PolyPhen-2, PON-P2, CADD and MutationTaster2). For each resulting subset we have trained a specific pathogenicity predictor. We find that these specific predictors are able to discriminate between neutral and pathogenic variants, with a success rate different from random. They tend to outperform the constitutive methods but this trend decreases as the performance of the constitutive predictor improves (e.g. with PON-P2 and PolyPhen-2). We also find that specific methods outperform standard consensus methods (Condel and CAROL). CONCLUSION: Focusing development efforts on the case of variants for which known methods disagree we may obtain pathogenicity predictors with improved performances. Although we have not yet reached the success rate that allows the use of this computational evidence in a clinical setting, the simplicity of the approach indicates that more advanced methods may reach this goal in a close future. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3914-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5558188 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-55581882017-08-16 Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence de la Campa, Elena Álvarez Padilla, Natàlia de la Cruz, Xavier BMC Genomics Research BACKGROUND: Strict guidelines delimit the use of computational information in the clinical setting, due to the still moderate accuracy of in silico tools. These guidelines indicate that several tools should always be used and that full coincidence between them is required if we want to consider their results as supporting evidence in medical decision processes. Application of this simple rule certainly decreases the error rate of in silico pathogenicity assignments. However, when predictors disagree this rule results in the rejection of potentially valuable information for a number of variants. In this work, we focus on these variants of the protein sequence and develop specific predictors to help improve the success rate of their annotation. RESULTS: We have used a set of 59,442 protein sequence variants (15,723 pathological and 43,719 neutral) from 228 proteins to identify those cases for which pathogenicity predictors disagree. We have repeated this process for all the possible combinations of five known methods (SIFT, PolyPhen-2, PON-P2, CADD and MutationTaster2). For each resulting subset we have trained a specific pathogenicity predictor. We find that these specific predictors are able to discriminate between neutral and pathogenic variants, with a success rate different from random. They tend to outperform the constitutive methods but this trend decreases as the performance of the constitutive predictor improves (e.g. with PON-P2 and PolyPhen-2). We also find that specific methods outperform standard consensus methods (Condel and CAROL). CONCLUSION: Focusing development efforts on the case of variants for which known methods disagree we may obtain pathogenicity predictors with improved performances. Although we have not yet reached the success rate that allows the use of this computational evidence in a clinical setting, the simplicity of the approach indicates that more advanced methods may reach this goal in a close future. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3914-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-11 /pmc/articles/PMC5558188/ /pubmed/28812538 http://dx.doi.org/10.1186/s12864-017-3914-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research de la Campa, Elena Álvarez Padilla, Natàlia de la Cruz, Xavier Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
title | Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
title_full | Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
title_fullStr | Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
title_full_unstemmed | Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
title_short | Development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
title_sort | development of pathogenicity predictors specific for variants that do not comply with clinical guidelines for the use of computational evidence |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5558188/ https://www.ncbi.nlm.nih.gov/pubmed/28812538 http://dx.doi.org/10.1186/s12864-017-3914-0 |
work_keys_str_mv | AT delacampaelenaalvarez developmentofpathogenicitypredictorsspecificforvariantsthatdonotcomplywithclinicalguidelinesfortheuseofcomputationalevidence AT padillanatalia developmentofpathogenicitypredictorsspecificforvariantsthatdonotcomplywithclinicalguidelinesfortheuseofcomputationalevidence AT delacruzxavier developmentofpathogenicitypredictorsspecificforvariantsthatdonotcomplywithclinicalguidelinesfortheuseofcomputationalevidence |