Cargando…

Problems in variation interpretation guidelines and in their implementation in computational tools

BACKGROUND: ACMG/AMP and AMP/ASCO/CAP have released guidelines for variation interpretation, and ESHG for diagnostic sequencing. These guidelines contain recommendations including the use of computational prediction methods. The guidelines per se and the way they are implemented cause some problems....

Descripción completa

Detalles Bibliográficos
Autor principal:	Vihinen, Mauno
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley and Sons Inc. 2020
Materias:	Original Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507483/ https://www.ncbi.nlm.nih.gov/pubmed/32160417 http://dx.doi.org/10.1002/mgg3.1206

_version_	1783585236710326272
author	Vihinen, Mauno
author_facet	Vihinen, Mauno
author_sort	Vihinen, Mauno
collection	PubMed
description	BACKGROUND: ACMG/AMP and AMP/ASCO/CAP have released guidelines for variation interpretation, and ESHG for diagnostic sequencing. These guidelines contain recommendations including the use of computational prediction methods. The guidelines per se and the way they are implemented cause some problems. METHODS: Logical reasoning based on domain knowledge. RESULTS: According to the guidelines, several methods have to be used and they have to agree. This means that the methods with the poorest performance overrule the better ones. The choice of the prediction method(s) should be made by experts based on systematic benchmarking studies reporting all the relevant performance measures. Currently variation interpretation methods have been applied mainly to amino acid substitutions and splice site variants; however, predictors for some other types of variations are available and there will be tools for new application areas in the near future. Common problems in prediction method usage are discussed. The number of features used for method training or the number of variation types predicted by a tool are not indicators of method performance. Many published gene, protein or disease‐specific benchmark studies suffer from too small dataset rendering the results useless. In the case of binary predictors, equal number of positive and negative cases is beneficial for training, the imbalance has to be corrected for performance assessment. Predictors cannot be better than the data they are based on and used for training and testing. Minor allele frequency (MAF) can help to detect likely benign cases, but the recommended MAF threshold is apparently too high. The fact that many rare variants are disease‐causing or ‐related does not mean that rare variants in general would be harmful. How large a portion of the tested variants a tool can predict (coverage) is not a quality measure. CONCLUSION: Methods used for variation interpretation have to be carefully selected. It should be possible to use only one predictor, with proven good performance or a limited number of complementary predictors with state‐of‐the‐art performance. Bear in mind that diseases and pathogenicity have a continuum and variants are not dichotomic i.e. either pathogenic or benign, either.
format	Online Article Text
id	pubmed-7507483
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	John Wiley and Sons Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-75074832020-09-28 Problems in variation interpretation guidelines and in their implementation in computational tools Vihinen, Mauno Mol Genet Genomic Med Original Articles BACKGROUND: ACMG/AMP and AMP/ASCO/CAP have released guidelines for variation interpretation, and ESHG for diagnostic sequencing. These guidelines contain recommendations including the use of computational prediction methods. The guidelines per se and the way they are implemented cause some problems. METHODS: Logical reasoning based on domain knowledge. RESULTS: According to the guidelines, several methods have to be used and they have to agree. This means that the methods with the poorest performance overrule the better ones. The choice of the prediction method(s) should be made by experts based on systematic benchmarking studies reporting all the relevant performance measures. Currently variation interpretation methods have been applied mainly to amino acid substitutions and splice site variants; however, predictors for some other types of variations are available and there will be tools for new application areas in the near future. Common problems in prediction method usage are discussed. The number of features used for method training or the number of variation types predicted by a tool are not indicators of method performance. Many published gene, protein or disease‐specific benchmark studies suffer from too small dataset rendering the results useless. In the case of binary predictors, equal number of positive and negative cases is beneficial for training, the imbalance has to be corrected for performance assessment. Predictors cannot be better than the data they are based on and used for training and testing. Minor allele frequency (MAF) can help to detect likely benign cases, but the recommended MAF threshold is apparently too high. The fact that many rare variants are disease‐causing or ‐related does not mean that rare variants in general would be harmful. How large a portion of the tested variants a tool can predict (coverage) is not a quality measure. CONCLUSION: Methods used for variation interpretation have to be carefully selected. It should be possible to use only one predictor, with proven good performance or a limited number of complementary predictors with state‐of‐the‐art performance. Bear in mind that diseases and pathogenicity have a continuum and variants are not dichotomic i.e. either pathogenic or benign, either. John Wiley and Sons Inc. 2020-03-11 /pmc/articles/PMC7507483/ /pubmed/32160417 http://dx.doi.org/10.1002/mgg3.1206 Text en © 2020 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle	Original Articles Vihinen, Mauno Problems in variation interpretation guidelines and in their implementation in computational tools
title	Problems in variation interpretation guidelines and in their implementation in computational tools
title_full	Problems in variation interpretation guidelines and in their implementation in computational tools
title_fullStr	Problems in variation interpretation guidelines and in their implementation in computational tools
title_full_unstemmed	Problems in variation interpretation guidelines and in their implementation in computational tools
title_short	Problems in variation interpretation guidelines and in their implementation in computational tools
title_sort	problems in variation interpretation guidelines and in their implementation in computational tools
topic	Original Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7507483/ https://www.ncbi.nlm.nih.gov/pubmed/32160417 http://dx.doi.org/10.1002/mgg3.1206
work_keys_str_mv	AT vihinenmauno problemsinvariationinterpretationguidelinesandintheirimplementationincomputationaltools

Problems in variation interpretation guidelines and in their implementation in computational tools

Ejemplares similares