Cargando…

Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods

BACKGROUND: In silico models have recently been created in order to predict which genetic variants are more likely to contribute to the risk of a complex trait given their functional characteristics. However, there has been no comprehensive review as to which type of predictive accuracy measures and...

Descripción completa

Detalles Bibliográficos
Autores principales: Gagliano, Sarah A., Paterson, Andrew D., Weale, Michael E., Knight, Jo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4440290/
https://www.ncbi.nlm.nih.gov/pubmed/25997848
http://dx.doi.org/10.1186/s12864-015-1616-z
_version_ 1782372615259684864
author Gagliano, Sarah A.
Paterson, Andrew D.
Weale, Michael E.
Knight, Jo
author_facet Gagliano, Sarah A.
Paterson, Andrew D.
Weale, Michael E.
Knight, Jo
author_sort Gagliano, Sarah A.
collection PubMed
description BACKGROUND: In silico models have recently been created in order to predict which genetic variants are more likely to contribute to the risk of a complex trait given their functional characteristics. However, there has been no comprehensive review as to which type of predictive accuracy measures and data visualization techniques are most useful for assessing these models. METHODS: We assessed the performance of the models for predicting risk using various methodologies, some of which include: receiver operating characteristic (ROC) curves, histograms of classification probability, and the novel use of the quantile-quantile plot. These measures have variable interpretability depending on factors such as whether the dataset is balanced in terms of numbers of genetic variants classified as risk variants versus those that are not. RESULTS: We conclude that the area under the curve (AUC) is a suitable starting place, and for models with similar AUCs, violin plots are particularly useful for examining the distribution of the risk scores. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1616-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4440290
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44402902015-05-22 Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods Gagliano, Sarah A. Paterson, Andrew D. Weale, Michael E. Knight, Jo BMC Genomics Research Article BACKGROUND: In silico models have recently been created in order to predict which genetic variants are more likely to contribute to the risk of a complex trait given their functional characteristics. However, there has been no comprehensive review as to which type of predictive accuracy measures and data visualization techniques are most useful for assessing these models. METHODS: We assessed the performance of the models for predicting risk using various methodologies, some of which include: receiver operating characteristic (ROC) curves, histograms of classification probability, and the novel use of the quantile-quantile plot. These measures have variable interpretability depending on factors such as whether the dataset is balanced in terms of numbers of genetic variants classified as risk variants versus those that are not. RESULTS: We conclude that the area under the curve (AUC) is a suitable starting place, and for models with similar AUCs, violin plots are particularly useful for examining the distribution of the risk scores. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1616-z) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-22 /pmc/articles/PMC4440290/ /pubmed/25997848 http://dx.doi.org/10.1186/s12864-015-1616-z Text en © Gagliano et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Gagliano, Sarah A.
Paterson, Andrew D.
Weale, Michael E.
Knight, Jo
Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
title Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
title_full Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
title_fullStr Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
title_full_unstemmed Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
title_short Assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
title_sort assessing models for genetic prediction of complex traits: a comparison of visualization and quantitative methods
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4440290/
https://www.ncbi.nlm.nih.gov/pubmed/25997848
http://dx.doi.org/10.1186/s12864-015-1616-z
work_keys_str_mv AT gaglianosaraha assessingmodelsforgeneticpredictionofcomplextraitsacomparisonofvisualizationandquantitativemethods
AT patersonandrewd assessingmodelsforgeneticpredictionofcomplextraitsacomparisonofvisualizationandquantitativemethods
AT wealemichaele assessingmodelsforgeneticpredictionofcomplextraitsacomparisonofvisualizationandquantitativemethods
AT knightjo assessingmodelsforgeneticpredictionofcomplextraitsacomparisonofvisualizationandquantitativemethods