Cargando…

A comparison of genomic profiles of complex diseases under different models

BACKGROUND: Various approaches are being used to predict individual risk to polygenic diseases from data provided by genome-wide association studies. As there are substantial differences between the diseases investigated, the data sets used and the way they are tested, it is difficult to assess whic...

Descripción completa

Detalles Bibliográficos
Autores principales: Potenciano, Víctor, Abad-Grau, María Mar, Alcina, Antonio, Matesanz, Fuencisla
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4717655/
https://www.ncbi.nlm.nih.gov/pubmed/26782991
http://dx.doi.org/10.1186/s12920-015-0157-2
_version_ 1782410693564170240
author Potenciano, Víctor
Abad-Grau, María Mar
Alcina, Antonio
Matesanz, Fuencisla
author_facet Potenciano, Víctor
Abad-Grau, María Mar
Alcina, Antonio
Matesanz, Fuencisla
author_sort Potenciano, Víctor
collection PubMed
description BACKGROUND: Various approaches are being used to predict individual risk to polygenic diseases from data provided by genome-wide association studies. As there are substantial differences between the diseases investigated, the data sets used and the way they are tested, it is difficult to assess which models are more suitable for this task. RESULTS: We compared different approaches for seven complex diseases provided by the Wellcome Trust Case Control Consortium (WTCCC) under a within-study validation approach. Risk models were inferred using a variety of learning machines and assumptions about the underlying genetic model, including a haplotype-based approach with different haplotype lengths and different thresholds in association levels to choose loci as part of the predictive model. In accordance with previous work, our results generally showed low accuracy considering disease heritability and population prevalence. However, the boosting algorithm returned a predictive area under the ROC curve (AUC) of 0.8805 for Type 1 diabetes (T1D) and 0.8087 for rheumatoid arthritis, both clearly over the AUC obtained by other approaches and over 0.75, which is the minimum required for a disease to be successfully tested on a sample at risk, which means that boosting is a promising approach. Its good performance seems to be related to its robustness to redundant data, as in the case of genome-wide data sets due to linkage disequilibrium. CONCLUSIONS: In view of our results, the boosting approach may be suitable for modeling individual predisposition to Type 1 diabetes and rheumatoid arthritis based on genome-wide data and should be considered for more in-depth research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0157-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4717655
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47176552016-01-20 A comparison of genomic profiles of complex diseases under different models Potenciano, Víctor Abad-Grau, María Mar Alcina, Antonio Matesanz, Fuencisla BMC Med Genomics Research Article BACKGROUND: Various approaches are being used to predict individual risk to polygenic diseases from data provided by genome-wide association studies. As there are substantial differences between the diseases investigated, the data sets used and the way they are tested, it is difficult to assess which models are more suitable for this task. RESULTS: We compared different approaches for seven complex diseases provided by the Wellcome Trust Case Control Consortium (WTCCC) under a within-study validation approach. Risk models were inferred using a variety of learning machines and assumptions about the underlying genetic model, including a haplotype-based approach with different haplotype lengths and different thresholds in association levels to choose loci as part of the predictive model. In accordance with previous work, our results generally showed low accuracy considering disease heritability and population prevalence. However, the boosting algorithm returned a predictive area under the ROC curve (AUC) of 0.8805 for Type 1 diabetes (T1D) and 0.8087 for rheumatoid arthritis, both clearly over the AUC obtained by other approaches and over 0.75, which is the minimum required for a disease to be successfully tested on a sample at risk, which means that boosting is a promising approach. Its good performance seems to be related to its robustness to redundant data, as in the case of genome-wide data sets due to linkage disequilibrium. CONCLUSIONS: In view of our results, the boosting approach may be suitable for modeling individual predisposition to Type 1 diabetes and rheumatoid arthritis based on genome-wide data and should be considered for more in-depth research. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-015-0157-2) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-19 /pmc/articles/PMC4717655/ /pubmed/26782991 http://dx.doi.org/10.1186/s12920-015-0157-2 Text en © Potenciano et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Potenciano, Víctor
Abad-Grau, María Mar
Alcina, Antonio
Matesanz, Fuencisla
A comparison of genomic profiles of complex diseases under different models
title A comparison of genomic profiles of complex diseases under different models
title_full A comparison of genomic profiles of complex diseases under different models
title_fullStr A comparison of genomic profiles of complex diseases under different models
title_full_unstemmed A comparison of genomic profiles of complex diseases under different models
title_short A comparison of genomic profiles of complex diseases under different models
title_sort comparison of genomic profiles of complex diseases under different models
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4717655/
https://www.ncbi.nlm.nih.gov/pubmed/26782991
http://dx.doi.org/10.1186/s12920-015-0157-2
work_keys_str_mv AT potencianovictor acomparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT abadgraumariamar acomparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT alcinaantonio acomparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT matesanzfuencisla acomparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT potencianovictor comparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT abadgraumariamar comparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT alcinaantonio comparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels
AT matesanzfuencisla comparisonofgenomicprofilesofcomplexdiseasesunderdifferentmodels