Cargando…

Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data

Owing to recent improvement of genotyping technology, large-scale genetic data can be utilized to identify disease susceptibility loci and this successful finding has substantially improved our understanding of complex diseases. However, in spite of these successes, most of the genetic effects for m...

Descripción completa

Detalles Bibliográficos
Autores principales: Won, Sungho, Choi, Hosik, Park, Suyeon, Lee, Juyoung, Park, Changyi, Kwon, Sunghoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539442/
https://www.ncbi.nlm.nih.gov/pubmed/26346893
http://dx.doi.org/10.1155/2015/605891
_version_ 1782386105631375360
author Won, Sungho
Choi, Hosik
Park, Suyeon
Lee, Juyoung
Park, Changyi
Kwon, Sunghoon
author_facet Won, Sungho
Choi, Hosik
Park, Suyeon
Lee, Juyoung
Park, Changyi
Kwon, Sunghoon
author_sort Won, Sungho
collection PubMed
description Owing to recent improvement of genotyping technology, large-scale genetic data can be utilized to identify disease susceptibility loci and this successful finding has substantially improved our understanding of complex diseases. However, in spite of these successes, most of the genetic effects for many complex diseases were found to be very small, which have been a big hurdle to build disease prediction model. Recently, many statistical methods based on penalized regressions have been proposed to tackle the so-called “large P and small N” problem. Penalized regressions including least absolute selection and shrinkage operator (LASSO) and ridge regression limit the space of parameters, and this constraint enables the estimation of effects for very large number of SNPs. Various extensions have been suggested, and, in this report, we compare their accuracy by applying them to several complex diseases. Our results show that penalized regressions are usually robust and provide better accuracy than the existing methods for at least diseases under consideration.
format Online
Article
Text
id pubmed-4539442
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-45394422015-09-06 Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data Won, Sungho Choi, Hosik Park, Suyeon Lee, Juyoung Park, Changyi Kwon, Sunghoon Biomed Res Int Research Article Owing to recent improvement of genotyping technology, large-scale genetic data can be utilized to identify disease susceptibility loci and this successful finding has substantially improved our understanding of complex diseases. However, in spite of these successes, most of the genetic effects for many complex diseases were found to be very small, which have been a big hurdle to build disease prediction model. Recently, many statistical methods based on penalized regressions have been proposed to tackle the so-called “large P and small N” problem. Penalized regressions including least absolute selection and shrinkage operator (LASSO) and ridge regression limit the space of parameters, and this constraint enables the estimation of effects for very large number of SNPs. Various extensions have been suggested, and, in this report, we compare their accuracy by applying them to several complex diseases. Our results show that penalized regressions are usually robust and provide better accuracy than the existing methods for at least diseases under consideration. Hindawi Publishing Corporation 2015 2015-08-04 /pmc/articles/PMC4539442/ /pubmed/26346893 http://dx.doi.org/10.1155/2015/605891 Text en Copyright © 2015 Sungho Won et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Won, Sungho
Choi, Hosik
Park, Suyeon
Lee, Juyoung
Park, Changyi
Kwon, Sunghoon
Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
title Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
title_full Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
title_fullStr Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
title_full_unstemmed Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
title_short Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data
title_sort evaluation of penalized and nonpenalized methods for disease prediction with large-scale genetic data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4539442/
https://www.ncbi.nlm.nih.gov/pubmed/26346893
http://dx.doi.org/10.1155/2015/605891
work_keys_str_mv AT wonsungho evaluationofpenalizedandnonpenalizedmethodsfordiseasepredictionwithlargescalegeneticdata
AT choihosik evaluationofpenalizedandnonpenalizedmethodsfordiseasepredictionwithlargescalegeneticdata
AT parksuyeon evaluationofpenalizedandnonpenalizedmethodsfordiseasepredictionwithlargescalegeneticdata
AT leejuyoung evaluationofpenalizedandnonpenalizedmethodsfordiseasepredictionwithlargescalegeneticdata
AT parkchangyi evaluationofpenalizedandnonpenalizedmethodsfordiseasepredictionwithlargescalegeneticdata
AT kwonsunghoon evaluationofpenalizedandnonpenalizedmethodsfordiseasepredictionwithlargescalegeneticdata