Cargando…

Bioinformatics challenges for genome-wide association studies

Motivation: The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has nec...

Descripción completa

Detalles Bibliográficos
Autores principales: Moore, Jason H., Asselbergs, Folkert W., Williams, Scott M.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2820680/
https://www.ncbi.nlm.nih.gov/pubmed/20053841
http://dx.doi.org/10.1093/bioinformatics/btp713
_version_ 1782177402882883584
author Moore, Jason H.
Asselbergs, Folkert W.
Williams, Scott M.
author_facet Moore, Jason H.
Asselbergs, Folkert W.
Williams, Scott M.
author_sort Moore, Jason H.
collection PubMed
description Motivation: The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype–phenotype relationship that is characterized by significant heterogeneity and gene–gene and gene–environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods. Contact: jason.h.moore@dartmouth.edu
format Text
id pubmed-2820680
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28206802010-02-12 Bioinformatics challenges for genome-wide association studies Moore, Jason H. Asselbergs, Folkert W. Williams, Scott M. Bioinformatics Review Motivation: The sequencing of the human genome has made it possible to identify an informative set of >1 million single nucleotide polymorphisms (SNPs) across the genome that can be used to carry out genome-wide association studies (GWASs). The availability of massive amounts of GWAS data has necessitated the development of new biostatistical methods for quality control, imputation and analysis issues including multiple testing. This work has been successful and has enabled the discovery of new associations that have been replicated in multiple studies. However, it is now recognized that most SNPs discovered via GWAS have small effects on disease susceptibility and thus may not be suitable for improving health care through genetic testing. One likely explanation for the mixed results of GWAS is that the current biostatistical analysis paradigm is by design agnostic or unbiased in that it ignores all prior knowledge about disease pathobiology. Further, the linear modeling framework that is employed in GWAS often considers only one SNP at a time thus ignoring their genomic and environmental context. There is now a shift away from the biostatistical approach toward a more holistic approach that recognizes the complexity of the genotype–phenotype relationship that is characterized by significant heterogeneity and gene–gene and gene–environment interaction. We argue here that bioinformatics has an important role to play in addressing the complexity of the underlying genetic basis of common human diseases. The goal of this review is to identify and discuss those GWAS challenges that will require computational methods. Contact: jason.h.moore@dartmouth.edu Oxford University Press 2010-02-15 2010-01-06 /pmc/articles/PMC2820680/ /pubmed/20053841 http://dx.doi.org/10.1093/bioinformatics/btp713 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review
Moore, Jason H.
Asselbergs, Folkert W.
Williams, Scott M.
Bioinformatics challenges for genome-wide association studies
title Bioinformatics challenges for genome-wide association studies
title_full Bioinformatics challenges for genome-wide association studies
title_fullStr Bioinformatics challenges for genome-wide association studies
title_full_unstemmed Bioinformatics challenges for genome-wide association studies
title_short Bioinformatics challenges for genome-wide association studies
title_sort bioinformatics challenges for genome-wide association studies
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2820680/
https://www.ncbi.nlm.nih.gov/pubmed/20053841
http://dx.doi.org/10.1093/bioinformatics/btp713
work_keys_str_mv AT moorejasonh bioinformaticschallengesforgenomewideassociationstudies
AT asselbergsfolkertw bioinformaticschallengesforgenomewideassociationstudies
AT williamsscottm bioinformaticschallengesforgenomewideassociationstudies