Cargando…

RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses

Advances in omics technologies have led to the discovery of genetic markers, or single nucleotide polymorphisms (SNPs), that are associated with particular diseases or complex traits. Although there have been significant improvements in the approaches used to analyse associations of SNPs with diseas...

Descripción completa

Detalles Bibliográficos
Autores principales: Salehe, Bajuna Rashid, Jones, Chris Ian, Di Fatta, Giuseppe, McGuffin, Liam James
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5404774/
https://www.ncbi.nlm.nih.gov/pubmed/28441463
http://dx.doi.org/10.1371/journal.pone.0175957
_version_ 1783231646574575616
author Salehe, Bajuna Rashid
Jones, Chris Ian
Di Fatta, Giuseppe
McGuffin, Liam James
author_facet Salehe, Bajuna Rashid
Jones, Chris Ian
Di Fatta, Giuseppe
McGuffin, Liam James
author_sort Salehe, Bajuna Rashid
collection PubMed
description Advances in omics technologies have led to the discovery of genetic markers, or single nucleotide polymorphisms (SNPs), that are associated with particular diseases or complex traits. Although there have been significant improvements in the approaches used to analyse associations of SNPs with disease, further optimised and rapid techniques are needed to keep up with the rate of SNP discovery, which has exacerbated the ‘missing heritability’ problem. Here, we have devised a novel, integrated, heuristic-based, hybrid analytical computational pipeline, for rapidly detecting novel or key genetic variants that are associated with diseases or complex traits. Our pipeline is particularly useful in genetic association studies where the genotyped SNP data are highly dimensional, and the complex trait phenotype involved is continuous. In particular, the pipeline is more efficient for investigating small sets of genotyped SNPs defined in high dimensional spaces that may be associated with continuous phenotypes, rather than for the investigation of whole genome variants. The pipeline, which employs a consensus approach based on the random forest, was able to rapidly identify previously unseen key SNPs, that are significantly associated with the platelet response phenotype, which was used as our complex trait case study. Several of these SNPs, such as rs6141803 of COMMD7 and rs41316468 in PKT2B, have independently confirmed associations with cardiovascular diseases (CVDs) according to other unrelated studies, suggesting that our pipeline is robust in identifying key genetic variants. Our new pipeline provides an important step towards addressing the problem of ‘missing heritability’ through enhanced detection of key genetic variants (SNPs) that are associated with continuous complex traits/disease phenotypes.
format Online
Article
Text
id pubmed-5404774
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54047742017-05-12 RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses Salehe, Bajuna Rashid Jones, Chris Ian Di Fatta, Giuseppe McGuffin, Liam James PLoS One Research Article Advances in omics technologies have led to the discovery of genetic markers, or single nucleotide polymorphisms (SNPs), that are associated with particular diseases or complex traits. Although there have been significant improvements in the approaches used to analyse associations of SNPs with disease, further optimised and rapid techniques are needed to keep up with the rate of SNP discovery, which has exacerbated the ‘missing heritability’ problem. Here, we have devised a novel, integrated, heuristic-based, hybrid analytical computational pipeline, for rapidly detecting novel or key genetic variants that are associated with diseases or complex traits. Our pipeline is particularly useful in genetic association studies where the genotyped SNP data are highly dimensional, and the complex trait phenotype involved is continuous. In particular, the pipeline is more efficient for investigating small sets of genotyped SNPs defined in high dimensional spaces that may be associated with continuous phenotypes, rather than for the investigation of whole genome variants. The pipeline, which employs a consensus approach based on the random forest, was able to rapidly identify previously unseen key SNPs, that are significantly associated with the platelet response phenotype, which was used as our complex trait case study. Several of these SNPs, such as rs6141803 of COMMD7 and rs41316468 in PKT2B, have independently confirmed associations with cardiovascular diseases (CVDs) according to other unrelated studies, suggesting that our pipeline is robust in identifying key genetic variants. Our new pipeline provides an important step towards addressing the problem of ‘missing heritability’ through enhanced detection of key genetic variants (SNPs) that are associated with continuous complex traits/disease phenotypes. Public Library of Science 2017-04-25 /pmc/articles/PMC5404774/ /pubmed/28441463 http://dx.doi.org/10.1371/journal.pone.0175957 Text en © 2017 Salehe et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Salehe, Bajuna Rashid
Jones, Chris Ian
Di Fatta, Giuseppe
McGuffin, Liam James
RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses
title RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses
title_full RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses
title_fullStr RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses
title_full_unstemmed RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses
title_short RAPIDSNPs: A new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified SNPs that are significantly associated with individual platelet responses
title_sort rapidsnps: a new computational pipeline for rapidly identifying key genetic variants reveals previously unidentified snps that are significantly associated with individual platelet responses
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5404774/
https://www.ncbi.nlm.nih.gov/pubmed/28441463
http://dx.doi.org/10.1371/journal.pone.0175957
work_keys_str_mv AT salehebajunarashid rapidsnpsanewcomputationalpipelineforrapidlyidentifyingkeygeneticvariantsrevealspreviouslyunidentifiedsnpsthataresignificantlyassociatedwithindividualplateletresponses
AT joneschrisian rapidsnpsanewcomputationalpipelineforrapidlyidentifyingkeygeneticvariantsrevealspreviouslyunidentifiedsnpsthataresignificantlyassociatedwithindividualplateletresponses
AT difattagiuseppe rapidsnpsanewcomputationalpipelineforrapidlyidentifyingkeygeneticvariantsrevealspreviouslyunidentifiedsnpsthataresignificantlyassociatedwithindividualplateletresponses
AT mcguffinliamjames rapidsnpsanewcomputationalpipelineforrapidlyidentifyingkeygeneticvariantsrevealspreviouslyunidentifiedsnpsthataresignificantlyassociatedwithindividualplateletresponses