Cargando…

Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data

BACKGROUND: Single nucleotide polymorphisms (SNPs) have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ni, Yunyun, Weber Hall, Amelia, Battenhouse, Anna, Iyer, Vishwanath R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3434080/
https://www.ncbi.nlm.nih.gov/pubmed/22950704
http://dx.doi.org/10.1186/1471-2156-13-46
_version_ 1782242387997753344
author Ni, Yunyun
Weber Hall, Amelia
Battenhouse, Anna
Iyer, Vishwanath R
author_facet Ni, Yunyun
Weber Hall, Amelia
Battenhouse, Anna
Iyer, Vishwanath R
author_sort Ni, Yunyun
collection PubMed
description BACKGROUND: Single nucleotide polymorphisms (SNPs) have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can affect transcription factor binding in an allele-specific and heritable manner. However, such analysis has relied on prior whole-genome genotypes provided by large external projects such as HapMap and the 1000 Genomes Project. This requirement limits the study of allele-specific effects of SNPs in primary patient samples from diseases of interest, where complete genotypes are not readily available. RESULTS: In this study, we show that we are able to identify SNPs de novo and accurately from ChIP-seq data generated in the ENCODE Project. Our de novo identified SNPs from ChIP-seq data are highly concordant with published genotypes. Independent experimental verification of more than 100 sites estimates our false discovery rate at less than 5%. Analysis of transcription factor binding at de novo identified SNPs revealed widespread heritable allele-specific binding, confirming previous observations. SNPs identified from ChIP-seq datasets were significantly enriched for disease-associated variants, and we identified dozens of allele-specific binding events in non-coding regions that could distinguish between disease and normal haplotypes. CONCLUSIONS: Our approach combines SNP discovery, genotyping and allele-specific analysis, but is selectively focused on functional regulatory elements occupied by transcription factors or epigenetic marks, and will therefore be valuable for identifying the functional regulatory consequences of non-coding SNPs in primary disease samples.
format Online
Article
Text
id pubmed-3434080
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34340802012-09-06 Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data Ni, Yunyun Weber Hall, Amelia Battenhouse, Anna Iyer, Vishwanath R BMC Genet Methodology Article BACKGROUND: Single nucleotide polymorphisms (SNPs) have been associated with many aspects of human development and disease, and many non-coding SNPs associated with disease risk are presumed to affect gene regulation. We have previously shown that SNPs within transcription factor binding sites can affect transcription factor binding in an allele-specific and heritable manner. However, such analysis has relied on prior whole-genome genotypes provided by large external projects such as HapMap and the 1000 Genomes Project. This requirement limits the study of allele-specific effects of SNPs in primary patient samples from diseases of interest, where complete genotypes are not readily available. RESULTS: In this study, we show that we are able to identify SNPs de novo and accurately from ChIP-seq data generated in the ENCODE Project. Our de novo identified SNPs from ChIP-seq data are highly concordant with published genotypes. Independent experimental verification of more than 100 sites estimates our false discovery rate at less than 5%. Analysis of transcription factor binding at de novo identified SNPs revealed widespread heritable allele-specific binding, confirming previous observations. SNPs identified from ChIP-seq datasets were significantly enriched for disease-associated variants, and we identified dozens of allele-specific binding events in non-coding regions that could distinguish between disease and normal haplotypes. CONCLUSIONS: Our approach combines SNP discovery, genotyping and allele-specific analysis, but is selectively focused on functional regulatory elements occupied by transcription factors or epigenetic marks, and will therefore be valuable for identifying the functional regulatory consequences of non-coding SNPs in primary disease samples. BioMed Central 2012-09-05 /pmc/articles/PMC3434080/ /pubmed/22950704 http://dx.doi.org/10.1186/1471-2156-13-46 Text en Copyright ©2012 Ni et al. licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ni, Yunyun
Weber Hall, Amelia
Battenhouse, Anna
Iyer, Vishwanath R
Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data
title Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data
title_full Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data
title_fullStr Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data
title_full_unstemmed Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data
title_short Simultaneous SNP identification and assessment of allele-specific bias from ChIP-seq data
title_sort simultaneous snp identification and assessment of allele-specific bias from chip-seq data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3434080/
https://www.ncbi.nlm.nih.gov/pubmed/22950704
http://dx.doi.org/10.1186/1471-2156-13-46
work_keys_str_mv AT niyunyun simultaneoussnpidentificationandassessmentofallelespecificbiasfromchipseqdata
AT weberhallamelia simultaneoussnpidentificationandassessmentofallelespecificbiasfromchipseqdata
AT battenhouseanna simultaneoussnpidentificationandassessmentofallelespecificbiasfromchipseqdata
AT iyervishwanathr simultaneoussnpidentificationandassessmentofallelespecificbiasfromchipseqdata