Cargando…

Predictive long-range allele-specific mapping of regulatory variants and target transcripts

Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from s...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Kibaick, Lee, Seulkee, Bang, Hyoeun, Choi, Jung Kyoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5391085/
https://www.ncbi.nlm.nih.gov/pubmed/28406955
http://dx.doi.org/10.1371/journal.pone.0175768
_version_ 1783229209685000192
author Lee, Kibaick
Lee, Seulkee
Bang, Hyoeun
Choi, Jung Kyoon
author_facet Lee, Kibaick
Lee, Seulkee
Bang, Hyoeun
Choi, Jung Kyoon
author_sort Lee, Kibaick
collection PubMed
description Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from sample-to-sample experimental variation and trans-acting or environmental effects. Instead, alleles at heterozygous loci can be compared within a sample, thereby controlling for those confounding factors. Here we introduce a method for chromatin structure-based allele-specific pairing of regulatory variants and target transcripts. With phased genotypes, much of allele-specific expression could be explained by paired allelic cis-regulation across a long range. This approach showed approximately two times greater sensitivity than QTL mapping. There are cases in which allele imbalance cannot be tested because heterozygotes are not available among reference samples. Therefore, we employed a machine learning method to predict missing positive cases based on various features shared by observed allele-specific pairs. We showed that only 10 reference samples are sufficient to achieve high prediction accuracy with a low sampling variation. In conclusion, our method enables highly sensitive fine mapping and target identification for trait-associated variants based on a small number of reference samples.
format Online
Article
Text
id pubmed-5391085
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53910852017-05-03 Predictive long-range allele-specific mapping of regulatory variants and target transcripts Lee, Kibaick Lee, Seulkee Bang, Hyoeun Choi, Jung Kyoon PLoS One Research Article Genome-wide association studies (GWASs) have identified a large number of noncoding associations, calling for systematic mapping to causal regulatory variants and their distal target genes. A widely used method, quantitative trait loci (QTL) mapping for chromatin or expression traits, suffers from sample-to-sample experimental variation and trans-acting or environmental effects. Instead, alleles at heterozygous loci can be compared within a sample, thereby controlling for those confounding factors. Here we introduce a method for chromatin structure-based allele-specific pairing of regulatory variants and target transcripts. With phased genotypes, much of allele-specific expression could be explained by paired allelic cis-regulation across a long range. This approach showed approximately two times greater sensitivity than QTL mapping. There are cases in which allele imbalance cannot be tested because heterozygotes are not available among reference samples. Therefore, we employed a machine learning method to predict missing positive cases based on various features shared by observed allele-specific pairs. We showed that only 10 reference samples are sufficient to achieve high prediction accuracy with a low sampling variation. In conclusion, our method enables highly sensitive fine mapping and target identification for trait-associated variants based on a small number of reference samples. Public Library of Science 2017-04-13 /pmc/articles/PMC5391085/ /pubmed/28406955 http://dx.doi.org/10.1371/journal.pone.0175768 Text en © 2017 Lee et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Kibaick
Lee, Seulkee
Bang, Hyoeun
Choi, Jung Kyoon
Predictive long-range allele-specific mapping of regulatory variants and target transcripts
title Predictive long-range allele-specific mapping of regulatory variants and target transcripts
title_full Predictive long-range allele-specific mapping of regulatory variants and target transcripts
title_fullStr Predictive long-range allele-specific mapping of regulatory variants and target transcripts
title_full_unstemmed Predictive long-range allele-specific mapping of regulatory variants and target transcripts
title_short Predictive long-range allele-specific mapping of regulatory variants and target transcripts
title_sort predictive long-range allele-specific mapping of regulatory variants and target transcripts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5391085/
https://www.ncbi.nlm.nih.gov/pubmed/28406955
http://dx.doi.org/10.1371/journal.pone.0175768
work_keys_str_mv AT leekibaick predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
AT leeseulkee predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
AT banghyoeun predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts
AT choijungkyoon predictivelongrangeallelespecificmappingofregulatoryvariantsandtargettranscripts