Cargando…
Intergenic disease-associated regions are abundant in novel transcripts
BACKGROUND: Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747244/ https://www.ncbi.nlm.nih.gov/pubmed/29284497 http://dx.doi.org/10.1186/s13059-017-1363-3 |
_version_ | 1783289253394907136 |
---|---|
author | Bartonicek, N. Clark, M. B. Quek, X. C. Torpy, J. R. Pritchard, A. L. Maag, J. L. V. Gloss, B. S. Crawford, J. Taft, R. J. Hayward, N. K. Montgomery, G. W. Mattick, J. S. Mercer, T. R. Dinger, M. E. |
author_facet | Bartonicek, N. Clark, M. B. Quek, X. C. Torpy, J. R. Pritchard, A. L. Maag, J. L. V. Gloss, B. S. Crawford, J. Taft, R. J. Hayward, N. K. Montgomery, G. W. Mattick, J. S. Mercer, T. R. Dinger, M. E. |
author_sort | Bartonicek, N. |
collection | PubMed |
description | BACKGROUND: Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. RESULTS: To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. CONCLUSIONS: This resource of previously unreported transcripts in disease-associated regions (http://gwas-captureseq.dingerlab.org) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1363-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5747244 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57472442018-01-03 Intergenic disease-associated regions are abundant in novel transcripts Bartonicek, N. Clark, M. B. Quek, X. C. Torpy, J. R. Pritchard, A. L. Maag, J. L. V. Gloss, B. S. Crawford, J. Taft, R. J. Hayward, N. K. Montgomery, G. W. Mattick, J. S. Mercer, T. R. Dinger, M. E. Genome Biol Research Article BACKGROUND: Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. RESULTS: To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. CONCLUSIONS: This resource of previously unreported transcripts in disease-associated regions (http://gwas-captureseq.dingerlab.org) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1363-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-28 /pmc/articles/PMC5747244/ /pubmed/29284497 http://dx.doi.org/10.1186/s13059-017-1363-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Bartonicek, N. Clark, M. B. Quek, X. C. Torpy, J. R. Pritchard, A. L. Maag, J. L. V. Gloss, B. S. Crawford, J. Taft, R. J. Hayward, N. K. Montgomery, G. W. Mattick, J. S. Mercer, T. R. Dinger, M. E. Intergenic disease-associated regions are abundant in novel transcripts |
title | Intergenic disease-associated regions are abundant in novel transcripts |
title_full | Intergenic disease-associated regions are abundant in novel transcripts |
title_fullStr | Intergenic disease-associated regions are abundant in novel transcripts |
title_full_unstemmed | Intergenic disease-associated regions are abundant in novel transcripts |
title_short | Intergenic disease-associated regions are abundant in novel transcripts |
title_sort | intergenic disease-associated regions are abundant in novel transcripts |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747244/ https://www.ncbi.nlm.nih.gov/pubmed/29284497 http://dx.doi.org/10.1186/s13059-017-1363-3 |
work_keys_str_mv | AT bartonicekn intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT clarkmb intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT quekxc intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT torpyjr intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT pritchardal intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT maagjlv intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT glossbs intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT crawfordj intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT taftrj intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT haywardnk intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT montgomerygw intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT mattickjs intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT mercertr intergenicdiseaseassociatedregionsareabundantinnoveltranscripts AT dingerme intergenicdiseaseassociatedregionsareabundantinnoveltranscripts |