Cargando…

Intergenic disease-associated regions are abundant in novel transcripts

BACKGROUND: Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in...

Descripción completa

Detalles Bibliográficos
Autores principales: Bartonicek, N., Clark, M. B., Quek, X. C., Torpy, J. R., Pritchard, A. L., Maag, J. L. V., Gloss, B. S., Crawford, J., Taft, R. J., Hayward, N. K., Montgomery, G. W., Mattick, J. S., Mercer, T. R., Dinger, M. E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747244/
https://www.ncbi.nlm.nih.gov/pubmed/29284497
http://dx.doi.org/10.1186/s13059-017-1363-3
_version_ 1783289253394907136
author Bartonicek, N.
Clark, M. B.
Quek, X. C.
Torpy, J. R.
Pritchard, A. L.
Maag, J. L. V.
Gloss, B. S.
Crawford, J.
Taft, R. J.
Hayward, N. K.
Montgomery, G. W.
Mattick, J. S.
Mercer, T. R.
Dinger, M. E.
author_facet Bartonicek, N.
Clark, M. B.
Quek, X. C.
Torpy, J. R.
Pritchard, A. L.
Maag, J. L. V.
Gloss, B. S.
Crawford, J.
Taft, R. J.
Hayward, N. K.
Montgomery, G. W.
Mattick, J. S.
Mercer, T. R.
Dinger, M. E.
author_sort Bartonicek, N.
collection PubMed
description BACKGROUND: Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. RESULTS: To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. CONCLUSIONS: This resource of previously unreported transcripts in disease-associated regions (http://gwas-captureseq.dingerlab.org) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1363-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5747244
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57472442018-01-03 Intergenic disease-associated regions are abundant in novel transcripts Bartonicek, N. Clark, M. B. Quek, X. C. Torpy, J. R. Pritchard, A. L. Maag, J. L. V. Gloss, B. S. Crawford, J. Taft, R. J. Hayward, N. K. Montgomery, G. W. Mattick, J. S. Mercer, T. R. Dinger, M. E. Genome Biol Research Article BACKGROUND: Genotyping of large populations through genome-wide association studies (GWAS) has successfully identified many genomic variants associated with traits or disease risk. Unexpectedly, a large proportion of GWAS single nucleotide polymorphisms (SNPs) and associated haplotype blocks are in intronic and intergenic regions, hindering their functional evaluation. While some of these risk-susceptibility regions encompass cis-regulatory sites, their transcriptional potential has never been systematically explored. RESULTS: To detect rare tissue-specific expression, we employed the transcript-enrichment method CaptureSeq on 21 human tissues to identify 1775 multi-exonic transcripts from 561 intronic and intergenic haploblocks associated with 392 traits and diseases, covering 73.9 Mb (2.2%) of the human genome. We show that a large proportion (85%) of disease-associated haploblocks express novel multi-exonic non-coding transcripts that are tissue-specific and enriched for GWAS SNPs as well as epigenetic markers of active transcription and enhancer activity. Similarly, we captured transcriptomes from 13 melanomas, targeting nine melanoma-associated haploblocks, and characterized 31 novel melanoma-specific transcripts that include fusion proteins, novel exons and non-coding RNAs, one-third of which showed allelically imbalanced expression. CONCLUSIONS: This resource of previously unreported transcripts in disease-associated regions (http://gwas-captureseq.dingerlab.org) should provide an important starting point for the translational community in search of novel biomarkers, disease mechanisms, and drug targets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1363-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-28 /pmc/articles/PMC5747244/ /pubmed/29284497 http://dx.doi.org/10.1186/s13059-017-1363-3 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Bartonicek, N.
Clark, M. B.
Quek, X. C.
Torpy, J. R.
Pritchard, A. L.
Maag, J. L. V.
Gloss, B. S.
Crawford, J.
Taft, R. J.
Hayward, N. K.
Montgomery, G. W.
Mattick, J. S.
Mercer, T. R.
Dinger, M. E.
Intergenic disease-associated regions are abundant in novel transcripts
title Intergenic disease-associated regions are abundant in novel transcripts
title_full Intergenic disease-associated regions are abundant in novel transcripts
title_fullStr Intergenic disease-associated regions are abundant in novel transcripts
title_full_unstemmed Intergenic disease-associated regions are abundant in novel transcripts
title_short Intergenic disease-associated regions are abundant in novel transcripts
title_sort intergenic disease-associated regions are abundant in novel transcripts
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5747244/
https://www.ncbi.nlm.nih.gov/pubmed/29284497
http://dx.doi.org/10.1186/s13059-017-1363-3
work_keys_str_mv AT bartonicekn intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT clarkmb intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT quekxc intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT torpyjr intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT pritchardal intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT maagjlv intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT glossbs intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT crawfordj intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT taftrj intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT haywardnk intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT montgomerygw intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT mattickjs intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT mercertr intergenicdiseaseassociatedregionsareabundantinnoveltranscripts
AT dingerme intergenicdiseaseassociatedregionsareabundantinnoveltranscripts