Cargando…

A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer

BACKGROUND: Sequencing of both healthy and disease singletons yields many novel and low frequency variants of uncertain significance (VUS). Complete gene and genome sequencing by next generation sequencing (NGS) significantly increases the number of VUS detected. While prior studies have emphasized...

Descripción completa

Detalles Bibliográficos
Autores principales: Mucaki, Eliseos J., Caminsky, Natasha G., Perri, Ami M., Lu, Ruipeng, Laederach, Alain, Halvorsen, Matthew, Knoll, Joan H. M., Rogan, Peter K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828881/
https://www.ncbi.nlm.nih.gov/pubmed/27067391
http://dx.doi.org/10.1186/s12920-016-0178-5
_version_ 1782426671067955200
author Mucaki, Eliseos J.
Caminsky, Natasha G.
Perri, Ami M.
Lu, Ruipeng
Laederach, Alain
Halvorsen, Matthew
Knoll, Joan H. M.
Rogan, Peter K.
author_facet Mucaki, Eliseos J.
Caminsky, Natasha G.
Perri, Ami M.
Lu, Ruipeng
Laederach, Alain
Halvorsen, Matthew
Knoll, Joan H. M.
Rogan, Peter K.
author_sort Mucaki, Eliseos J.
collection PubMed
description BACKGROUND: Sequencing of both healthy and disease singletons yields many novel and low frequency variants of uncertain significance (VUS). Complete gene and genome sequencing by next generation sequencing (NGS) significantly increases the number of VUS detected. While prior studies have emphasized protein coding variants, non-coding sequence variants have also been proven to significantly contribute to high penetrance disorders, such as hereditary breast and ovarian cancer (HBOC). We present a strategy for analyzing different functional classes of non-coding variants based on information theory (IT) and prioritizing patients with large intragenic deletions. METHODS: We captured and enriched for coding and non-coding variants in genes known to harbor mutations that increase HBOC risk. Custom oligonucleotide baits spanning the complete coding, non-coding, and intergenic regions 10 kb up- and downstream of ATM, BRCA1, BRCA2, CDH1, CHEK2, PALB2, and TP53 were synthesized for solution hybridization enrichment. Unique and divergent repetitive sequences were sequenced in 102 high-risk, anonymized patients without identified mutations in BRCA1/2. Aside from protein coding and copy number changes, IT-based sequence analysis was used to identify and prioritize pathogenic non-coding variants that occurred within sequence elements predicted to be recognized by proteins or protein complexes involved in mRNA splicing, transcription, and untranslated region (UTR) binding and structure. This approach was supplemented by in silico and laboratory analysis of UTR structure. RESULTS: 15,311 unique variants were identified, of which 245 occurred in coding regions. With the unified IT-framework, 132 variants were identified and 87 functionally significant VUS were further prioritized. An intragenic 32.1 kb interval in BRCA2 that was likely hemizygous was detected in one patient. We also identified 4 stop-gain variants and 3 reading-frame altering exonic insertions/deletions (indels). CONCLUSIONS: We have presented a strategy for complete gene sequence analysis followed by a unified framework for interpreting non-coding variants that may affect gene expression. This approach distills large numbers of variants detected by NGS to a limited set of variants prioritized as potential deleterious changes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-016-0178-5) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4828881
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48288812016-04-13 A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer Mucaki, Eliseos J. Caminsky, Natasha G. Perri, Ami M. Lu, Ruipeng Laederach, Alain Halvorsen, Matthew Knoll, Joan H. M. Rogan, Peter K. BMC Med Genomics Technical Advance BACKGROUND: Sequencing of both healthy and disease singletons yields many novel and low frequency variants of uncertain significance (VUS). Complete gene and genome sequencing by next generation sequencing (NGS) significantly increases the number of VUS detected. While prior studies have emphasized protein coding variants, non-coding sequence variants have also been proven to significantly contribute to high penetrance disorders, such as hereditary breast and ovarian cancer (HBOC). We present a strategy for analyzing different functional classes of non-coding variants based on information theory (IT) and prioritizing patients with large intragenic deletions. METHODS: We captured and enriched for coding and non-coding variants in genes known to harbor mutations that increase HBOC risk. Custom oligonucleotide baits spanning the complete coding, non-coding, and intergenic regions 10 kb up- and downstream of ATM, BRCA1, BRCA2, CDH1, CHEK2, PALB2, and TP53 were synthesized for solution hybridization enrichment. Unique and divergent repetitive sequences were sequenced in 102 high-risk, anonymized patients without identified mutations in BRCA1/2. Aside from protein coding and copy number changes, IT-based sequence analysis was used to identify and prioritize pathogenic non-coding variants that occurred within sequence elements predicted to be recognized by proteins or protein complexes involved in mRNA splicing, transcription, and untranslated region (UTR) binding and structure. This approach was supplemented by in silico and laboratory analysis of UTR structure. RESULTS: 15,311 unique variants were identified, of which 245 occurred in coding regions. With the unified IT-framework, 132 variants were identified and 87 functionally significant VUS were further prioritized. An intragenic 32.1 kb interval in BRCA2 that was likely hemizygous was detected in one patient. We also identified 4 stop-gain variants and 3 reading-frame altering exonic insertions/deletions (indels). CONCLUSIONS: We have presented a strategy for complete gene sequence analysis followed by a unified framework for interpreting non-coding variants that may affect gene expression. This approach distills large numbers of variants detected by NGS to a limited set of variants prioritized as potential deleterious changes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12920-016-0178-5) contains supplementary material, which is available to authorized users. BioMed Central 2016-04-11 /pmc/articles/PMC4828881/ /pubmed/27067391 http://dx.doi.org/10.1186/s12920-016-0178-5 Text en © Mucaki et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Technical Advance
Mucaki, Eliseos J.
Caminsky, Natasha G.
Perri, Ami M.
Lu, Ruipeng
Laederach, Alain
Halvorsen, Matthew
Knoll, Joan H. M.
Rogan, Peter K.
A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
title A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
title_full A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
title_fullStr A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
title_full_unstemmed A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
title_short A unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
title_sort unified analytic framework for prioritization of non-coding variants of uncertain significance in heritable breast and ovarian cancer
topic Technical Advance
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4828881/
https://www.ncbi.nlm.nih.gov/pubmed/27067391
http://dx.doi.org/10.1186/s12920-016-0178-5
work_keys_str_mv AT mucakieliseosj aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT caminskynatashag aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT perriamim aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT luruipeng aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT laederachalain aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT halvorsenmatthew aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT knolljoanhm aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT roganpeterk aunifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT mucakieliseosj unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT caminskynatashag unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT perriamim unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT luruipeng unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT laederachalain unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT halvorsenmatthew unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT knolljoanhm unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer
AT roganpeterk unifiedanalyticframeworkforprioritizationofnoncodingvariantsofuncertainsignificanceinheritablebreastandovariancancer