Cargando…

Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer

BACKGROUND: Pediatric high-grade gliomas (pHGGs) are incurable malignant brain cancers. Clear somatic genetic drivers are difficult to identify in the majority of cases. We hypothesized that this may be due to the existence of germline variants that influence tumor etiology and/or progression and ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Bobyn, Anna, Zarrei, Mehdi, Zhu, Yuankun, Hoffman, Mary, Brenner, Darren, Resnick, Adam C., Scherer, Stephen W., Gallo, Marco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7201963/
https://www.ncbi.nlm.nih.gov/pubmed/32375678
http://dx.doi.org/10.1186/s12881-020-01033-x
_version_ 1783529644068175872
author Bobyn, Anna
Zarrei, Mehdi
Zhu, Yuankun
Hoffman, Mary
Brenner, Darren
Resnick, Adam C.
Scherer, Stephen W.
Gallo, Marco
author_facet Bobyn, Anna
Zarrei, Mehdi
Zhu, Yuankun
Hoffman, Mary
Brenner, Darren
Resnick, Adam C.
Scherer, Stephen W.
Gallo, Marco
author_sort Bobyn, Anna
collection PubMed
description BACKGROUND: Pediatric high-grade gliomas (pHGGs) are incurable malignant brain cancers. Clear somatic genetic drivers are difficult to identify in the majority of cases. We hypothesized that this may be due to the existence of germline variants that influence tumor etiology and/or progression and are filtered out using traditional pipelines for somatic mutation calling. METHODS: In this study, we analyzed whole-genome sequencing (WGS) datasets of matched germlines and tumor tissues to identify recurrent germline variants in pHGG patients. RESULTS: We identified two structural variants that were highly recurrent in a discovery cohort of 8 pHGG patients. One was a ~ 40 kb deletion immediately upstream of the NEGR1 locus and predicted to remove the promoter region of this gene. This copy number variant (CNV) was present in all patients in our discovery cohort (n = 8) and in 86.3% of patients in our validation cohort (n = 73 cases). We also identified a second recurrent deletion 55.7 kb in size affecting the BTNL3 and BTNL8 loci. This BTNL3–8 deletion was observed in 62.5% patients in our discovery cohort, and in 17.8% of the patients in the validation cohort. Our single-cell RNA sequencing (scRNA-seq) data showed that both deletions result in disruption of transcription of the affected genes. However, analysis of genomic information from multiple non-cancer cohorts showed that both the NEGR1 promoter deletion and the BTNL3–8 deletion were CNVs occurring at high frequencies in the general population. Intriguingly, the upstream NEGR1 CNV deletion was homozygous in ~ 40% of individuals in the non-cancer population. This finding was immediately relevant because the affected genes have important physiological functions, and our analyses showed that NEGR1 expression levels have prognostic value for pHGG patient survival. We also found that these deletions occurred at different frequencies among different ethnic groups. CONCLUSIONS: Our study highlights the need to integrate cancer genomic analyses and genomic data from large control populations. Failure to do so may lead to spurious association of genes with cancer etiology. Importantly, our results showcase the need for careful evaluation of differences in the frequency of genetic variants among different ethnic groups.
format Online
Article
Text
id pubmed-7201963
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72019632020-05-09 Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer Bobyn, Anna Zarrei, Mehdi Zhu, Yuankun Hoffman, Mary Brenner, Darren Resnick, Adam C. Scherer, Stephen W. Gallo, Marco BMC Med Genet Research Article BACKGROUND: Pediatric high-grade gliomas (pHGGs) are incurable malignant brain cancers. Clear somatic genetic drivers are difficult to identify in the majority of cases. We hypothesized that this may be due to the existence of germline variants that influence tumor etiology and/or progression and are filtered out using traditional pipelines for somatic mutation calling. METHODS: In this study, we analyzed whole-genome sequencing (WGS) datasets of matched germlines and tumor tissues to identify recurrent germline variants in pHGG patients. RESULTS: We identified two structural variants that were highly recurrent in a discovery cohort of 8 pHGG patients. One was a ~ 40 kb deletion immediately upstream of the NEGR1 locus and predicted to remove the promoter region of this gene. This copy number variant (CNV) was present in all patients in our discovery cohort (n = 8) and in 86.3% of patients in our validation cohort (n = 73 cases). We also identified a second recurrent deletion 55.7 kb in size affecting the BTNL3 and BTNL8 loci. This BTNL3–8 deletion was observed in 62.5% patients in our discovery cohort, and in 17.8% of the patients in the validation cohort. Our single-cell RNA sequencing (scRNA-seq) data showed that both deletions result in disruption of transcription of the affected genes. However, analysis of genomic information from multiple non-cancer cohorts showed that both the NEGR1 promoter deletion and the BTNL3–8 deletion were CNVs occurring at high frequencies in the general population. Intriguingly, the upstream NEGR1 CNV deletion was homozygous in ~ 40% of individuals in the non-cancer population. This finding was immediately relevant because the affected genes have important physiological functions, and our analyses showed that NEGR1 expression levels have prognostic value for pHGG patient survival. We also found that these deletions occurred at different frequencies among different ethnic groups. CONCLUSIONS: Our study highlights the need to integrate cancer genomic analyses and genomic data from large control populations. Failure to do so may lead to spurious association of genes with cancer etiology. Importantly, our results showcase the need for careful evaluation of differences in the frequency of genetic variants among different ethnic groups. BioMed Central 2020-05-06 /pmc/articles/PMC7201963/ /pubmed/32375678 http://dx.doi.org/10.1186/s12881-020-01033-x Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Bobyn, Anna
Zarrei, Mehdi
Zhu, Yuankun
Hoffman, Mary
Brenner, Darren
Resnick, Adam C.
Scherer, Stephen W.
Gallo, Marco
Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
title Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
title_full Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
title_fullStr Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
title_full_unstemmed Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
title_short Ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
title_sort ancestry and frequency of genetic variants in the general population are confounders in the characterization of germline variants linked to cancer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7201963/
https://www.ncbi.nlm.nih.gov/pubmed/32375678
http://dx.doi.org/10.1186/s12881-020-01033-x
work_keys_str_mv AT bobynanna ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT zarreimehdi ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT zhuyuankun ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT hoffmanmary ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT brennerdarren ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT resnickadamc ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT schererstephenw ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer
AT gallomarco ancestryandfrequencyofgeneticvariantsinthegeneralpopulationareconfoundersinthecharacterizationofgermlinevariantslinkedtocancer