Cargando…

Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing

BACKGROUND: Children with birth defects (BD) are more likely to develop cancer and the increased risk of cancer persists into adulthood. Prior population-based assessments have demonstrated that even non-chromosomal BDs are associated with at least two-fold increase of cancer risk. Identification of...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yichuan, Qu, Hui-Qi, Chang, Xiao, Mentch, Frank D, Qiu, Haijun, Nguyen, Kenny, Wang, Xiang, Saeidian, Amir Hossein, Watson, Deborah, Glessner, Joseph, Hakonarson, Hakon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670520/
https://www.ncbi.nlm.nih.gov/pubmed/36384586
http://dx.doi.org/10.1186/s40364-022-00431-y
_version_ 1784832351246745600
author Liu, Yichuan
Qu, Hui-Qi
Chang, Xiao
Mentch, Frank D
Qiu, Haijun
Nguyen, Kenny
Wang, Xiang
Saeidian, Amir Hossein
Watson, Deborah
Glessner, Joseph
Hakonarson, Hakon
author_facet Liu, Yichuan
Qu, Hui-Qi
Chang, Xiao
Mentch, Frank D
Qiu, Haijun
Nguyen, Kenny
Wang, Xiang
Saeidian, Amir Hossein
Watson, Deborah
Glessner, Joseph
Hakonarson, Hakon
author_sort Liu, Yichuan
collection PubMed
description BACKGROUND: Children with birth defects (BD) are more likely to develop cancer and the increased risk of cancer persists into adulthood. Prior population-based assessments have demonstrated that even non-chromosomal BDs are associated with at least two-fold increase of cancer risk. Identification of variants that are associated with malignant tumor in BD patients without chromosomal anomalies may improve our understanding of the underlying molecular mechanisms and provide clues for early cancer detection in children with BD. METHODS: In this study, whole genome sequencing (WGS) data of blood-derived DNA for 1653 individuals without chromosomal anomalies were acquired from the Kids First Data Resource Center (DRC), including 541 BD probands with at least one type of malignant tumors, 767 BD probands without malignant tumor, and 345 healthy family members who are the parents or siblings of the probands. Recurrent variants exclusively seen in cancer patients were selected and mapped to their corresponding genomic regions. The targeted genes/non-coding RNAs were further reduced using random forest and forward feature selection (ffs) models. RESULTS: The filtered genes/non-coding RNAs, including variants in non-coding areas, showed enrichment in cancer-related pathways. To further support the validity of these variants, blood WGS data of additional 40 independent BD probands, including 25 patients with at least one type of cancers from unrelated projects, were acquired. The counts of variants of interest identified in the Kid First data showed clear deviation in the validation dataset between BD patients with cancer and without cancer. Furthermore, a deep learning model was built to assess the predictive abilities in the 40 patients using variants of interest identified in the Kids First cohort as feature vectors. The accuracies are ~ 75%, with the noteworthy observation that variants mapped to non-coding regions provided the highest accuracy (31 out of 40 patients were labeled correctly). CONCLUSION: We present for the first time a panorama of genetic variants that are associated with cancers in non-chromosomal BD patients, implying that our approach may potentially serve for the early detection of malignant tumors in patients with BD. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40364-022-00431-y.
format Online
Article
Text
id pubmed-9670520
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-96705202022-11-18 Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing Liu, Yichuan Qu, Hui-Qi Chang, Xiao Mentch, Frank D Qiu, Haijun Nguyen, Kenny Wang, Xiang Saeidian, Amir Hossein Watson, Deborah Glessner, Joseph Hakonarson, Hakon Biomark Res Research BACKGROUND: Children with birth defects (BD) are more likely to develop cancer and the increased risk of cancer persists into adulthood. Prior population-based assessments have demonstrated that even non-chromosomal BDs are associated with at least two-fold increase of cancer risk. Identification of variants that are associated with malignant tumor in BD patients without chromosomal anomalies may improve our understanding of the underlying molecular mechanisms and provide clues for early cancer detection in children with BD. METHODS: In this study, whole genome sequencing (WGS) data of blood-derived DNA for 1653 individuals without chromosomal anomalies were acquired from the Kids First Data Resource Center (DRC), including 541 BD probands with at least one type of malignant tumors, 767 BD probands without malignant tumor, and 345 healthy family members who are the parents or siblings of the probands. Recurrent variants exclusively seen in cancer patients were selected and mapped to their corresponding genomic regions. The targeted genes/non-coding RNAs were further reduced using random forest and forward feature selection (ffs) models. RESULTS: The filtered genes/non-coding RNAs, including variants in non-coding areas, showed enrichment in cancer-related pathways. To further support the validity of these variants, blood WGS data of additional 40 independent BD probands, including 25 patients with at least one type of cancers from unrelated projects, were acquired. The counts of variants of interest identified in the Kid First data showed clear deviation in the validation dataset between BD patients with cancer and without cancer. Furthermore, a deep learning model was built to assess the predictive abilities in the 40 patients using variants of interest identified in the Kids First cohort as feature vectors. The accuracies are ~ 75%, with the noteworthy observation that variants mapped to non-coding regions provided the highest accuracy (31 out of 40 patients were labeled correctly). CONCLUSION: We present for the first time a panorama of genetic variants that are associated with cancers in non-chromosomal BD patients, implying that our approach may potentially serve for the early detection of malignant tumors in patients with BD. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40364-022-00431-y. BioMed Central 2022-11-16 /pmc/articles/PMC9670520/ /pubmed/36384586 http://dx.doi.org/10.1186/s40364-022-00431-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liu, Yichuan
Qu, Hui-Qi
Chang, Xiao
Mentch, Frank D
Qiu, Haijun
Nguyen, Kenny
Wang, Xiang
Saeidian, Amir Hossein
Watson, Deborah
Glessner, Joseph
Hakonarson, Hakon
Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
title Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
title_full Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
title_fullStr Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
title_full_unstemmed Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
title_short Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
title_sort identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670520/
https://www.ncbi.nlm.nih.gov/pubmed/36384586
http://dx.doi.org/10.1186/s40364-022-00431-y
work_keys_str_mv AT liuyichuan identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT quhuiqi identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT changxiao identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT mentchfrankd identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT qiuhaijun identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT nguyenkenny identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT wangxiang identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT saeidianamirhossein identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT watsondeborah identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT glessnerjoseph identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing
AT hakonarsonhakon identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing