Cargando…
Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing
BACKGROUND: Children with birth defects (BD) are more likely to develop cancer and the increased risk of cancer persists into adulthood. Prior population-based assessments have demonstrated that even non-chromosomal BDs are associated with at least two-fold increase of cancer risk. Identification of...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670520/ https://www.ncbi.nlm.nih.gov/pubmed/36384586 http://dx.doi.org/10.1186/s40364-022-00431-y |
_version_ | 1784832351246745600 |
---|---|
author | Liu, Yichuan Qu, Hui-Qi Chang, Xiao Mentch, Frank D Qiu, Haijun Nguyen, Kenny Wang, Xiang Saeidian, Amir Hossein Watson, Deborah Glessner, Joseph Hakonarson, Hakon |
author_facet | Liu, Yichuan Qu, Hui-Qi Chang, Xiao Mentch, Frank D Qiu, Haijun Nguyen, Kenny Wang, Xiang Saeidian, Amir Hossein Watson, Deborah Glessner, Joseph Hakonarson, Hakon |
author_sort | Liu, Yichuan |
collection | PubMed |
description | BACKGROUND: Children with birth defects (BD) are more likely to develop cancer and the increased risk of cancer persists into adulthood. Prior population-based assessments have demonstrated that even non-chromosomal BDs are associated with at least two-fold increase of cancer risk. Identification of variants that are associated with malignant tumor in BD patients without chromosomal anomalies may improve our understanding of the underlying molecular mechanisms and provide clues for early cancer detection in children with BD. METHODS: In this study, whole genome sequencing (WGS) data of blood-derived DNA for 1653 individuals without chromosomal anomalies were acquired from the Kids First Data Resource Center (DRC), including 541 BD probands with at least one type of malignant tumors, 767 BD probands without malignant tumor, and 345 healthy family members who are the parents or siblings of the probands. Recurrent variants exclusively seen in cancer patients were selected and mapped to their corresponding genomic regions. The targeted genes/non-coding RNAs were further reduced using random forest and forward feature selection (ffs) models. RESULTS: The filtered genes/non-coding RNAs, including variants in non-coding areas, showed enrichment in cancer-related pathways. To further support the validity of these variants, blood WGS data of additional 40 independent BD probands, including 25 patients with at least one type of cancers from unrelated projects, were acquired. The counts of variants of interest identified in the Kid First data showed clear deviation in the validation dataset between BD patients with cancer and without cancer. Furthermore, a deep learning model was built to assess the predictive abilities in the 40 patients using variants of interest identified in the Kids First cohort as feature vectors. The accuracies are ~ 75%, with the noteworthy observation that variants mapped to non-coding regions provided the highest accuracy (31 out of 40 patients were labeled correctly). CONCLUSION: We present for the first time a panorama of genetic variants that are associated with cancers in non-chromosomal BD patients, implying that our approach may potentially serve for the early detection of malignant tumors in patients with BD. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40364-022-00431-y. |
format | Online Article Text |
id | pubmed-9670520 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-96705202022-11-18 Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing Liu, Yichuan Qu, Hui-Qi Chang, Xiao Mentch, Frank D Qiu, Haijun Nguyen, Kenny Wang, Xiang Saeidian, Amir Hossein Watson, Deborah Glessner, Joseph Hakonarson, Hakon Biomark Res Research BACKGROUND: Children with birth defects (BD) are more likely to develop cancer and the increased risk of cancer persists into adulthood. Prior population-based assessments have demonstrated that even non-chromosomal BDs are associated with at least two-fold increase of cancer risk. Identification of variants that are associated with malignant tumor in BD patients without chromosomal anomalies may improve our understanding of the underlying molecular mechanisms and provide clues for early cancer detection in children with BD. METHODS: In this study, whole genome sequencing (WGS) data of blood-derived DNA for 1653 individuals without chromosomal anomalies were acquired from the Kids First Data Resource Center (DRC), including 541 BD probands with at least one type of malignant tumors, 767 BD probands without malignant tumor, and 345 healthy family members who are the parents or siblings of the probands. Recurrent variants exclusively seen in cancer patients were selected and mapped to their corresponding genomic regions. The targeted genes/non-coding RNAs were further reduced using random forest and forward feature selection (ffs) models. RESULTS: The filtered genes/non-coding RNAs, including variants in non-coding areas, showed enrichment in cancer-related pathways. To further support the validity of these variants, blood WGS data of additional 40 independent BD probands, including 25 patients with at least one type of cancers from unrelated projects, were acquired. The counts of variants of interest identified in the Kid First data showed clear deviation in the validation dataset between BD patients with cancer and without cancer. Furthermore, a deep learning model was built to assess the predictive abilities in the 40 patients using variants of interest identified in the Kids First cohort as feature vectors. The accuracies are ~ 75%, with the noteworthy observation that variants mapped to non-coding regions provided the highest accuracy (31 out of 40 patients were labeled correctly). CONCLUSION: We present for the first time a panorama of genetic variants that are associated with cancers in non-chromosomal BD patients, implying that our approach may potentially serve for the early detection of malignant tumors in patients with BD. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40364-022-00431-y. BioMed Central 2022-11-16 /pmc/articles/PMC9670520/ /pubmed/36384586 http://dx.doi.org/10.1186/s40364-022-00431-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Liu, Yichuan Qu, Hui-Qi Chang, Xiao Mentch, Frank D Qiu, Haijun Nguyen, Kenny Wang, Xiang Saeidian, Amir Hossein Watson, Deborah Glessner, Joseph Hakonarson, Hakon Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
title | Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
title_full | Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
title_fullStr | Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
title_full_unstemmed | Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
title_short | Identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
title_sort | identification of risk variants related to malignant tumors in children with birth defects by whole genome sequencing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9670520/ https://www.ncbi.nlm.nih.gov/pubmed/36384586 http://dx.doi.org/10.1186/s40364-022-00431-y |
work_keys_str_mv | AT liuyichuan identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT quhuiqi identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT changxiao identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT mentchfrankd identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT qiuhaijun identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT nguyenkenny identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT wangxiang identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT saeidianamirhossein identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT watsondeborah identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT glessnerjoseph identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing AT hakonarsonhakon identificationofriskvariantsrelatedtomalignanttumorsinchildrenwithbirthdefectsbywholegenomesequencing |