Cargando…
Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
BACKGROUND: DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-s...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276513/ https://www.ncbi.nlm.nih.gov/pubmed/32512514 http://dx.doi.org/10.1016/j.ebiom.2020.102811 |
_version_ | 1783542966086795264 |
---|---|
author | Tao, Kaishan Bian, Zhenyuan Zhang, Qiong Guo, Xu Yin, Chun Wang, Yang Zhou, Kaixiang Wan, Shaogui Shi, Meifang Bao, Dengke Yang, Chuhu Xing, Jinliang |
author_facet | Tao, Kaishan Bian, Zhenyuan Zhang, Qiong Guo, Xu Yin, Chun Wang, Yang Zhou, Kaixiang Wan, Shaogui Shi, Meifang Bao, Dengke Yang, Chuhu Xing, Jinliang |
author_sort | Tao, Kaishan |
collection | PubMed |
description | BACKGROUND: DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-stage HCCs. METHODS: We conducted low-depth whole-genome sequencing (WGS) to profile SCNAs in 384 plasma samples of hepatitis B virus (HBV)-related HCC and cancer-free HBV patients, using one discovery and two validation cohorts. To fully capture the robust signals of WGS data from the complete genome, we developed a machine learning-based statistical model that is focused on detection accuracy in early-stage HCC. FINDINGS: We built the model using a discovery cohort of 209 patients, achieving an overall area under curve (AUC) of 0.893, with 0.874 for early-stage (Barcelona clinical liver cancer [BCLC] stage 0-A) and 0.933 for advanced-stage (BCLC stage B-D). The performance of the model was then assessed in two validation cohorts (76 and 99 patients) that only consisted of patients with stage 0-A HCC. Our model exhibited a robust predictive performance, with an AUC of 0.920 and 0.812 for the two validation cohorts. Further analyses showed the impact of tumor sample heterogeneity in model training on detecting early-stage tumors, and a refined model addressing the heterogeneity in the discovery cohort significantly increased model performance in validation. INTERPRETATION: We developed an SCNA-based, machine learning-driven model in the non-invasive detection of early-stage HCC in HBV patients and demonstrated its performance through strict independent validations. |
format | Online Article Text |
id | pubmed-7276513 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-72765132020-06-10 Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma Tao, Kaishan Bian, Zhenyuan Zhang, Qiong Guo, Xu Yin, Chun Wang, Yang Zhou, Kaixiang Wan, Shaogui Shi, Meifang Bao, Dengke Yang, Chuhu Xing, Jinliang EBioMedicine Research paper BACKGROUND: DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-stage HCCs. METHODS: We conducted low-depth whole-genome sequencing (WGS) to profile SCNAs in 384 plasma samples of hepatitis B virus (HBV)-related HCC and cancer-free HBV patients, using one discovery and two validation cohorts. To fully capture the robust signals of WGS data from the complete genome, we developed a machine learning-based statistical model that is focused on detection accuracy in early-stage HCC. FINDINGS: We built the model using a discovery cohort of 209 patients, achieving an overall area under curve (AUC) of 0.893, with 0.874 for early-stage (Barcelona clinical liver cancer [BCLC] stage 0-A) and 0.933 for advanced-stage (BCLC stage B-D). The performance of the model was then assessed in two validation cohorts (76 and 99 patients) that only consisted of patients with stage 0-A HCC. Our model exhibited a robust predictive performance, with an AUC of 0.920 and 0.812 for the two validation cohorts. Further analyses showed the impact of tumor sample heterogeneity in model training on detecting early-stage tumors, and a refined model addressing the heterogeneity in the discovery cohort significantly increased model performance in validation. INTERPRETATION: We developed an SCNA-based, machine learning-driven model in the non-invasive detection of early-stage HCC in HBV patients and demonstrated its performance through strict independent validations. Elsevier 2020-06-05 /pmc/articles/PMC7276513/ /pubmed/32512514 http://dx.doi.org/10.1016/j.ebiom.2020.102811 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research paper Tao, Kaishan Bian, Zhenyuan Zhang, Qiong Guo, Xu Yin, Chun Wang, Yang Zhou, Kaixiang Wan, Shaogui Shi, Meifang Bao, Dengke Yang, Chuhu Xing, Jinliang Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma |
title | Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma |
title_full | Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma |
title_fullStr | Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma |
title_full_unstemmed | Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma |
title_short | Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma |
title_sort | machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor dna for early detection of hepatocellular carcinoma |
topic | Research paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276513/ https://www.ncbi.nlm.nih.gov/pubmed/32512514 http://dx.doi.org/10.1016/j.ebiom.2020.102811 |
work_keys_str_mv | AT taokaishan machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT bianzhenyuan machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT zhangqiong machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT guoxu machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT yinchun machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT wangyang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT zhoukaixiang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT wanshaogui machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT shimeifang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT baodengke machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT yangchuhu machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma AT xingjinliang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma |