Cargando…

Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma

BACKGROUND: DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-s...

Descripción completa

Detalles Bibliográficos
Autores principales: Tao, Kaishan, Bian, Zhenyuan, Zhang, Qiong, Guo, Xu, Yin, Chun, Wang, Yang, Zhou, Kaixiang, Wan, Shaogui, Shi, Meifang, Bao, Dengke, Yang, Chuhu, Xing, Jinliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276513/
https://www.ncbi.nlm.nih.gov/pubmed/32512514
http://dx.doi.org/10.1016/j.ebiom.2020.102811
_version_ 1783542966086795264
author Tao, Kaishan
Bian, Zhenyuan
Zhang, Qiong
Guo, Xu
Yin, Chun
Wang, Yang
Zhou, Kaixiang
Wan, Shaogui
Shi, Meifang
Bao, Dengke
Yang, Chuhu
Xing, Jinliang
author_facet Tao, Kaishan
Bian, Zhenyuan
Zhang, Qiong
Guo, Xu
Yin, Chun
Wang, Yang
Zhou, Kaixiang
Wan, Shaogui
Shi, Meifang
Bao, Dengke
Yang, Chuhu
Xing, Jinliang
author_sort Tao, Kaishan
collection PubMed
description BACKGROUND: DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-stage HCCs. METHODS: We conducted low-depth whole-genome sequencing (WGS) to profile SCNAs in 384 plasma samples of hepatitis B virus (HBV)-related HCC and cancer-free HBV patients, using one discovery and two validation cohorts. To fully capture the robust signals of WGS data from the complete genome, we developed a machine learning-based statistical model that is focused on detection accuracy in early-stage HCC. FINDINGS: We built the model using a discovery cohort of 209 patients, achieving an overall area under curve (AUC) of 0.893, with 0.874 for early-stage (Barcelona clinical liver cancer [BCLC] stage 0-A) and 0.933 for advanced-stage (BCLC stage B-D). The performance of the model was then assessed in two validation cohorts (76 and 99 patients) that only consisted of patients with stage 0-A HCC. Our model exhibited a robust predictive performance, with an AUC of 0.920 and 0.812 for the two validation cohorts. Further analyses showed the impact of tumor sample heterogeneity in model training on detecting early-stage tumors, and a refined model addressing the heterogeneity in the discovery cohort significantly increased model performance in validation. INTERPRETATION: We developed an SCNA-based, machine learning-driven model in the non-invasive detection of early-stage HCC in HBV patients and demonstrated its performance through strict independent validations.
format Online
Article
Text
id pubmed-7276513
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-72765132020-06-10 Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma Tao, Kaishan Bian, Zhenyuan Zhang, Qiong Guo, Xu Yin, Chun Wang, Yang Zhou, Kaixiang Wan, Shaogui Shi, Meifang Bao, Dengke Yang, Chuhu Xing, Jinliang EBioMedicine Research paper BACKGROUND: DNAs released from tumor cells into blood (circulating tumor DNAs, ctDNAs) carry tumor-specific genomic aberrations, providing a non-invasive means for cancer detection. In this study, we aimed to leverage somatic copy number aberration (SCNA) in ctDNA to develop assays to detect early-stage HCCs. METHODS: We conducted low-depth whole-genome sequencing (WGS) to profile SCNAs in 384 plasma samples of hepatitis B virus (HBV)-related HCC and cancer-free HBV patients, using one discovery and two validation cohorts. To fully capture the robust signals of WGS data from the complete genome, we developed a machine learning-based statistical model that is focused on detection accuracy in early-stage HCC. FINDINGS: We built the model using a discovery cohort of 209 patients, achieving an overall area under curve (AUC) of 0.893, with 0.874 for early-stage (Barcelona clinical liver cancer [BCLC] stage 0-A) and 0.933 for advanced-stage (BCLC stage B-D). The performance of the model was then assessed in two validation cohorts (76 and 99 patients) that only consisted of patients with stage 0-A HCC. Our model exhibited a robust predictive performance, with an AUC of 0.920 and 0.812 for the two validation cohorts. Further analyses showed the impact of tumor sample heterogeneity in model training on detecting early-stage tumors, and a refined model addressing the heterogeneity in the discovery cohort significantly increased model performance in validation. INTERPRETATION: We developed an SCNA-based, machine learning-driven model in the non-invasive detection of early-stage HCC in HBV patients and demonstrated its performance through strict independent validations. Elsevier 2020-06-05 /pmc/articles/PMC7276513/ /pubmed/32512514 http://dx.doi.org/10.1016/j.ebiom.2020.102811 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research paper
Tao, Kaishan
Bian, Zhenyuan
Zhang, Qiong
Guo, Xu
Yin, Chun
Wang, Yang
Zhou, Kaixiang
Wan, Shaogui
Shi, Meifang
Bao, Dengke
Yang, Chuhu
Xing, Jinliang
Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
title Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
title_full Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
title_fullStr Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
title_full_unstemmed Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
title_short Machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor DNA for early detection of hepatocellular carcinoma
title_sort machine learning-based genome-wide interrogation of somatic copy number aberrations in circulating tumor dna for early detection of hepatocellular carcinoma
topic Research paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7276513/
https://www.ncbi.nlm.nih.gov/pubmed/32512514
http://dx.doi.org/10.1016/j.ebiom.2020.102811
work_keys_str_mv AT taokaishan machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT bianzhenyuan machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT zhangqiong machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT guoxu machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT yinchun machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT wangyang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT zhoukaixiang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT wanshaogui machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT shimeifang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT baodengke machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT yangchuhu machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma
AT xingjinliang machinelearningbasedgenomewideinterrogationofsomaticcopynumberaberrationsincirculatingtumordnaforearlydetectionofhepatocellularcarcinoma