Cargando…

A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration

Cancer of unknown primary site (CUPS) is a type of metastatic tumor for which the sites of tumor origin cannot be determined. Precise diagnosis of the tissue origin for metastatic CUPS is crucial for developing treatment schemes to improve patient prognosis. Recently, there have been many studies us...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Ying, Wang, Haifeng, Yang, Jialiang, Li, Xiong, Dai, Chan, Shao, Peng, Tian, Geng, Wang, Bo, Wang, Yinglong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7419421/
https://www.ncbi.nlm.nih.gov/pubmed/32850687
http://dx.doi.org/10.3389/fbioe.2020.00701
_version_ 1783569880325292032
author Liang, Ying
Wang, Haifeng
Yang, Jialiang
Li, Xiong
Dai, Chan
Shao, Peng
Tian, Geng
Wang, Bo
Wang, Yinglong
author_facet Liang, Ying
Wang, Haifeng
Yang, Jialiang
Li, Xiong
Dai, Chan
Shao, Peng
Tian, Geng
Wang, Bo
Wang, Yinglong
author_sort Liang, Ying
collection PubMed
description Cancer of unknown primary site (CUPS) is a type of metastatic tumor for which the sites of tumor origin cannot be determined. Precise diagnosis of the tissue origin for metastatic CUPS is crucial for developing treatment schemes to improve patient prognosis. Recently, there have been many studies using various cancer biomarkers to predict the tissue-of-origin (TOO) of CUPS. However, only a very few of them use copy number alteration (CNA) to trance TOO. In this paper, a two-step computational framework called CNA_origin is introduced to predict the tissue-of-origin of a tumor from its gene CNA levels. CNA_origin set up an intellectual deep-learning network mainly composed of an autoencoder and a convolution neural network (CNN). Based on real datasets released from the public database, CNA_origin had an overall accuracy of 83.81% on 10-fold cross-validation and 79% on independent datasets for predicting tumor origin, which improved the accuracy by 7.75 and 9.72% compared with the method published in a previous paper. Our results suggested that the autoencoder model can extract key characteristics of CNA and that the CNN classifier model developed in this study can predict the origin of tumors robustly and effectively. CNA_origin was written in Python and can be downloaded from https://github.com/YingLianghnu/CNA_origin.
format Online
Article
Text
id pubmed-7419421
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-74194212020-08-25 A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration Liang, Ying Wang, Haifeng Yang, Jialiang Li, Xiong Dai, Chan Shao, Peng Tian, Geng Wang, Bo Wang, Yinglong Front Bioeng Biotechnol Bioengineering and Biotechnology Cancer of unknown primary site (CUPS) is a type of metastatic tumor for which the sites of tumor origin cannot be determined. Precise diagnosis of the tissue origin for metastatic CUPS is crucial for developing treatment schemes to improve patient prognosis. Recently, there have been many studies using various cancer biomarkers to predict the tissue-of-origin (TOO) of CUPS. However, only a very few of them use copy number alteration (CNA) to trance TOO. In this paper, a two-step computational framework called CNA_origin is introduced to predict the tissue-of-origin of a tumor from its gene CNA levels. CNA_origin set up an intellectual deep-learning network mainly composed of an autoencoder and a convolution neural network (CNN). Based on real datasets released from the public database, CNA_origin had an overall accuracy of 83.81% on 10-fold cross-validation and 79% on independent datasets for predicting tumor origin, which improved the accuracy by 7.75 and 9.72% compared with the method published in a previous paper. Our results suggested that the autoencoder model can extract key characteristics of CNA and that the CNN classifier model developed in this study can predict the origin of tumors robustly and effectively. CNA_origin was written in Python and can be downloaded from https://github.com/YingLianghnu/CNA_origin. Frontiers Media S.A. 2020-08-05 /pmc/articles/PMC7419421/ /pubmed/32850687 http://dx.doi.org/10.3389/fbioe.2020.00701 Text en Copyright © 2020 Liang, Wang, Yang, Li, Dai, Shao, Tian, Wang and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Bioengineering and Biotechnology
Liang, Ying
Wang, Haifeng
Yang, Jialiang
Li, Xiong
Dai, Chan
Shao, Peng
Tian, Geng
Wang, Bo
Wang, Yinglong
A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration
title A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration
title_full A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration
title_fullStr A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration
title_full_unstemmed A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration
title_short A Deep Learning Framework to Predict Tumor Tissue-of-Origin Based on Copy Number Alteration
title_sort deep learning framework to predict tumor tissue-of-origin based on copy number alteration
topic Bioengineering and Biotechnology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7419421/
https://www.ncbi.nlm.nih.gov/pubmed/32850687
http://dx.doi.org/10.3389/fbioe.2020.00701
work_keys_str_mv AT liangying adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT wanghaifeng adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT yangjialiang adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT lixiong adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT daichan adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT shaopeng adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT tiangeng adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT wangbo adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT wangyinglong adeeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT liangying deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT wanghaifeng deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT yangjialiang deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT lixiong deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT daichan deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT shaopeng deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT tiangeng deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT wangbo deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration
AT wangyinglong deeplearningframeworktopredicttumortissueoforiginbasedoncopynumberalteration