
CUP-AI-Dx: A tool for inferring cancer tissue of origin and molecular subtype using RNA gene-expression data and artificial intelligence

Bibliographic Details
Main authors: Zhao, Yue, Pan, Ziwei, Namburi, Sandeep, Pattison, Andrew, Posner, Atara, Balachander, Shiva, Paisie, Carolyn A., Reddi, Honey V, Rueter, Jens, Gill, Anthony J, Fox, Stephen, Raghav, Kanwal P.S., Flynn, William F, Tothill, Richard W., Li, Sheng, Karuturi, R. Krishna Murthy, George, Joshy
Format: Online Article Text
Language: English
Published: Elsevier 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7553237/
https://www.ncbi.nlm.nih.gov/pubmed/33039710
http://dx.doi.org/10.1016/j.ebiom.2020.103030
collection PubMed
description BACKGROUND: Cancer of unknown primary (CUP), representing approximately 3-5% of all malignancies, is defined as metastatic cancer where a primary site of origin cannot be found despite a standard diagnostic workup. Because knowledge of a patient's primary cancer remains fundamental to their treatment, CUP patients are significantly disadvantaged and most have a poor survival outcome. Developing robust and accessible diagnostic methods for resolving cancer tissue of origin therefore has significant value for CUP patients.
METHODS: We developed an RNA-based classifier called CUP-AI-Dx that utilizes a 1D Inception convolutional neural network (1D-Inception) model to infer a tumour's primary tissue of origin. CUP-AI-Dx was trained using the transcriptional profiles of 18,217 primary tumours representing 32 cancer types from The Cancer Genome Atlas project (TCGA) and International Cancer Genome Consortium (ICGC). Gene expression data were ordered by gene chromosomal coordinates as input to the 1D-CNN model, and the model utilizes multiple convolutional kernels with different configurations simultaneously to improve generality. The model was optimised through extensive hyperparameter tuning, including different max-pooling layers and dropout settings. For 11 tumour types, we also developed a random forest model that can classify the tumour's molecular subtype according to prior TCGA studies. The optimised CUP-AI-Dx tissue of origin classifier was tested on 394 metastatic samples from 11 tumour types from TCGA and 92 formalin-fixed paraffin-embedded (FFPE) samples representing 18 cancer types from two clinical laboratories. The CUP-AI-Dx molecular subtype classifier was also tested on independent ovarian and breast cancer microarray datasets.
FINDINGS: CUP-AI-Dx identifies the primary site with an overall top-1 accuracy of 98.54% in cross-validation and 96.70% on a test dataset. When applied to two independent clinical-grade RNA-seq datasets generated at two different institutes in the US and Australia, our model predicted the primary site with a top-1 accuracy of 86.96% and 72.46%, respectively.
INTERPRETATION: CUP-AI-Dx predicts tumour primary site and molecular subtype with high accuracy and can therefore be used to assist the diagnostic work-up of cancers of unknown primary or uncertain origin using a common and accessible genomics platform.
FUNDING: NIH R35 GM133562, NCI P30 CA034196, Victorian Cancer Agency Australia.
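The abstract describes two steps that are easy to misread: expression values are arranged by chromosomal coordinate before being fed to the 1D network, and performance is reported as top-1 accuracy. The sketch below illustrates both under stated assumptions. It is not the authors' code: the gene names, coordinates, class scores, and the helper names `order_genes`, `to_input_vector`, and `top1_accuracy` are all hypothetical, and a real pipeline would take coordinates from a genome annotation.

```python
# Illustrative sketch only; all names and data here are hypothetical.

# Assumed chromosome ranking: autosomes 1-22, then X and Y.
CHROM_ORDER = {str(i): i for i in range(1, 23)}
CHROM_ORDER.update({"X": 23, "Y": 24})


def order_genes(coords):
    """Return gene names sorted by (chromosome rank, start position)."""
    return sorted(coords, key=lambda g: (CHROM_ORDER[coords[g][0]], coords[g][1]))


def to_input_vector(expression, gene_order):
    """Arrange one sample's expression values in the given gene order."""
    return [expression[g] for g in gene_order]


def top1_accuracy(scores, labels):
    """Fraction of samples whose highest-scoring class equals the true label."""
    hits = sum(1 for s, y in zip(scores, labels) if max(s, key=s.get) == y)
    return hits / len(labels)


# Toy gene coordinates: gene -> (chromosome, start position).
coords = {
    "GENE_C": ("2", 500),
    "GENE_A": ("1", 100),
    "GENE_B": ("1", 900),
    "GENE_D": ("X", 50),
}
gene_order = order_genes(coords)

# One toy sample, reordered into the 1D input layout.
sample = {"GENE_A": 1.5, "GENE_B": 0.0, "GENE_C": 7.2, "GENE_D": 3.1}
vector = to_input_vector(sample, gene_order)

# Toy per-class scores for three samples and their true labels.
scores = [
    {"breast": 0.7, "ovary": 0.3},
    {"breast": 0.4, "ovary": 0.6},
    {"breast": 0.9, "ovary": 0.1},
]
labels = ["breast", "ovary", "ovary"]
accuracy = top1_accuracy(scores, labels)
```

Ordering by coordinate gives neighbouring positions in the input vector a genomic meaning, which is what lets a sliding convolutional kernel see adjacent genes together. In the toy data two of three predictions match, so `accuracy` is 2/3.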
format Online
Article
Text
id pubmed-7553237
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-7553237 2020-10-19 EBioMedicine Research Paper Elsevier 2020-10-09 /pmc/articles/PMC7553237/ /pubmed/33039710 http://dx.doi.org/10.1016/j.ebiom.2020.103030 Text en © 2020 The Authors. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
title CUP-AI-Dx: A tool for inferring cancer tissue of origin and molecular subtype using RNA gene-expression data and artificial intelligence
topic Research Paper