Cargando…

Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data

A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict t...

Descripción completa

Detalles Bibliográficos
Autores principales: Bavafaye Haghighi, Elham, Knudsen, Michael, Elmedal Laursen, Britt, Besenbacher, Søren
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719477/
https://www.ncbi.nlm.nih.gov/pubmed/31516310
http://dx.doi.org/10.1177/1176935119872163
_version_ 1783447943491092480
author Bavafaye Haghighi, Elham
Knudsen, Michael
Elmedal Laursen, Britt
Besenbacher, Søren
author_facet Bavafaye Haghighi, Elham
Knudsen, Michael
Elmedal Laursen, Britt
Besenbacher, Søren
author_sort Bavafaye Haghighi, Elham
collection PubMed
description A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict the primary site can make treatment choice easier and enable targeted therapy. In this article, we first examine the ability to predict cancer type using different types of omics data. Methylation data lead to slightly better prediction than gene expression and both these are superior to classification using somatic mutations. After using 3 data types independently, we notice some differences between the classes that tend to be misclassified, suggesting that integrating the data might improve accuracy. In light of the different levels of information provided by different omics types and to be able to handle missing data, we perform multi-omics classification by hierarchically combining the classifiers. The proposed hierarchical method first classifies based on the most informative type of omics data and then uses the other types of omics data to classify samples that did not get a high confidence classification in the first step. The resulting hierarchical classifier has higher accuracy than any of the single omics classifiers and thus proves that the combination of different data types is beneficial. Our results show that using multi-omics data can improve the classification of cancer types. We confirm this by testing our method on metastatic cancers from the MET500 dataset.
format Online
Article
Text
id pubmed-6719477
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-67194772019-09-12 Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data Bavafaye Haghighi, Elham Knudsen, Michael Elmedal Laursen, Britt Besenbacher, Søren Cancer Inform Original Research A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict the primary site can make treatment choice easier and enable targeted therapy. In this article, we first examine the ability to predict cancer type using different types of omics data. Methylation data lead to slightly better prediction than gene expression and both these are superior to classification using somatic mutations. After using 3 data types independently, we notice some differences between the classes that tend to be misclassified, suggesting that integrating the data might improve accuracy. In light of the different levels of information provided by different omics types and to be able to handle missing data, we perform multi-omics classification by hierarchically combining the classifiers. The proposed hierarchical method first classifies based on the most informative type of omics data and then uses the other types of omics data to classify samples that did not get a high confidence classification in the first step. The resulting hierarchical classifier has higher accuracy than any of the single omics classifiers and thus proves that the combination of different data types is beneficial. Our results show that using multi-omics data can improve the classification of cancer types. We confirm this by testing our method on metastatic cancers from the MET500 dataset. SAGE Publications 2019-08-30 /pmc/articles/PMC6719477/ /pubmed/31516310 http://dx.doi.org/10.1177/1176935119872163 Text en © The Author(s) 2019 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Bavafaye Haghighi, Elham
Knudsen, Michael
Elmedal Laursen, Britt
Besenbacher, Søren
Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
title Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
title_full Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
title_fullStr Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
title_full_unstemmed Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
title_short Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
title_sort hierarchical classification of cancers of unknown primary using multi-omics data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719477/
https://www.ncbi.nlm.nih.gov/pubmed/31516310
http://dx.doi.org/10.1177/1176935119872163
work_keys_str_mv AT bavafayehaghighielham hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata
AT knudsenmichael hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata
AT elmedallaursenbritt hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata
AT besenbachersøren hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata