Cargando…
Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data
A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict t...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719477/ https://www.ncbi.nlm.nih.gov/pubmed/31516310 http://dx.doi.org/10.1177/1176935119872163 |
_version_ | 1783447943491092480 |
---|---|
author | Bavafaye Haghighi, Elham Knudsen, Michael Elmedal Laursen, Britt Besenbacher, Søren |
author_facet | Bavafaye Haghighi, Elham Knudsen, Michael Elmedal Laursen, Britt Besenbacher, Søren |
author_sort | Bavafaye Haghighi, Elham |
collection | PubMed |
description | A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict the primary site can make treatment choice easier and enable targeted therapy. In this article, we first examine the ability to predict cancer type using different types of omics data. Methylation data lead to slightly better prediction than gene expression and both these are superior to classification using somatic mutations. After using 3 data types independently, we notice some differences between the classes that tend to be misclassified, suggesting that integrating the data might improve accuracy. In light of the different levels of information provided by different omics types and to be able to handle missing data, we perform multi-omics classification by hierarchically combining the classifiers. The proposed hierarchical method first classifies based on the most informative type of omics data and then uses the other types of omics data to classify samples that did not get a high confidence classification in the first step. The resulting hierarchical classifier has higher accuracy than any of the single omics classifiers and thus proves that the combination of different data types is beneficial. Our results show that using multi-omics data can improve the classification of cancer types. We confirm this by testing our method on metastatic cancers from the MET500 dataset. |
format | Online Article Text |
id | pubmed-6719477 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-67194772019-09-12 Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data Bavafaye Haghighi, Elham Knudsen, Michael Elmedal Laursen, Britt Besenbacher, Søren Cancer Inform Original Research A cancer of unknown primary (CUP) is a metastatic cancer for which standard diagnostic tests fail to locate the primary cancer. As standard treatments are based on the cancer type, such cases are hard to treat and have very poor prognosis. Using molecular data from the metastatic cancer to predict the primary site can make treatment choice easier and enable targeted therapy. In this article, we first examine the ability to predict cancer type using different types of omics data. Methylation data lead to slightly better prediction than gene expression and both these are superior to classification using somatic mutations. After using 3 data types independently, we notice some differences between the classes that tend to be misclassified, suggesting that integrating the data might improve accuracy. In light of the different levels of information provided by different omics types and to be able to handle missing data, we perform multi-omics classification by hierarchically combining the classifiers. The proposed hierarchical method first classifies based on the most informative type of omics data and then uses the other types of omics data to classify samples that did not get a high confidence classification in the first step. The resulting hierarchical classifier has higher accuracy than any of the single omics classifiers and thus proves that the combination of different data types is beneficial. Our results show that using multi-omics data can improve the classification of cancer types. We confirm this by testing our method on metastatic cancers from the MET500 dataset. SAGE Publications 2019-08-30 /pmc/articles/PMC6719477/ /pubmed/31516310 http://dx.doi.org/10.1177/1176935119872163 Text en © The Author(s) 2019 http://www.creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Bavafaye Haghighi, Elham Knudsen, Michael Elmedal Laursen, Britt Besenbacher, Søren Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data |
title | Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data |
title_full | Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data |
title_fullStr | Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data |
title_full_unstemmed | Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data |
title_short | Hierarchical Classification of Cancers of Unknown Primary Using Multi-Omics Data |
title_sort | hierarchical classification of cancers of unknown primary using multi-omics data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6719477/ https://www.ncbi.nlm.nih.gov/pubmed/31516310 http://dx.doi.org/10.1177/1176935119872163 |
work_keys_str_mv | AT bavafayehaghighielham hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata AT knudsenmichael hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata AT elmedallaursenbritt hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata AT besenbachersøren hierarchicalclassificationofcancersofunknownprimaryusingmultiomicsdata |