Cargando…

Widespread redundancy in -omics profiles of cancer mutation states

BACKGROUND: In studies of cellular function in cancer, researchers are increasingly able to choose from many -omics assays as functional readouts. Choosing the correct readout for a given study can be difficult, and which layer of cellular function is most suitable to capture the relevant signal rem...

Descripción completa

Detalles Bibliográficos
Autores principales:	Crawford, Jake, Christensen, Brock C., Chikina, Maria, Greene, Casey S.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9238138/ https://www.ncbi.nlm.nih.gov/pubmed/35761387 http://dx.doi.org/10.1186/s13059-022-02705-y

_version_	1784736964520443904
author	Crawford, Jake Christensen, Brock C. Chikina, Maria Greene, Casey S.
author_facet	Crawford, Jake Christensen, Brock C. Chikina, Maria Greene, Casey S.
author_sort	Crawford, Jake
collection	PubMed
description	BACKGROUND: In studies of cellular function in cancer, researchers are increasingly able to choose from many -omics assays as functional readouts. Choosing the correct readout for a given study can be difficult, and which layer of cellular function is most suitable to capture the relevant signal remains unclear. RESULTS: We consider prediction of cancer mutation status (presence or absence) from functional -omics data as a representative problem that presents an opportunity to quantify and compare the ability of different -omics readouts to capture signals of dysregulation in cancer. From the TCGA Pan-Cancer Atlas that contains genetic alteration data, we focus on RNA sequencing, DNA methylation arrays, reverse phase protein arrays (RPPA), microRNA, and somatic mutational signatures as -omics readouts. Across a collection of genes recurrently mutated in cancer, RNA sequencing tends to be the most effective predictor of mutation state. We find that one or more other data types for many of the genes are approximately equally effective predictors. Performance is more variable between mutations than that between data types for the same mutation, and there is little difference between the top data types. We also find that combining data types into a single multi-omics model provides little or no improvement in predictive ability over the best individual data type. CONCLUSIONS: Based on our results, for the design of studies focused on the functional outcomes of cancer mutations, there are often multiple -omics types that can serve as effective readouts, although gene expression seems to be a reasonable default option. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02705-y.
format	Online Article Text
id	pubmed-9238138
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-92381382022-06-29 Widespread redundancy in -omics profiles of cancer mutation states Crawford, Jake Christensen, Brock C. Chikina, Maria Greene, Casey S. Genome Biol Research BACKGROUND: In studies of cellular function in cancer, researchers are increasingly able to choose from many -omics assays as functional readouts. Choosing the correct readout for a given study can be difficult, and which layer of cellular function is most suitable to capture the relevant signal remains unclear. RESULTS: We consider prediction of cancer mutation status (presence or absence) from functional -omics data as a representative problem that presents an opportunity to quantify and compare the ability of different -omics readouts to capture signals of dysregulation in cancer. From the TCGA Pan-Cancer Atlas that contains genetic alteration data, we focus on RNA sequencing, DNA methylation arrays, reverse phase protein arrays (RPPA), microRNA, and somatic mutational signatures as -omics readouts. Across a collection of genes recurrently mutated in cancer, RNA sequencing tends to be the most effective predictor of mutation state. We find that one or more other data types for many of the genes are approximately equally effective predictors. Performance is more variable between mutations than that between data types for the same mutation, and there is little difference between the top data types. We also find that combining data types into a single multi-omics model provides little or no improvement in predictive ability over the best individual data type. CONCLUSIONS: Based on our results, for the design of studies focused on the functional outcomes of cancer mutations, there are often multiple -omics types that can serve as effective readouts, although gene expression seems to be a reasonable default option. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02705-y. BioMed Central 2022-06-27 /pmc/articles/PMC9238138/ /pubmed/35761387 http://dx.doi.org/10.1186/s13059-022-02705-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research Crawford, Jake Christensen, Brock C. Chikina, Maria Greene, Casey S. Widespread redundancy in -omics profiles of cancer mutation states
title	Widespread redundancy in -omics profiles of cancer mutation states
title_full	Widespread redundancy in -omics profiles of cancer mutation states
title_fullStr	Widespread redundancy in -omics profiles of cancer mutation states
title_full_unstemmed	Widespread redundancy in -omics profiles of cancer mutation states
title_short	Widespread redundancy in -omics profiles of cancer mutation states
title_sort	widespread redundancy in -omics profiles of cancer mutation states
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9238138/ https://www.ncbi.nlm.nih.gov/pubmed/35761387 http://dx.doi.org/10.1186/s13059-022-02705-y
work_keys_str_mv	AT crawfordjake widespreadredundancyinomicsprofilesofcancermutationstates AT christensenbrockc widespreadredundancyinomicsprofilesofcancermutationstates AT chikinamaria widespreadredundancyinomicsprofilesofcancermutationstates AT greenecaseys widespreadredundancyinomicsprofilesofcancermutationstates

Widespread redundancy in -omics profiles of cancer mutation states

Ejemplares similares