Cargando…

Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes

The last several years have witnessed an explosion of methods and applications for combining image data with 'omics data, and for prediction of clinical phenotypes. Much of this research has focused on cancer histology, for which genetic perturbations are large, and the signal to noise ratio is...

Descripción completa

Detalles Bibliográficos
Autores principales: Gallins, Paul, Saghapour, Ehsan, Zhou, Yi-Hui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644963/
https://www.ncbi.nlm.nih.gov/pubmed/33193632
http://dx.doi.org/10.3389/fgene.2020.555886
_version_ 1783606564593074176
author Gallins, Paul
Saghapour, Ehsan
Zhou, Yi-Hui
author_facet Gallins, Paul
Saghapour, Ehsan
Zhou, Yi-Hui
author_sort Gallins, Paul
collection PubMed
description The last several years have witnessed an explosion of methods and applications for combining image data with 'omics data, and for prediction of clinical phenotypes. Much of this research has focused on cancer histology, for which genetic perturbations are large, and the signal to noise ratio is high. Related research on chronic, complex diseases is limited by tissue sample availability, lower genomic signal strength, and the less extreme and tissue-specific nature of intermediate histological phenotypes. Data from the GTEx Consortium provides a unique opportunity to investigate the connections among phenotypic histological variation, imaging data, and 'omics profiling, from multiple tissue-specific phenotypes at the sub-clinical level. Investigating histological designations in multiple tissues, we survey the evidence for genomic association and prediction of histology, and use the results to test the limits of prediction accuracy using machine learning methods applied to the imaging data, genomics data, and their combination. We find that expression data has similar or superior accuracy for pathology prediction as our use of imaging data, despite the fact that pathological determination is made from the images themselves. A variety of machine learning methods have similar performance, while network embedding methods offer at best limited improvements. These observations hold across a range of tissues and predictor types. The results are supportive of the use of genomic measurements for prediction, and in using the same target tissue in which pathological phenotyping has been performed. Although this last finding is sensible, to our knowledge our study is the first to demonstrate this fact empirically. Even while prediction accuracy remains a challenge, the results show clear evidence of pathway and tissue-specific biology.
format Online
Article
Text
id pubmed-7644963
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-76449632020-11-13 Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes Gallins, Paul Saghapour, Ehsan Zhou, Yi-Hui Front Genet Genetics The last several years have witnessed an explosion of methods and applications for combining image data with 'omics data, and for prediction of clinical phenotypes. Much of this research has focused on cancer histology, for which genetic perturbations are large, and the signal to noise ratio is high. Related research on chronic, complex diseases is limited by tissue sample availability, lower genomic signal strength, and the less extreme and tissue-specific nature of intermediate histological phenotypes. Data from the GTEx Consortium provides a unique opportunity to investigate the connections among phenotypic histological variation, imaging data, and 'omics profiling, from multiple tissue-specific phenotypes at the sub-clinical level. Investigating histological designations in multiple tissues, we survey the evidence for genomic association and prediction of histology, and use the results to test the limits of prediction accuracy using machine learning methods applied to the imaging data, genomics data, and their combination. We find that expression data has similar or superior accuracy for pathology prediction as our use of imaging data, despite the fact that pathological determination is made from the images themselves. A variety of machine learning methods have similar performance, while network embedding methods offer at best limited improvements. These observations hold across a range of tissues and predictor types. The results are supportive of the use of genomic measurements for prediction, and in using the same target tissue in which pathological phenotyping has been performed. Although this last finding is sensible, to our knowledge our study is the first to demonstrate this fact empirically. Even while prediction accuracy remains a challenge, the results show clear evidence of pathway and tissue-specific biology. Frontiers Media S.A. 2020-10-23 /pmc/articles/PMC7644963/ /pubmed/33193632 http://dx.doi.org/10.3389/fgene.2020.555886 Text en Copyright © 2020 Gallins, Saghapour and Zhou. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Gallins, Paul
Saghapour, Ehsan
Zhou, Yi-Hui
Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes
title Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes
title_full Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes
title_fullStr Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes
title_full_unstemmed Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes
title_short Exploring the Limits of Combined Image/'omics Analysis for Non-cancer Histological Phenotypes
title_sort exploring the limits of combined image/'omics analysis for non-cancer histological phenotypes
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7644963/
https://www.ncbi.nlm.nih.gov/pubmed/33193632
http://dx.doi.org/10.3389/fgene.2020.555886
work_keys_str_mv AT gallinspaul exploringthelimitsofcombinedimageomicsanalysisfornoncancerhistologicalphenotypes
AT saghapourehsan exploringthelimitsofcombinedimageomicsanalysisfornoncancerhistologicalphenotypes
AT zhouyihui exploringthelimitsofcombinedimageomicsanalysisfornoncancerhistologicalphenotypes