Cargando…

Uncovering Effective Explanations for Interactive Genomic Data Analysis

Better tools are needed to enable researchers to quickly identify and explore effective and interpretable feature-based explanations for discriminating multi-class genomic datasets, e.g., healthy versus diseased samples. We develop an interactive exploration tool, GENVISAGE, which rapidly discovers...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Silu, Blatti, Charles, Sinha, Saurabh, Parameswaran, Aditya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660438/
https://www.ncbi.nlm.nih.gov/pubmed/33205133
http://dx.doi.org/10.1016/j.patter.2020.100093
_version_ 1783609003339677696
author Huang, Silu
Blatti, Charles
Sinha, Saurabh
Parameswaran, Aditya
author_facet Huang, Silu
Blatti, Charles
Sinha, Saurabh
Parameswaran, Aditya
author_sort Huang, Silu
collection PubMed
description Better tools are needed to enable researchers to quickly identify and explore effective and interpretable feature-based explanations for discriminating multi-class genomic datasets, e.g., healthy versus diseased samples. We develop an interactive exploration tool, GENVISAGE, which rapidly discovers the most discriminative feature pairs that separate two classes of genomic objects and then displays the corresponding visualizations. Since quickly finding top feature pairs is computationally challenging, especially for large numbers of objects and features, we propose a suite of optimizations to make GENVISAGE responsive at scale and demonstrate that our optimizations lead to a 400× speedup over competitive baselines for multiple biological datasets. We apply our rapid and interpretable tool to identify literature-supported pairs of genes whose transcriptomic responses significantly discriminate several chemotherapy drug treatments. With its generalizable optimizations and framework, GENVISAGE opens up real-time feature-based explanation generation to data from massive sequencing efforts, as well as many other scientific domains.
format Online
Article
Text
id pubmed-7660438
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-76604382020-11-16 Uncovering Effective Explanations for Interactive Genomic Data Analysis Huang, Silu Blatti, Charles Sinha, Saurabh Parameswaran, Aditya Patterns (N Y) Article Better tools are needed to enable researchers to quickly identify and explore effective and interpretable feature-based explanations for discriminating multi-class genomic datasets, e.g., healthy versus diseased samples. We develop an interactive exploration tool, GENVISAGE, which rapidly discovers the most discriminative feature pairs that separate two classes of genomic objects and then displays the corresponding visualizations. Since quickly finding top feature pairs is computationally challenging, especially for large numbers of objects and features, we propose a suite of optimizations to make GENVISAGE responsive at scale and demonstrate that our optimizations lead to a 400× speedup over competitive baselines for multiple biological datasets. We apply our rapid and interpretable tool to identify literature-supported pairs of genes whose transcriptomic responses significantly discriminate several chemotherapy drug treatments. With its generalizable optimizations and framework, GENVISAGE opens up real-time feature-based explanation generation to data from massive sequencing efforts, as well as many other scientific domains. Elsevier 2020-09-11 /pmc/articles/PMC7660438/ /pubmed/33205133 http://dx.doi.org/10.1016/j.patter.2020.100093 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Huang, Silu
Blatti, Charles
Sinha, Saurabh
Parameswaran, Aditya
Uncovering Effective Explanations for Interactive Genomic Data Analysis
title Uncovering Effective Explanations for Interactive Genomic Data Analysis
title_full Uncovering Effective Explanations for Interactive Genomic Data Analysis
title_fullStr Uncovering Effective Explanations for Interactive Genomic Data Analysis
title_full_unstemmed Uncovering Effective Explanations for Interactive Genomic Data Analysis
title_short Uncovering Effective Explanations for Interactive Genomic Data Analysis
title_sort uncovering effective explanations for interactive genomic data analysis
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7660438/
https://www.ncbi.nlm.nih.gov/pubmed/33205133
http://dx.doi.org/10.1016/j.patter.2020.100093
work_keys_str_mv AT huangsilu uncoveringeffectiveexplanationsforinteractivegenomicdataanalysis
AT blatticharles uncoveringeffectiveexplanationsforinteractivegenomicdataanalysis
AT sinhasaurabh uncoveringeffectiveexplanationsforinteractivegenomicdataanalysis
AT parameswaranaditya uncoveringeffectiveexplanationsforinteractivegenomicdataanalysis