Cargando…

Toward computing attributions for dimensionality reduction techniques

SUMMARY: We describe the problem of computing local feature attributions for dimensionality reduction methods. We use one such method that is well established within the context of supervised classification—using the gradients of target outputs with respect to the inputs—on the popular dimensionalit...

Descripción completa

Detalles Bibliográficos
Autores principales: Scicluna, Matthew, Grenier, Jean-Christophe, Poujol, Raphaël, Lemieux, Sébastien, Hussin, Julie G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502234/
https://www.ncbi.nlm.nih.gov/pubmed/37720006
http://dx.doi.org/10.1093/bioadv/vbad097
_version_ 1785106276441653248
author Scicluna, Matthew
Grenier, Jean-Christophe
Poujol, Raphaël
Lemieux, Sébastien
Hussin, Julie G
author_facet Scicluna, Matthew
Grenier, Jean-Christophe
Poujol, Raphaël
Lemieux, Sébastien
Hussin, Julie G
author_sort Scicluna, Matthew
collection PubMed
description SUMMARY: We describe the problem of computing local feature attributions for dimensionality reduction methods. We use one such method that is well established within the context of supervised classification—using the gradients of target outputs with respect to the inputs—on the popular dimensionality reduction technique t-SNE, widely used in analyses of biological data. We provide an efficient implementation for the gradient computation for this dimensionality reduction technique. We show that our explanations identify significant features using novel validation methodology; using synthetic datasets and the popular MNIST benchmark dataset. We then demonstrate the practical utility of our algorithm by showing that it can produce explanations that agree with domain knowledge on a SARS-CoV-2 sequence dataset. Throughout, we provide a road map so that similar explanation methods could be applied to other dimensionality reduction techniques to rigorously analyze biological datasets. AVAILABILITY AND IMPLEMENTATION: We have created a Python package that can be installed using the following command: pip install interpretable_tsne. All code used can be found at github.com/MattScicluna/interpretable_tsne.
format Online
Article
Text
id pubmed-10502234
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105022342023-09-16 Toward computing attributions for dimensionality reduction techniques Scicluna, Matthew Grenier, Jean-Christophe Poujol, Raphaël Lemieux, Sébastien Hussin, Julie G Bioinform Adv Original Article SUMMARY: We describe the problem of computing local feature attributions for dimensionality reduction methods. We use one such method that is well established within the context of supervised classification—using the gradients of target outputs with respect to the inputs—on the popular dimensionality reduction technique t-SNE, widely used in analyses of biological data. We provide an efficient implementation for the gradient computation for this dimensionality reduction technique. We show that our explanations identify significant features using novel validation methodology; using synthetic datasets and the popular MNIST benchmark dataset. We then demonstrate the practical utility of our algorithm by showing that it can produce explanations that agree with domain knowledge on a SARS-CoV-2 sequence dataset. Throughout, we provide a road map so that similar explanation methods could be applied to other dimensionality reduction techniques to rigorously analyze biological datasets. AVAILABILITY AND IMPLEMENTATION: We have created a Python package that can be installed using the following command: pip install interpretable_tsne. All code used can be found at github.com/MattScicluna/interpretable_tsne. Oxford University Press 2023-08-03 /pmc/articles/PMC10502234/ /pubmed/37720006 http://dx.doi.org/10.1093/bioadv/vbad097 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Scicluna, Matthew
Grenier, Jean-Christophe
Poujol, Raphaël
Lemieux, Sébastien
Hussin, Julie G
Toward computing attributions for dimensionality reduction techniques
title Toward computing attributions for dimensionality reduction techniques
title_full Toward computing attributions for dimensionality reduction techniques
title_fullStr Toward computing attributions for dimensionality reduction techniques
title_full_unstemmed Toward computing attributions for dimensionality reduction techniques
title_short Toward computing attributions for dimensionality reduction techniques
title_sort toward computing attributions for dimensionality reduction techniques
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502234/
https://www.ncbi.nlm.nih.gov/pubmed/37720006
http://dx.doi.org/10.1093/bioadv/vbad097
work_keys_str_mv AT sciclunamatthew towardcomputingattributionsfordimensionalityreductiontechniques
AT grenierjeanchristophe towardcomputingattributionsfordimensionalityreductiontechniques
AT poujolraphael towardcomputingattributionsfordimensionalityreductiontechniques
AT lemieuxsebastien towardcomputingattributionsfordimensionalityreductiontechniques
AT hussinjulieg towardcomputingattributionsfordimensionalityreductiontechniques