Cargando…

GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis

MOTIVATION: Given multi-platform genome data with prior knowledge of functional gene sets, how can we extract interpretable latent relationships between patients and genes? More specifically, how can we devise a tensor factorization method which produces an interpretable gene factor matrix based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Jungwoo, Oh, Sejoon, Sael, Lee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6289137/
https://www.ncbi.nlm.nih.gov/pubmed/29931238
http://dx.doi.org/10.1093/bioinformatics/bty490
_version_ 1783379929541378048
author Lee, Jungwoo
Oh, Sejoon
Sael, Lee
author_facet Lee, Jungwoo
Oh, Sejoon
Sael, Lee
author_sort Lee, Jungwoo
collection PubMed
description MOTIVATION: Given multi-platform genome data with prior knowledge of functional gene sets, how can we extract interpretable latent relationships between patients and genes? More specifically, how can we devise a tensor factorization method which produces an interpretable gene factor matrix based on functional gene set information while maintaining the decomposition quality and speed? RESULTS: We propose GIFT, a Guided and Interpretable Factorization for Tensors. GIFT provides interpretable factor matrices by encoding prior knowledge as a regularization term in its objective function. We apply GIFT to the PanCan12 dataset (TCGA multi-platform genome data) and compare the performance with P-Tucker, our baseline method without prior knowledge constraint, and Silenced-TF, our naive interpretable method. Results show that GIFT produces interpretable factorizations with high scalability and accuracy. Furthermore, we demonstrate how results of GIFT can be used to reveal significant relations between (cancer, gene sets, genes) and validate the findings based on literature evidence. AVAILABILITY AND IMPLEMENTATION: The code and datasets used in the paper are available at https://github.com/leesael/GIFT. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6289137
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-62891372018-12-14 GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis Lee, Jungwoo Oh, Sejoon Sael, Lee Bioinformatics Original Papers MOTIVATION: Given multi-platform genome data with prior knowledge of functional gene sets, how can we extract interpretable latent relationships between patients and genes? More specifically, how can we devise a tensor factorization method which produces an interpretable gene factor matrix based on functional gene set information while maintaining the decomposition quality and speed? RESULTS: We propose GIFT, a Guided and Interpretable Factorization for Tensors. GIFT provides interpretable factor matrices by encoding prior knowledge as a regularization term in its objective function. We apply GIFT to the PanCan12 dataset (TCGA multi-platform genome data) and compare the performance with P-Tucker, our baseline method without prior knowledge constraint, and Silenced-TF, our naive interpretable method. Results show that GIFT produces interpretable factorizations with high scalability and accuracy. Furthermore, we demonstrate how results of GIFT can be used to reveal significant relations between (cancer, gene sets, genes) and validate the findings based on literature evidence. AVAILABILITY AND IMPLEMENTATION: The code and datasets used in the paper are available at https://github.com/leesael/GIFT. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-12-15 2018-06-21 /pmc/articles/PMC6289137/ /pubmed/29931238 http://dx.doi.org/10.1093/bioinformatics/bty490 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Lee, Jungwoo
Oh, Sejoon
Sael, Lee
GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis
title GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis
title_full GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis
title_fullStr GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis
title_full_unstemmed GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis
title_short GIFT: Guided and Interpretable Factorization for Tensors with an application to large-scale multi-platform cancer analysis
title_sort gift: guided and interpretable factorization for tensors with an application to large-scale multi-platform cancer analysis
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6289137/
https://www.ncbi.nlm.nih.gov/pubmed/29931238
http://dx.doi.org/10.1093/bioinformatics/bty490
work_keys_str_mv AT leejungwoo giftguidedandinterpretablefactorizationfortensorswithanapplicationtolargescalemultiplatformcanceranalysis
AT ohsejoon giftguidedandinterpretablefactorizationfortensorswithanapplicationtolargescalemultiplatformcanceranalysis
AT saellee giftguidedandinterpretablefactorizationfortensorswithanapplicationtolargescalemultiplatformcanceranalysis