Cargando…

wenda_gpu: fast domain adaptation for genomic data

MOTIVATION: Domain adaptation allows for the development of predictive models even in cases with limited sample data. Weighted elastic net domain adaptation specifically leverages features of genomic data to maximize transferability but the method is too computationally demanding to apply to many ge...

Descripción completa

Detalles Bibliográficos
Autores principales: Hippen, Ariel A, Crawford, Jake, Gardner, Jacob R, Greene, Casey S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665854/
https://www.ncbi.nlm.nih.gov/pubmed/36193991
http://dx.doi.org/10.1093/bioinformatics/btac663
_version_ 1784831377467768832
author Hippen, Ariel A
Crawford, Jake
Gardner, Jacob R
Greene, Casey S
author_facet Hippen, Ariel A
Crawford, Jake
Gardner, Jacob R
Greene, Casey S
author_sort Hippen, Ariel A
collection PubMed
description MOTIVATION: Domain adaptation allows for the development of predictive models even in cases with limited sample data. Weighted elastic net domain adaptation specifically leverages features of genomic data to maximize transferability but the method is too computationally demanding to apply to many genome-sized datasets. RESULTS: We developed wenda_gpu, which uses GPyTorch to train models on genomic data within hours on a single GPU-enabled machine. We show that wenda_gpu returns comparable results to the original wenda implementation, and that it can be used for improved prediction of cancer mutation status on small sample sizes than regular elastic net. AVAILABILITY AND IMPLEMENTATION: wenda_gpu is available on GitHub at https://github.com/greenelab/wenda_gpu/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9665854
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-96658542022-11-16 wenda_gpu: fast domain adaptation for genomic data Hippen, Ariel A Crawford, Jake Gardner, Jacob R Greene, Casey S Bioinformatics Applications Note MOTIVATION: Domain adaptation allows for the development of predictive models even in cases with limited sample data. Weighted elastic net domain adaptation specifically leverages features of genomic data to maximize transferability but the method is too computationally demanding to apply to many genome-sized datasets. RESULTS: We developed wenda_gpu, which uses GPyTorch to train models on genomic data within hours on a single GPU-enabled machine. We show that wenda_gpu returns comparable results to the original wenda implementation, and that it can be used for improved prediction of cancer mutation status on small sample sizes than regular elastic net. AVAILABILITY AND IMPLEMENTATION: wenda_gpu is available on GitHub at https://github.com/greenelab/wenda_gpu/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-10-04 /pmc/articles/PMC9665854/ /pubmed/36193991 http://dx.doi.org/10.1093/bioinformatics/btac663 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Hippen, Ariel A
Crawford, Jake
Gardner, Jacob R
Greene, Casey S
wenda_gpu: fast domain adaptation for genomic data
title wenda_gpu: fast domain adaptation for genomic data
title_full wenda_gpu: fast domain adaptation for genomic data
title_fullStr wenda_gpu: fast domain adaptation for genomic data
title_full_unstemmed wenda_gpu: fast domain adaptation for genomic data
title_short wenda_gpu: fast domain adaptation for genomic data
title_sort wenda_gpu: fast domain adaptation for genomic data
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665854/
https://www.ncbi.nlm.nih.gov/pubmed/36193991
http://dx.doi.org/10.1093/bioinformatics/btac663
work_keys_str_mv AT hippenariela wendagpufastdomainadaptationforgenomicdata
AT crawfordjake wendagpufastdomainadaptationforgenomicdata
AT gardnerjacobr wendagpufastdomainadaptationforgenomicdata
AT greenecaseys wendagpufastdomainadaptationforgenomicdata