Cargando…

methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder

BACKGROUND: Recently, DNA methylation has drawn great attention due to its strong correlation with abnormal gene activities and informative representation of the cancer status. As a number of studies focus on DNA methylation signatures in cancer, demand for utilizing publicly available methylome dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Joungmin, Chae, Heejoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7216580/
https://www.ncbi.nlm.nih.gov/pubmed/32393170
http://dx.doi.org/10.1186/s12859-020-3516-8
_version_ 1783532439383048192
author Choi, Joungmin
Chae, Heejoon
author_facet Choi, Joungmin
Chae, Heejoon
author_sort Choi, Joungmin
collection PubMed
description BACKGROUND: Recently, DNA methylation has drawn great attention due to its strong correlation with abnormal gene activities and informative representation of the cancer status. As a number of studies focus on DNA methylation signatures in cancer, demand for utilizing publicly available methylome dataset has been increased. To satisfy this, large-scale projects were launched to discover biological insights into cancer, providing a collection of the dataset. However, public cancer data, especially for certain cancer types, is still limited to be used in research. Several simulation tools for producing epigenetic dataset have been introduced in order to alleviate the issue, still, to date, generation for user-specified cancer type dataset has not been proposed. RESULTS: In this paper, we present methCancer-gen, a tool for generating DNA methylome dataset considering type for cancer. Employing conditional variational autoencoder, a neural network-based generative model, it estimates the conditional distribution with latent variables and data, and generates samples for specified cancer type. CONCLUSIONS: To evaluate the simulation performance of methCancer-gen for the user-specified cancer type, our proposed model was compared to a benchmark method and it could successfully reproduce cancer type-wise data with high accuracy helping to alleviate the lack of condition-specific data issue. methCancer-gen is publicly available at https://github.com/cbi-bioinfo/methCancer-gen.
format Online
Article
Text
id pubmed-7216580
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-72165802020-05-18 methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder Choi, Joungmin Chae, Heejoon BMC Bioinformatics Research Article BACKGROUND: Recently, DNA methylation has drawn great attention due to its strong correlation with abnormal gene activities and informative representation of the cancer status. As a number of studies focus on DNA methylation signatures in cancer, demand for utilizing publicly available methylome dataset has been increased. To satisfy this, large-scale projects were launched to discover biological insights into cancer, providing a collection of the dataset. However, public cancer data, especially for certain cancer types, is still limited to be used in research. Several simulation tools for producing epigenetic dataset have been introduced in order to alleviate the issue, still, to date, generation for user-specified cancer type dataset has not been proposed. RESULTS: In this paper, we present methCancer-gen, a tool for generating DNA methylome dataset considering type for cancer. Employing conditional variational autoencoder, a neural network-based generative model, it estimates the conditional distribution with latent variables and data, and generates samples for specified cancer type. CONCLUSIONS: To evaluate the simulation performance of methCancer-gen for the user-specified cancer type, our proposed model was compared to a benchmark method and it could successfully reproduce cancer type-wise data with high accuracy helping to alleviate the lack of condition-specific data issue. methCancer-gen is publicly available at https://github.com/cbi-bioinfo/methCancer-gen. BioMed Central 2020-05-11 /pmc/articles/PMC7216580/ /pubmed/32393170 http://dx.doi.org/10.1186/s12859-020-3516-8 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Choi, Joungmin
Chae, Heejoon
methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
title methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
title_full methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
title_fullStr methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
title_full_unstemmed methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
title_short methCancer-gen: a DNA methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
title_sort methcancer-gen: a dna methylome dataset generator for user-specified cancer type based on conditional variational autoencoder
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7216580/
https://www.ncbi.nlm.nih.gov/pubmed/32393170
http://dx.doi.org/10.1186/s12859-020-3516-8
work_keys_str_mv AT choijoungmin methcancergenadnamethylomedatasetgeneratorforuserspecifiedcancertypebasedonconditionalvariationalautoencoder
AT chaeheejoon methcancergenadnamethylomedatasetgeneratorforuserspecifiedcancertypebasedonconditionalvariationalautoencoder