Cargando…

Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence

The high level of sparsity in methylome profiles obtained using whole-genome bisulfite sequencing in the case of low biological material amount limits its value in the study of systems in which large samples are difficult to assemble, such as mammalian preimplantation embryonic development. The rece...

Descripción completa

Detalles Bibliográficos
Autores principales: Dodlapati, Sanjeeva, Jiang, Zongliang, Sun, Jiangwen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353187/
https://www.ncbi.nlm.nih.gov/pubmed/35938031
http://dx.doi.org/10.3389/fgene.2022.910439
_version_ 1784762820194205696
author Dodlapati, Sanjeeva
Jiang, Zongliang
Sun, Jiangwen
author_facet Dodlapati, Sanjeeva
Jiang, Zongliang
Sun, Jiangwen
author_sort Dodlapati, Sanjeeva
collection PubMed
description The high level of sparsity in methylome profiles obtained using whole-genome bisulfite sequencing in the case of low biological material amount limits its value in the study of systems in which large samples are difficult to assemble, such as mammalian preimplantation embryonic development. The recently developed computational methods for addressing the sparsity by imputing missing have their limits when the required minimum data coverage or profiles of the same tissue in other modalities are not available. In this study, we explored the use of transfer learning together with Kullback-Leibler (KL) divergence to train predictive models for completing methylome profiles with very low coverage (below 2%). Transfer learning was used to leverage less sparse profiles that are typically available for different tissues for the same species, while KL divergence was employed to maximize the usage of information carried in the input data. A deep neural network was adopted to extract both DNA sequence and local methylation patterns for imputation. Our study of training models for completing methylome profiles of bovine oocytes and early embryos demonstrates the effectiveness of transfer learning and KL divergence, with individual increase of 29.98 and 29.43%, respectively, in prediction performance and 38.70% increase when the two were used together. The drastically increased data coverage (43.80–73.6%) after imputation powers downstream analyses involving methylomes that cannot be effectively done using the very low coverage profiles (0.06–1.47%) before imputation.
format Online
Article
Text
id pubmed-9353187
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-93531872022-08-06 Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence Dodlapati, Sanjeeva Jiang, Zongliang Sun, Jiangwen Front Genet Genetics The high level of sparsity in methylome profiles obtained using whole-genome bisulfite sequencing in the case of low biological material amount limits its value in the study of systems in which large samples are difficult to assemble, such as mammalian preimplantation embryonic development. The recently developed computational methods for addressing the sparsity by imputing missing have their limits when the required minimum data coverage or profiles of the same tissue in other modalities are not available. In this study, we explored the use of transfer learning together with Kullback-Leibler (KL) divergence to train predictive models for completing methylome profiles with very low coverage (below 2%). Transfer learning was used to leverage less sparse profiles that are typically available for different tissues for the same species, while KL divergence was employed to maximize the usage of information carried in the input data. A deep neural network was adopted to extract both DNA sequence and local methylation patterns for imputation. Our study of training models for completing methylome profiles of bovine oocytes and early embryos demonstrates the effectiveness of transfer learning and KL divergence, with individual increase of 29.98 and 29.43%, respectively, in prediction performance and 38.70% increase when the two were used together. The drastically increased data coverage (43.80–73.6%) after imputation powers downstream analyses involving methylomes that cannot be effectively done using the very low coverage profiles (0.06–1.47%) before imputation. Frontiers Media S.A. 2022-07-22 /pmc/articles/PMC9353187/ /pubmed/35938031 http://dx.doi.org/10.3389/fgene.2022.910439 Text en Copyright © 2022 Dodlapati, Jiang and Sun. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Dodlapati, Sanjeeva
Jiang, Zongliang
Sun, Jiangwen
Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
title Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
title_full Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
title_fullStr Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
title_full_unstemmed Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
title_short Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
title_sort completing single-cell dna methylome profiles via transfer learning together with kl-divergence
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353187/
https://www.ncbi.nlm.nih.gov/pubmed/35938031
http://dx.doi.org/10.3389/fgene.2022.910439
work_keys_str_mv AT dodlapatisanjeeva completingsinglecelldnamethylomeprofilesviatransferlearningtogetherwithkldivergence
AT jiangzongliang completingsinglecelldnamethylomeprofilesviatransferlearningtogetherwithkldivergence
AT sunjiangwen completingsinglecelldnamethylomeprofilesviatransferlearningtogetherwithkldivergence