Cargando…
Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence
The high level of sparsity in methylome profiles obtained using whole-genome bisulfite sequencing in the case of low biological material amount limits its value in the study of systems in which large samples are difficult to assemble, such as mammalian preimplantation embryonic development. The rece...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353187/ https://www.ncbi.nlm.nih.gov/pubmed/35938031 http://dx.doi.org/10.3389/fgene.2022.910439 |
_version_ | 1784762820194205696 |
---|---|
author | Dodlapati, Sanjeeva Jiang, Zongliang Sun, Jiangwen |
author_facet | Dodlapati, Sanjeeva Jiang, Zongliang Sun, Jiangwen |
author_sort | Dodlapati, Sanjeeva |
collection | PubMed |
description | The high level of sparsity in methylome profiles obtained using whole-genome bisulfite sequencing in the case of low biological material amount limits its value in the study of systems in which large samples are difficult to assemble, such as mammalian preimplantation embryonic development. The recently developed computational methods for addressing the sparsity by imputing missing have their limits when the required minimum data coverage or profiles of the same tissue in other modalities are not available. In this study, we explored the use of transfer learning together with Kullback-Leibler (KL) divergence to train predictive models for completing methylome profiles with very low coverage (below 2%). Transfer learning was used to leverage less sparse profiles that are typically available for different tissues for the same species, while KL divergence was employed to maximize the usage of information carried in the input data. A deep neural network was adopted to extract both DNA sequence and local methylation patterns for imputation. Our study of training models for completing methylome profiles of bovine oocytes and early embryos demonstrates the effectiveness of transfer learning and KL divergence, with individual increase of 29.98 and 29.43%, respectively, in prediction performance and 38.70% increase when the two were used together. The drastically increased data coverage (43.80–73.6%) after imputation powers downstream analyses involving methylomes that cannot be effectively done using the very low coverage profiles (0.06–1.47%) before imputation. |
format | Online Article Text |
id | pubmed-9353187 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-93531872022-08-06 Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence Dodlapati, Sanjeeva Jiang, Zongliang Sun, Jiangwen Front Genet Genetics The high level of sparsity in methylome profiles obtained using whole-genome bisulfite sequencing in the case of low biological material amount limits its value in the study of systems in which large samples are difficult to assemble, such as mammalian preimplantation embryonic development. The recently developed computational methods for addressing the sparsity by imputing missing have their limits when the required minimum data coverage or profiles of the same tissue in other modalities are not available. In this study, we explored the use of transfer learning together with Kullback-Leibler (KL) divergence to train predictive models for completing methylome profiles with very low coverage (below 2%). Transfer learning was used to leverage less sparse profiles that are typically available for different tissues for the same species, while KL divergence was employed to maximize the usage of information carried in the input data. A deep neural network was adopted to extract both DNA sequence and local methylation patterns for imputation. Our study of training models for completing methylome profiles of bovine oocytes and early embryos demonstrates the effectiveness of transfer learning and KL divergence, with individual increase of 29.98 and 29.43%, respectively, in prediction performance and 38.70% increase when the two were used together. The drastically increased data coverage (43.80–73.6%) after imputation powers downstream analyses involving methylomes that cannot be effectively done using the very low coverage profiles (0.06–1.47%) before imputation. Frontiers Media S.A. 2022-07-22 /pmc/articles/PMC9353187/ /pubmed/35938031 http://dx.doi.org/10.3389/fgene.2022.910439 Text en Copyright © 2022 Dodlapati, Jiang and Sun. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Dodlapati, Sanjeeva Jiang, Zongliang Sun, Jiangwen Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence |
title | Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence |
title_full | Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence |
title_fullStr | Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence |
title_full_unstemmed | Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence |
title_short | Completing Single-Cell DNA Methylome Profiles via Transfer Learning Together With KL-Divergence |
title_sort | completing single-cell dna methylome profiles via transfer learning together with kl-divergence |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9353187/ https://www.ncbi.nlm.nih.gov/pubmed/35938031 http://dx.doi.org/10.3389/fgene.2022.910439 |
work_keys_str_mv | AT dodlapatisanjeeva completingsinglecelldnamethylomeprofilesviatransferlearningtogetherwithkldivergence AT jiangzongliang completingsinglecelldnamethylomeprofilesviatransferlearningtogetherwithkldivergence AT sunjiangwen completingsinglecelldnamethylomeprofilesviatransferlearningtogetherwithkldivergence |