Cargando…

A Deep-Learning Pipeline for TSS Coverage Imputation From Shallow Cell-Free DNA Sequencing

Cell-free DNA (cfDNA) serves as a footprint of the nucleosome occupancy status of transcription start sites (TSSs), and has been subject to wide development for use in noninvasive health monitoring and disease detection. However, the requirement for high sequencing depth limits its clinical use. Her...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Bo-Wei, Yang, Xu, Qu, Shou-Fang, Guo, Zhi-Wei, Huang, Li-Min, Li, Kun, Ouyang, Guo-Jun, Cai, Geng-Xi, Xiao, Wei-Wei, Weng, Rong-Tao, Xu, Shun, Huang, Jie, Yang, Xue-Xi, Wu, Ying-Song
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8678047/
https://www.ncbi.nlm.nih.gov/pubmed/34926480
http://dx.doi.org/10.3389/fmed.2021.684238
Descripción
Sumario:Cell-free DNA (cfDNA) serves as a footprint of the nucleosome occupancy status of transcription start sites (TSSs), and has been subject to wide development for use in noninvasive health monitoring and disease detection. However, the requirement for high sequencing depth limits its clinical use. Here, we introduce a deep-learning pipeline designed for TSS coverage profiles generated from shallow cfDNA sequencing called the Autoencoder of cfDNA TSS (AECT) coverage profile. AECT outperformed existing single-cell sequencing imputation algorithms in terms of improvements to TSS coverage accuracy and the capture of latent biological features that distinguish sex or tumor status. We built classifiers for the detection of breast and rectal cancer using AECT-imputed shallow sequencing data, and their performance was close to that achieved by high-depth sequencing, suggesting that AECT could provide a broadly applicable noninvasive screening approach with high accuracy and at a moderate cost.