Cargando…

EvoAug: improving generalization and interpretability of genomic deep neural networks with evolution-inspired data augmentations

Deep neural networks (DNNs) hold promise for functional genomics prediction, but their generalization capability may be limited by the amount of available data. To address this, we propose EvoAug, a suite of evolution-inspired augmentations that enhance the training of genomic DNNs by increasing gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Nicholas Keone, Tang, Ziqi, Toneyan, Shushan, Koo, Peter K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10161416/
https://www.ncbi.nlm.nih.gov/pubmed/37143118
http://dx.doi.org/10.1186/s13059-023-02941-w
Descripción
Sumario:Deep neural networks (DNNs) hold promise for functional genomics prediction, but their generalization capability may be limited by the amount of available data. To address this, we propose EvoAug, a suite of evolution-inspired augmentations that enhance the training of genomic DNNs by increasing genetic variation. Random transformation of DNA sequences can potentially alter their function in unknown ways, so we employ a fine-tuning procedure using the original non-transformed data to preserve functional integrity. Our results demonstrate that EvoAug substantially improves the generalization and interpretability of established DNNs across prominent regulatory genomics prediction tasks, offering a robust solution for genomic DNNs. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-02941-w.