Cargando…

Predicting phenotypes from novel genomic markers using deep learning

Summary: Genomic selection (GS) models use single nucleotide polymorphism (SNP) markers to predict phenotypes. However, these predictive models face challenges due to the high dimensionality of genome-wide SNP marker data. Thanks to recent breakthroughs in DNA sequencing and decreased sequencing cos...

Descripción completa

Detalles Bibliográficos
Autores principales: Sehrawat, Shivani, Najafian, Keyhan, Jin, Lingling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10132579/
https://www.ncbi.nlm.nih.gov/pubmed/37123455
http://dx.doi.org/10.1093/bioadv/vbad028
_version_ 1785031412350451712
author Sehrawat, Shivani
Najafian, Keyhan
Jin, Lingling
author_facet Sehrawat, Shivani
Najafian, Keyhan
Jin, Lingling
author_sort Sehrawat, Shivani
collection PubMed
description Summary: Genomic selection (GS) models use single nucleotide polymorphism (SNP) markers to predict phenotypes. However, these predictive models face challenges due to the high dimensionality of genome-wide SNP marker data. Thanks to recent breakthroughs in DNA sequencing and decreased sequencing cost, the study of novel genomic variants such as structural variations (SVs) and transposable elements (TEs) become increasingly prevalent. In this article, we develop a deep convolutional neural network model, NovGMDeep, to predict phenotypes using SVs and TEs markers for GS. The proposed model is trained and tested on samples of Arabidopsis thaliana and Oryza sativa using k-fold cross-validation. The prediction accuracy is evaluated using Pearson’s Correlation Coefficient (PCC), mean absolute error (MAE) and SD of MAE. The predicted results showed higher correlation when the model is trained with SVs and TEs than with SNPs. NovGMDeep also has higher prediction accuracy when comparing with conventional statistical models. This work sheds light on the unappreciated function of SVs and TEs in genotype-to-phenotype associations, as well as their extensive significance and value in crop development.
format Online
Article
Text
id pubmed-10132579
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101325792023-04-27 Predicting phenotypes from novel genomic markers using deep learning Sehrawat, Shivani Najafian, Keyhan Jin, Lingling Bioinform Adv Original Article Summary: Genomic selection (GS) models use single nucleotide polymorphism (SNP) markers to predict phenotypes. However, these predictive models face challenges due to the high dimensionality of genome-wide SNP marker data. Thanks to recent breakthroughs in DNA sequencing and decreased sequencing cost, the study of novel genomic variants such as structural variations (SVs) and transposable elements (TEs) become increasingly prevalent. In this article, we develop a deep convolutional neural network model, NovGMDeep, to predict phenotypes using SVs and TEs markers for GS. The proposed model is trained and tested on samples of Arabidopsis thaliana and Oryza sativa using k-fold cross-validation. The prediction accuracy is evaluated using Pearson’s Correlation Coefficient (PCC), mean absolute error (MAE) and SD of MAE. The predicted results showed higher correlation when the model is trained with SVs and TEs than with SNPs. NovGMDeep also has higher prediction accuracy when comparing with conventional statistical models. This work sheds light on the unappreciated function of SVs and TEs in genotype-to-phenotype associations, as well as their extensive significance and value in crop development. Oxford University Press 2023-03-09 /pmc/articles/PMC10132579/ /pubmed/37123455 http://dx.doi.org/10.1093/bioadv/vbad028 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Sehrawat, Shivani
Najafian, Keyhan
Jin, Lingling
Predicting phenotypes from novel genomic markers using deep learning
title Predicting phenotypes from novel genomic markers using deep learning
title_full Predicting phenotypes from novel genomic markers using deep learning
title_fullStr Predicting phenotypes from novel genomic markers using deep learning
title_full_unstemmed Predicting phenotypes from novel genomic markers using deep learning
title_short Predicting phenotypes from novel genomic markers using deep learning
title_sort predicting phenotypes from novel genomic markers using deep learning
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10132579/
https://www.ncbi.nlm.nih.gov/pubmed/37123455
http://dx.doi.org/10.1093/bioadv/vbad028
work_keys_str_mv AT sehrawatshivani predictingphenotypesfromnovelgenomicmarkersusingdeeplearning
AT najafiankeyhan predictingphenotypesfromnovelgenomicmarkersusingdeeplearning
AT jinlingling predictingphenotypesfromnovelgenomicmarkersusingdeeplearning