Cargando…
Predicting phenotypes from novel genomic markers using deep learning
Summary: Genomic selection (GS) models use single nucleotide polymorphism (SNP) markers to predict phenotypes. However, these predictive models face challenges due to the high dimensionality of genome-wide SNP marker data. Thanks to recent breakthroughs in DNA sequencing and decreased sequencing cos...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10132579/ https://www.ncbi.nlm.nih.gov/pubmed/37123455 http://dx.doi.org/10.1093/bioadv/vbad028 |
_version_ | 1785031412350451712 |
---|---|
author | Sehrawat, Shivani Najafian, Keyhan Jin, Lingling |
author_facet | Sehrawat, Shivani Najafian, Keyhan Jin, Lingling |
author_sort | Sehrawat, Shivani |
collection | PubMed |
description | Summary: Genomic selection (GS) models use single nucleotide polymorphism (SNP) markers to predict phenotypes. However, these predictive models face challenges due to the high dimensionality of genome-wide SNP marker data. Thanks to recent breakthroughs in DNA sequencing and decreased sequencing cost, the study of novel genomic variants such as structural variations (SVs) and transposable elements (TEs) become increasingly prevalent. In this article, we develop a deep convolutional neural network model, NovGMDeep, to predict phenotypes using SVs and TEs markers for GS. The proposed model is trained and tested on samples of Arabidopsis thaliana and Oryza sativa using k-fold cross-validation. The prediction accuracy is evaluated using Pearson’s Correlation Coefficient (PCC), mean absolute error (MAE) and SD of MAE. The predicted results showed higher correlation when the model is trained with SVs and TEs than with SNPs. NovGMDeep also has higher prediction accuracy when comparing with conventional statistical models. This work sheds light on the unappreciated function of SVs and TEs in genotype-to-phenotype associations, as well as their extensive significance and value in crop development. |
format | Online Article Text |
id | pubmed-10132579 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101325792023-04-27 Predicting phenotypes from novel genomic markers using deep learning Sehrawat, Shivani Najafian, Keyhan Jin, Lingling Bioinform Adv Original Article Summary: Genomic selection (GS) models use single nucleotide polymorphism (SNP) markers to predict phenotypes. However, these predictive models face challenges due to the high dimensionality of genome-wide SNP marker data. Thanks to recent breakthroughs in DNA sequencing and decreased sequencing cost, the study of novel genomic variants such as structural variations (SVs) and transposable elements (TEs) become increasingly prevalent. In this article, we develop a deep convolutional neural network model, NovGMDeep, to predict phenotypes using SVs and TEs markers for GS. The proposed model is trained and tested on samples of Arabidopsis thaliana and Oryza sativa using k-fold cross-validation. The prediction accuracy is evaluated using Pearson’s Correlation Coefficient (PCC), mean absolute error (MAE) and SD of MAE. The predicted results showed higher correlation when the model is trained with SVs and TEs than with SNPs. NovGMDeep also has higher prediction accuracy when comparing with conventional statistical models. This work sheds light on the unappreciated function of SVs and TEs in genotype-to-phenotype associations, as well as their extensive significance and value in crop development. Oxford University Press 2023-03-09 /pmc/articles/PMC10132579/ /pubmed/37123455 http://dx.doi.org/10.1093/bioadv/vbad028 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Sehrawat, Shivani Najafian, Keyhan Jin, Lingling Predicting phenotypes from novel genomic markers using deep learning |
title | Predicting phenotypes from novel genomic markers using deep learning |
title_full | Predicting phenotypes from novel genomic markers using deep learning |
title_fullStr | Predicting phenotypes from novel genomic markers using deep learning |
title_full_unstemmed | Predicting phenotypes from novel genomic markers using deep learning |
title_short | Predicting phenotypes from novel genomic markers using deep learning |
title_sort | predicting phenotypes from novel genomic markers using deep learning |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10132579/ https://www.ncbi.nlm.nih.gov/pubmed/37123455 http://dx.doi.org/10.1093/bioadv/vbad028 |
work_keys_str_mv | AT sehrawatshivani predictingphenotypesfromnovelgenomicmarkersusingdeeplearning AT najafiankeyhan predictingphenotypesfromnovelgenomicmarkersusingdeeplearning AT jinlingling predictingphenotypesfromnovelgenomicmarkersusingdeeplearning |