
Using Local Convolutional Neural Networks for Genomic Prediction

The prediction of breeding values and phenotypes is of central importance for both livestock and crop breeding. In this study, we analyze the use of artificial neural networks (ANN) and, in particular, local convolutional neural networks (LCNN) for genomic prediction, as a region-specific filter cor...


Bibliographic Details
Main authors:	Pook, Torsten; Freudenthal, Jan; Korte, Arthur; Simianer, Henner
Format:	Online Article Text
Language:	English
Published:	Frontiers Media S.A. 2020
Subjects:	Genetics
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7689358/ https://www.ncbi.nlm.nih.gov/pubmed/33281867 http://dx.doi.org/10.3389/fgene.2020.561497

_version_	1783613849930301440
author	Pook, Torsten; Freudenthal, Jan; Korte, Arthur; Simianer, Henner
collection	PubMed
description	The prediction of breeding values and phenotypes is of central importance for both livestock and crop breeding. In this study, we analyze the use of artificial neural networks (ANNs) and, in particular, local convolutional neural networks (LCNNs) for genomic prediction, as a region-specific filter corresponds much better to our prior knowledge of the genetic architecture of traits than traditional convolutional neural networks do. Model performance is evaluated on a simulated maize data panel (n = 10,000; p = 34,595) and real Arabidopsis data (n = 2,039; p = 180,000) for a variety of traits based on predictive ability. The baseline LCNN, containing one local convolutional layer (kernel size: 10) and two fully connected layers with 64 nodes each, outperforms commonly proposed ANNs (multilayer perceptrons and convolutional neural networks) for basically all considered traits. For traits with high heritability and a large training population, as present in the simulated data, LCNNs even outperform state-of-the-art methods such as genomic best linear unbiased prediction (GBLUP), Bayesian models and extended GBLUP, with an increase in predictive ability of up to 24%. However, for small training populations, these state-of-the-art methods outperform all considered ANNs. Nevertheless, the LCNN still outperforms all other considered ANNs by around 10%. Minor improvements to the tested baseline LCNN architecture were obtained by increasing the kernel size and reducing the stride, whereas the number of subsequent fully connected layers and their node sizes had negligible impact. Although gains in predictive ability were obtained for large-scale data sets by using LCNNs, the practical use of ANNs comes with additional problems, such as the need to genotype all considered individuals and the lack of estimates of heritability and reliability. Furthermore, breeding values are additive by design, whereas ANN-based estimates are not. However, ANNs also come with new opportunities, as networks can easily be extended to account for additional inputs (omics, weather etc.) and outputs (multi-trait models), and computing time increases linearly with the number of individuals. With advances in high-throughput phenotyping and cheaper genotyping, ANNs can become a valid alternative for genomic prediction.
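The baseline architecture named in the description (one local convolutional layer with kernel size 10, followed by two fully connected layers of 64 nodes each) can be sketched roughly as follows. This is a minimal NumPy forward pass and not the authors' implementation: the stride of 10, the single filter per window, the ReLU activation, the random initialisation, the toy data, and all function and parameter names (`local_conv1d`, `dense`, `lcnn_forward`, `init_params`) are assumptions made for illustration. What it is meant to show is the difference from an ordinary convolution: every genomic window gets its own filter weights instead of sharing one filter along the genome.

```python
import numpy as np


def local_conv1d(x, weights, bias, stride):
    """Locally connected 1D layer: one separate filter per window (no weight sharing).

    x:       (n_individuals, n_markers) genotype matrix
    weights: (n_windows, kernel_size, n_filters)
    bias:    (n_windows, n_filters)
    """
    n_windows, kernel_size, _ = weights.shape
    # Cut the marker axis into windows: shape (n, n_windows, kernel_size).
    windows = np.stack(
        [x[:, w * stride: w * stride + kernel_size] for w in range(n_windows)],
        axis=1,
    )
    # Contract the kernel axis with the weights belonging to each window.
    out = np.einsum("nwk,wkf->nwf", windows, weights) + bias
    return np.maximum(out, 0.0)  # ReLU, an assumed activation


def dense(x, w, b, activate=True):
    """Fully connected layer with optional ReLU."""
    z = x @ w + b
    return np.maximum(z, 0.0) if activate else z


def init_params(n_markers, kernel_size=10, stride=10, n_filters=1, hidden=64, seed=0):
    """Random weights for illustration only; a real model would be trained."""
    rng = np.random.default_rng(seed)
    n_windows = (n_markers - kernel_size) // stride + 1
    flat = n_windows * n_filters
    return {
        "lc_w": rng.normal(0.0, 0.1, (n_windows, kernel_size, n_filters)),
        "lc_b": np.zeros((n_windows, n_filters)),
        "fc1_w": rng.normal(0.0, 0.1, (flat, hidden)),
        "fc1_b": np.zeros(hidden),
        "fc2_w": rng.normal(0.0, 0.1, (hidden, hidden)),
        "fc2_b": np.zeros(hidden),
        "out_w": rng.normal(0.0, 0.1, (hidden, 1)),
        "out_b": np.zeros(1),
    }


def lcnn_forward(x, params, stride=10):
    """Local convolution -> flatten -> two hidden dense layers -> one output per individual."""
    h = local_conv1d(x, params["lc_w"], params["lc_b"], stride)
    h = h.reshape(h.shape[0], -1)
    h = dense(h, params["fc1_w"], params["fc1_b"])
    h = dense(h, params["fc2_w"], params["fc2_b"])
    return dense(h, params["out_w"], params["out_b"], activate=False)[:, 0]


# Toy data: 5 individuals, 100 markers coded 0/1/2 (hypothetical, not the paper's data).
rng = np.random.default_rng(1)
genotypes = rng.integers(0, 3, size=(5, 100)).astype(float)
params = init_params(n_markers=100)
predictions = lcnn_forward(genotypes, params)
```

Because the weights are indexed by window, the number of parameters in the local layer grows with the number of markers, which is one reason such a layer is usually paired with a stride larger than 1.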
format	Online Article Text
id	pubmed-7689358
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-7689358 2020-12-04 Using Local Convolutional Neural Networks for Genomic Prediction Pook, Torsten; Freudenthal, Jan; Korte, Arthur; Simianer, Henner Front Genet Genetics Frontiers Media S.A. 2020-11-12 /pmc/articles/PMC7689358/ /pubmed/33281867 http://dx.doi.org/10.3389/fgene.2020.561497 Text en Copyright © 2020 Pook, Freudenthal, Korte and Simianer. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title	Using Local Convolutional Neural Networks for Genomic Prediction
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7689358/ https://www.ncbi.nlm.nih.gov/pubmed/33281867 http://dx.doi.org/10.3389/fgene.2020.561497

Similar Items