Cargando…

Prediction of count phenotypes using high-resolution images and genomic data

Genomic selection (GS) is revolutionizing plant breeding since the selection process is done with the help of statistical machine learning methods. A model is trained with a reference population and then it is used for predicting the candidate individuals available in the testing set. However, given...

Descripción completa

Detalles Bibliográficos
Autores principales: Kismiantini,  , Montesinos-López, Osval Antonio, Crossa, José, Setiawan, Ezra Putranda, Wutsqa, Dhoriva Urwatul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022939/
https://www.ncbi.nlm.nih.gov/pubmed/33847694
http://dx.doi.org/10.1093/g3journal/jkab035
_version_ 1783675031043178496
author Kismiantini,  
Montesinos-López, Osval Antonio
Crossa, José
Setiawan, Ezra Putranda
Wutsqa, Dhoriva Urwatul
author_facet Kismiantini,  
Montesinos-López, Osval Antonio
Crossa, José
Setiawan, Ezra Putranda
Wutsqa, Dhoriva Urwatul
author_sort Kismiantini,  
collection PubMed
description Genomic selection (GS) is revolutionizing plant breeding since the selection process is done with the help of statistical machine learning methods. A model is trained with a reference population and then it is used for predicting the candidate individuals available in the testing set. However, given that breeding phenotypic values are very noisy, new models must be able to integrate not only genotypic and environmental data but also high-resolution images that have been collected by breeders with advanced image technology. For this reason, this paper explores the use of generalized Poisson regression (GPR) for genome-enabled prediction of count phenotypes using genomic and hyperspectral images. The GPR model allows integrating input information of many sources like environments, genomic data, high resolution data, and interaction terms between these three sources. We found that the best prediction performance was obtained when the three sources of information were taken into account in the predictor, and those measures of high-resolution images close to the harvest day provided the best prediction performance.
format Online
Article
Text
id pubmed-8022939
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80229392021-04-09 Prediction of count phenotypes using high-resolution images and genomic data Kismiantini,   Montesinos-López, Osval Antonio Crossa, José Setiawan, Ezra Putranda Wutsqa, Dhoriva Urwatul G3 (Bethesda) Investigation Genomic selection (GS) is revolutionizing plant breeding since the selection process is done with the help of statistical machine learning methods. A model is trained with a reference population and then it is used for predicting the candidate individuals available in the testing set. However, given that breeding phenotypic values are very noisy, new models must be able to integrate not only genotypic and environmental data but also high-resolution images that have been collected by breeders with advanced image technology. For this reason, this paper explores the use of generalized Poisson regression (GPR) for genome-enabled prediction of count phenotypes using genomic and hyperspectral images. The GPR model allows integrating input information of many sources like environments, genomic data, high resolution data, and interaction terms between these three sources. We found that the best prediction performance was obtained when the three sources of information were taken into account in the predictor, and those measures of high-resolution images close to the harvest day provided the best prediction performance. Oxford University Press 2021-02-05 /pmc/articles/PMC8022939/ /pubmed/33847694 http://dx.doi.org/10.1093/g3journal/jkab035 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Investigation
Kismiantini,  
Montesinos-López, Osval Antonio
Crossa, José
Setiawan, Ezra Putranda
Wutsqa, Dhoriva Urwatul
Prediction of count phenotypes using high-resolution images and genomic data
title Prediction of count phenotypes using high-resolution images and genomic data
title_full Prediction of count phenotypes using high-resolution images and genomic data
title_fullStr Prediction of count phenotypes using high-resolution images and genomic data
title_full_unstemmed Prediction of count phenotypes using high-resolution images and genomic data
title_short Prediction of count phenotypes using high-resolution images and genomic data
title_sort prediction of count phenotypes using high-resolution images and genomic data
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8022939/
https://www.ncbi.nlm.nih.gov/pubmed/33847694
http://dx.doi.org/10.1093/g3journal/jkab035
work_keys_str_mv AT kismiantini predictionofcountphenotypesusinghighresolutionimagesandgenomicdata
AT montesinoslopezosvalantonio predictionofcountphenotypesusinghighresolutionimagesandgenomicdata
AT crossajose predictionofcountphenotypesusinghighresolutionimagesandgenomicdata
AT setiawanezraputranda predictionofcountphenotypesusinghighresolutionimagesandgenomicdata
AT wutsqadhorivaurwatul predictionofcountphenotypesusinghighresolutionimagesandgenomicdata