Cargando…

Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation

It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistica...

Descripción completa

Detalles Bibliográficos
Autores principales: Budhlakoti, Neeraj, Rai, Anil, Mishra, D. C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7242349/
https://www.ncbi.nlm.nih.gov/pubmed/32439883
http://dx.doi.org/10.1038/s41598-020-65323-3
_version_ 1783537220911628288
author Budhlakoti, Neeraj
Rai, Anil
Mishra, D. C.
author_facet Budhlakoti, Neeraj
Rai, Anil
Mishra, D. C.
author_sort Budhlakoti, Neeraj
collection PubMed
description It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistical procedures are already there in literature for identification of outlier but identification of true outlier is still a challenge especially in case of high dimensional genomic data. Here we have proposed an efficient approach for detecting outlier in high dimensional genomic data, our approach is p-value based combination methods to produce single p-value for detecting the outliers. Robustness of our approach has been tested using simulated data through the evaluation measures like precision, recall etc. It has been observed that significant improvement in the performance of genomic prediction has been obtained by detecting the outliers and handling them accordingly through our proposed approach using real data.
format Online
Article
Text
id pubmed-7242349
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72423492020-05-29 Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation Budhlakoti, Neeraj Rai, Anil Mishra, D. C. Sci Rep Article It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistical procedures are already there in literature for identification of outlier but identification of true outlier is still a challenge especially in case of high dimensional genomic data. Here we have proposed an efficient approach for detecting outlier in high dimensional genomic data, our approach is p-value based combination methods to produce single p-value for detecting the outliers. Robustness of our approach has been tested using simulated data through the evaluation measures like precision, recall etc. It has been observed that significant improvement in the performance of genomic prediction has been obtained by detecting the outliers and handling them accordingly through our proposed approach using real data. Nature Publishing Group UK 2020-05-21 /pmc/articles/PMC7242349/ /pubmed/32439883 http://dx.doi.org/10.1038/s41598-020-65323-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Budhlakoti, Neeraj
Rai, Anil
Mishra, D. C.
Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
title Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
title_full Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
title_fullStr Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
title_full_unstemmed Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
title_short Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
title_sort statistical approach for improving genomic prediction accuracy through efficient diagnostic measure of influential observation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7242349/
https://www.ncbi.nlm.nih.gov/pubmed/32439883
http://dx.doi.org/10.1038/s41598-020-65323-3
work_keys_str_mv AT budhlakotineeraj statisticalapproachforimprovinggenomicpredictionaccuracythroughefficientdiagnosticmeasureofinfluentialobservation
AT raianil statisticalapproachforimprovinggenomicpredictionaccuracythroughefficientdiagnosticmeasureofinfluentialobservation
AT mishradc statisticalapproachforimprovinggenomicpredictionaccuracythroughefficientdiagnosticmeasureofinfluentialobservation