Cargando…
Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation
It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistica...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7242349/ https://www.ncbi.nlm.nih.gov/pubmed/32439883 http://dx.doi.org/10.1038/s41598-020-65323-3 |
_version_ | 1783537220911628288 |
---|---|
author | Budhlakoti, Neeraj Rai, Anil Mishra, D. C. |
author_facet | Budhlakoti, Neeraj Rai, Anil Mishra, D. C. |
author_sort | Budhlakoti, Neeraj |
collection | PubMed |
description | It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistical procedures are already there in literature for identification of outlier but identification of true outlier is still a challenge especially in case of high dimensional genomic data. Here we have proposed an efficient approach for detecting outlier in high dimensional genomic data, our approach is p-value based combination methods to produce single p-value for detecting the outliers. Robustness of our approach has been tested using simulated data through the evaluation measures like precision, recall etc. It has been observed that significant improvement in the performance of genomic prediction has been obtained by detecting the outliers and handling them accordingly through our proposed approach using real data. |
format | Online Article Text |
id | pubmed-7242349 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-72423492020-05-29 Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation Budhlakoti, Neeraj Rai, Anil Mishra, D. C. Sci Rep Article It is expected the predictive performance of genomic prediction methods may be adversely affected in the presence of outliers. In agriculture science an outlier may arise due to wrong data imputation, outlying response, and in a series of trials over the time or location. Although several statistical procedures are already there in literature for identification of outlier but identification of true outlier is still a challenge especially in case of high dimensional genomic data. Here we have proposed an efficient approach for detecting outlier in high dimensional genomic data, our approach is p-value based combination methods to produce single p-value for detecting the outliers. Robustness of our approach has been tested using simulated data through the evaluation measures like precision, recall etc. It has been observed that significant improvement in the performance of genomic prediction has been obtained by detecting the outliers and handling them accordingly through our proposed approach using real data. Nature Publishing Group UK 2020-05-21 /pmc/articles/PMC7242349/ /pubmed/32439883 http://dx.doi.org/10.1038/s41598-020-65323-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Budhlakoti, Neeraj Rai, Anil Mishra, D. C. Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation |
title | Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation |
title_full | Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation |
title_fullStr | Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation |
title_full_unstemmed | Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation |
title_short | Statistical Approach for Improving Genomic Prediction Accuracy through Efficient Diagnostic Measure of Influential Observation |
title_sort | statistical approach for improving genomic prediction accuracy through efficient diagnostic measure of influential observation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7242349/ https://www.ncbi.nlm.nih.gov/pubmed/32439883 http://dx.doi.org/10.1038/s41598-020-65323-3 |
work_keys_str_mv | AT budhlakotineeraj statisticalapproachforimprovinggenomicpredictionaccuracythroughefficientdiagnosticmeasureofinfluentialobservation AT raianil statisticalapproachforimprovinggenomicpredictionaccuracythroughefficientdiagnosticmeasureofinfluentialobservation AT mishradc statisticalapproachforimprovinggenomicpredictionaccuracythroughefficientdiagnosticmeasureofinfluentialobservation |