Cargando…

An integrative imputation method based on multi-omics datasets

BACKGROUND: Integrative analysis of multi-omics data is becoming increasingly important to unravel functional mechanisms of complex diseases. However, the currently available multi-omics datasets inevitably suffer from missing values due to technical limitations and various constrains in experiments...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Dongdong, Zhang, Jigang, Li, Jingyao, Xu, Chao, Deng, Hong-Wen, Wang, Yu-Ping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915152/
https://www.ncbi.nlm.nih.gov/pubmed/27329642
http://dx.doi.org/10.1186/s12859-016-1122-6
_version_ 1782438655690801152
author Lin, Dongdong
Zhang, Jigang
Li, Jingyao
Xu, Chao
Deng, Hong-Wen
Wang, Yu-Ping
author_facet Lin, Dongdong
Zhang, Jigang
Li, Jingyao
Xu, Chao
Deng, Hong-Wen
Wang, Yu-Ping
author_sort Lin, Dongdong
collection PubMed
description BACKGROUND: Integrative analysis of multi-omics data is becoming increasingly important to unravel functional mechanisms of complex diseases. However, the currently available multi-omics datasets inevitably suffer from missing values due to technical limitations and various constrains in experiments. These missing values severely hinder integrative analysis of multi-omics data. Current imputation methods mainly focus on using single omics data while ignoring biological interconnections and information imbedded in multi-omics data sets. RESULTS: In this study, a novel multi-omics imputation method was proposed to integrate multiple correlated omics datasets for improving the imputation accuracy. Our method was designed to: 1) combine the estimates of missing value from individual omics data itself as well as from other omics, and 2) simultaneously impute multiple missing omics datasets by an iterative algorithm. We compared our method with five imputation methods using single omics data at different noise levels, sample sizes and data missing rates. The results demonstrated the advantage and efficiency of our method, consistently in terms of the imputation error and the recovery of mRNA-miRNA network structure. CONCLUSIONS: We concluded that our proposed imputation method can utilize more biological information to minimize the imputation error and thus can improve the performance of downstream analysis such as genetic regulatory network construction. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1122-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4915152
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49151522016-06-27 An integrative imputation method based on multi-omics datasets Lin, Dongdong Zhang, Jigang Li, Jingyao Xu, Chao Deng, Hong-Wen Wang, Yu-Ping BMC Bioinformatics Methodology Article BACKGROUND: Integrative analysis of multi-omics data is becoming increasingly important to unravel functional mechanisms of complex diseases. However, the currently available multi-omics datasets inevitably suffer from missing values due to technical limitations and various constrains in experiments. These missing values severely hinder integrative analysis of multi-omics data. Current imputation methods mainly focus on using single omics data while ignoring biological interconnections and information imbedded in multi-omics data sets. RESULTS: In this study, a novel multi-omics imputation method was proposed to integrate multiple correlated omics datasets for improving the imputation accuracy. Our method was designed to: 1) combine the estimates of missing value from individual omics data itself as well as from other omics, and 2) simultaneously impute multiple missing omics datasets by an iterative algorithm. We compared our method with five imputation methods using single omics data at different noise levels, sample sizes and data missing rates. The results demonstrated the advantage and efficiency of our method, consistently in terms of the imputation error and the recovery of mRNA-miRNA network structure. CONCLUSIONS: We concluded that our proposed imputation method can utilize more biological information to minimize the imputation error and thus can improve the performance of downstream analysis such as genetic regulatory network construction. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1122-6) contains supplementary material, which is available to authorized users. BioMed Central 2016-06-21 /pmc/articles/PMC4915152/ /pubmed/27329642 http://dx.doi.org/10.1186/s12859-016-1122-6 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lin, Dongdong
Zhang, Jigang
Li, Jingyao
Xu, Chao
Deng, Hong-Wen
Wang, Yu-Ping
An integrative imputation method based on multi-omics datasets
title An integrative imputation method based on multi-omics datasets
title_full An integrative imputation method based on multi-omics datasets
title_fullStr An integrative imputation method based on multi-omics datasets
title_full_unstemmed An integrative imputation method based on multi-omics datasets
title_short An integrative imputation method based on multi-omics datasets
title_sort integrative imputation method based on multi-omics datasets
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915152/
https://www.ncbi.nlm.nih.gov/pubmed/27329642
http://dx.doi.org/10.1186/s12859-016-1122-6
work_keys_str_mv AT lindongdong anintegrativeimputationmethodbasedonmultiomicsdatasets
AT zhangjigang anintegrativeimputationmethodbasedonmultiomicsdatasets
AT lijingyao anintegrativeimputationmethodbasedonmultiomicsdatasets
AT xuchao anintegrativeimputationmethodbasedonmultiomicsdatasets
AT denghongwen anintegrativeimputationmethodbasedonmultiomicsdatasets
AT wangyuping anintegrativeimputationmethodbasedonmultiomicsdatasets
AT lindongdong integrativeimputationmethodbasedonmultiomicsdatasets
AT zhangjigang integrativeimputationmethodbasedonmultiomicsdatasets
AT lijingyao integrativeimputationmethodbasedonmultiomicsdatasets
AT xuchao integrativeimputationmethodbasedonmultiomicsdatasets
AT denghongwen integrativeimputationmethodbasedonmultiomicsdatasets
AT wangyuping integrativeimputationmethodbasedonmultiomicsdatasets