Cargando…

The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer

Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving signi...

Descripción completa

Detalles Bibliográficos
Autores principales: Aghdam, Rosa, Baghfalaki, Taban, Khosravi, Pegah, Saberi Ansari, Elnaz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5828654/
https://www.ncbi.nlm.nih.gov/pubmed/29247873
http://dx.doi.org/10.1016/j.gpb.2017.08.003
_version_ 1783302675903807488
author Aghdam, Rosa
Baghfalaki, Taban
Khosravi, Pegah
Saberi Ansari, Elnaz
author_facet Aghdam, Rosa
Baghfalaki, Taban
Khosravi, Pegah
Saberi Ansari, Elnaz
author_sort Aghdam, Rosa
collection PubMed
description Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving significant genes and pathways. In the first step, 5% genes are considered in random for two types of ignorable and non-ignorable missingness mechanisms with various missing rates. Next, 10 well-known imputation methods were applied to the complete datasets. The significance analysis of microarrays (SAM) method was applied to detect the significant genes in rectal and lung cancers to showcase the utility of imputation approaches in preserving significant genes. To determine the impact of different imputation methods on the identification of important genes, the chi-squared test was used to compare the proportions of overlaps between significant genes detected from original data and those detected from the imputed datasets. Additionally, the significant genes are tested for their enrichment in important pathways, using the ConsensusPathDB. Our results showed that almost all the significant genes and pathways of the original dataset can be detected in all imputed datasets, indicating that there is no significant difference in the performance of various imputation methods tested. The source code and selected datasets are available on http://profiles.bs.ipm.ir/softwares/imputation_methods/.
format Online
Article
Text
id pubmed-5828654
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-58286542018-02-28 The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer Aghdam, Rosa Baghfalaki, Taban Khosravi, Pegah Saberi Ansari, Elnaz Genomics Proteomics Bioinformatics Application Note Deciphering important genes and pathways from incomplete gene expression data could facilitate a better understanding of cancer. Different imputation methods can be applied to estimate the missing values. In our study, we evaluated various imputation methods for their performance in preserving significant genes and pathways. In the first step, 5% genes are considered in random for two types of ignorable and non-ignorable missingness mechanisms with various missing rates. Next, 10 well-known imputation methods were applied to the complete datasets. The significance analysis of microarrays (SAM) method was applied to detect the significant genes in rectal and lung cancers to showcase the utility of imputation approaches in preserving significant genes. To determine the impact of different imputation methods on the identification of important genes, the chi-squared test was used to compare the proportions of overlaps between significant genes detected from original data and those detected from the imputed datasets. Additionally, the significant genes are tested for their enrichment in important pathways, using the ConsensusPathDB. Our results showed that almost all the significant genes and pathways of the original dataset can be detected in all imputed datasets, indicating that there is no significant difference in the performance of various imputation methods tested. The source code and selected datasets are available on http://profiles.bs.ipm.ir/softwares/imputation_methods/. Elsevier 2017-12 2017-12-13 /pmc/articles/PMC5828654/ /pubmed/29247873 http://dx.doi.org/10.1016/j.gpb.2017.08.003 Text en © 2017 The Authors. Production and hosting by Elsevier B.V. http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Application Note
Aghdam, Rosa
Baghfalaki, Taban
Khosravi, Pegah
Saberi Ansari, Elnaz
The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer
title The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer
title_full The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer
title_fullStr The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer
title_full_unstemmed The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer
title_short The Ability of Different Imputation Methods to Preserve the Significant Genes and Pathways in Cancer
title_sort ability of different imputation methods to preserve the significant genes and pathways in cancer
topic Application Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5828654/
https://www.ncbi.nlm.nih.gov/pubmed/29247873
http://dx.doi.org/10.1016/j.gpb.2017.08.003
work_keys_str_mv AT aghdamrosa theabilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT baghfalakitaban theabilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT khosravipegah theabilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT saberiansarielnaz theabilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT aghdamrosa abilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT baghfalakitaban abilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT khosravipegah abilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer
AT saberiansarielnaz abilityofdifferentimputationmethodstopreservethesignificantgenesandpathwaysincancer