Cargando…

Imputing gene expression from optimally reduced probe sets

Measuring complete gene expression profiles for a large number of experiments is costly. We propose an approach in which a small subset of probes is selected based on a preliminary set of full expression profiles. In subsequent experiments, only the subset is measured, and the missing values are imp...

Descripción completa

Detalles Bibliográficos
Autores principales: Donner, Yoni, Feng, Ting, Benoist, Christophe, Koller, Daphne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3505039/
https://www.ncbi.nlm.nih.gov/pubmed/23064520
http://dx.doi.org/10.1038/nmeth.2207
Descripción
Sumario:Measuring complete gene expression profiles for a large number of experiments is costly. We propose an approach in which a small subset of probes is selected based on a preliminary set of full expression profiles. In subsequent experiments, only the subset is measured, and the missing values are imputed. We develop several algorithms to simultaneously select probes and impute missing values, and demonstrate that these probe selection for imputation (PSI) algorithms can successfully reconstruct missing gene expression values in a wide variety of applications, as evaluated using multiple metrics of biological importance. We analyze the performance of PSI methods under varying conditions, provide guidelines for choosing the optimal method based on the experimental setting, and indicate how to estimate imputation accuracy. Finally, we apply our approach to a large-scale study of immune system variation.