Cargando…

An improved and explicit surrogate variable analysis procedure by coefficient adjustment

Unobserved environmental, demographic and technical factors canadversely affect the estimation and testing of the effects ofprimary variables. Surrogate variable analysis, proposed to tacklethis problem, has been widely used in genomic studies. To estimatehidden factors that are correlated with the...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Seunggeun, Sun, Wei, Wright, Fred A., Zou, Fei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627626/
https://www.ncbi.nlm.nih.gov/pubmed/29430031
http://dx.doi.org/10.1093/biomet/asx018
_version_ 1783268746270343168
author Lee, Seunggeun
Sun, Wei
Wright, Fred A.
Zou, Fei
author_facet Lee, Seunggeun
Sun, Wei
Wright, Fred A.
Zou, Fei
author_sort Lee, Seunggeun
collection PubMed
description Unobserved environmental, demographic and technical factors canadversely affect the estimation and testing of the effects ofprimary variables. Surrogate variable analysis, proposed to tacklethis problem, has been widely used in genomic studies. To estimatehidden factors that are correlated with the primary variables,surrogate variable analysis performs principal component analysiseither on a subset of features or on all features, but weightingeach differently. However, existing approaches may fail to identifyhidden factors that are strongly correlated with the primaryvariables, and the extra step of feature selection and weightcalculation makes the theoretical investigation of surrogatevariable analysis challenging. In this paper, we propose an improvedsurrogate variable analysis, using all measured features, that has anatural connection with restricted least squares, which allows us tostudy its theoretical properties. Simulation studies and real-dataanalysis show that the method is competitive with state-of-the-artmethods.
format Online
Article
Text
id pubmed-5627626
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-56276262018-06-01 An improved and explicit surrogate variable analysis procedure by coefficient adjustment Lee, Seunggeun Sun, Wei Wright, Fred A. Zou, Fei Biometrika Articles Unobserved environmental, demographic and technical factors canadversely affect the estimation and testing of the effects ofprimary variables. Surrogate variable analysis, proposed to tacklethis problem, has been widely used in genomic studies. To estimatehidden factors that are correlated with the primary variables,surrogate variable analysis performs principal component analysiseither on a subset of features or on all features, but weightingeach differently. However, existing approaches may fail to identifyhidden factors that are strongly correlated with the primaryvariables, and the extra step of feature selection and weightcalculation makes the theoretical investigation of surrogatevariable analysis challenging. In this paper, we propose an improvedsurrogate variable analysis, using all measured features, that has anatural connection with restricted least squares, which allows us tostudy its theoretical properties. Simulation studies and real-dataanalysis show that the method is competitive with state-of-the-artmethods. Oxford University Press 2017-06 2017-04-21 /pmc/articles/PMC5627626/ /pubmed/29430031 http://dx.doi.org/10.1093/biomet/asx018 Text en © 2017 Biometrika Trust
spellingShingle Articles
Lee, Seunggeun
Sun, Wei
Wright, Fred A.
Zou, Fei
An improved and explicit surrogate variable analysis procedure by coefficient adjustment
title An improved and explicit surrogate variable analysis procedure by coefficient adjustment
title_full An improved and explicit surrogate variable analysis procedure by coefficient adjustment
title_fullStr An improved and explicit surrogate variable analysis procedure by coefficient adjustment
title_full_unstemmed An improved and explicit surrogate variable analysis procedure by coefficient adjustment
title_short An improved and explicit surrogate variable analysis procedure by coefficient adjustment
title_sort improved and explicit surrogate variable analysis procedure by coefficient adjustment
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627626/
https://www.ncbi.nlm.nih.gov/pubmed/29430031
http://dx.doi.org/10.1093/biomet/asx018
work_keys_str_mv AT leeseunggeun animprovedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT sunwei animprovedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT wrightfreda animprovedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT zoufei animprovedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT leeseunggeun improvedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT sunwei improvedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT wrightfreda improvedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment
AT zoufei improvedandexplicitsurrogatevariableanalysisprocedurebycoefficientadjustment