Cargando…

Partial identification in the statistical matching problem

The statistical matching problem involves the integration of multiple datasets where some variables are not observed jointly. This missing data pattern leaves most statistical models unidentifiable. Statistical inference is still possible when operating under the framework of partially identified mo...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahfock, Daniel, Pyne, Saumyadipta, Lee, Sharon X., McLachlan, Geoffrey J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5423529/
https://www.ncbi.nlm.nih.gov/pubmed/28496285
http://dx.doi.org/10.1016/j.csda.2016.06.005
_version_ 1783234961614045184
author Ahfock, Daniel
Pyne, Saumyadipta
Lee, Sharon X.
McLachlan, Geoffrey J.
author_facet Ahfock, Daniel
Pyne, Saumyadipta
Lee, Sharon X.
McLachlan, Geoffrey J.
author_sort Ahfock, Daniel
collection PubMed
description The statistical matching problem involves the integration of multiple datasets where some variables are not observed jointly. This missing data pattern leaves most statistical models unidentifiable. Statistical inference is still possible when operating under the framework of partially identified models, where the goal is to bound the parameters rather than to estimate them precisely. In many matching problems, developing feasible bounds on the parameters is equivalent to finding the set of positive-definite completions of a partially specified covariance matrix. Existing methods for characterising the set of possible completions do not extend to high-dimensional problems. A Gibbs sampler to draw from the set of possible completions is proposed. The variation in the observed samples gives an estimate of the feasible region of the parameters. The Gibbs sampler extends easily to high-dimensional statistical matching problems.
format Online
Article
Text
id pubmed-5423529
institution National Center for Biotechnology Information
language English
publishDate 2016
record_format MEDLINE/PubMed
spelling pubmed-54235292017-05-09 Partial identification in the statistical matching problem Ahfock, Daniel Pyne, Saumyadipta Lee, Sharon X. McLachlan, Geoffrey J. Comput Stat Data Anal Article The statistical matching problem involves the integration of multiple datasets where some variables are not observed jointly. This missing data pattern leaves most statistical models unidentifiable. Statistical inference is still possible when operating under the framework of partially identified models, where the goal is to bound the parameters rather than to estimate them precisely. In many matching problems, developing feasible bounds on the parameters is equivalent to finding the set of positive-definite completions of a partially specified covariance matrix. Existing methods for characterising the set of possible completions do not extend to high-dimensional problems. A Gibbs sampler to draw from the set of possible completions is proposed. The variation in the observed samples gives an estimate of the feasible region of the parameters. The Gibbs sampler extends easily to high-dimensional statistical matching problems. 2016-12 /pmc/articles/PMC5423529/ /pubmed/28496285 http://dx.doi.org/10.1016/j.csda.2016.06.005 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ).
spellingShingle Article
Ahfock, Daniel
Pyne, Saumyadipta
Lee, Sharon X.
McLachlan, Geoffrey J.
Partial identification in the statistical matching problem
title Partial identification in the statistical matching problem
title_full Partial identification in the statistical matching problem
title_fullStr Partial identification in the statistical matching problem
title_full_unstemmed Partial identification in the statistical matching problem
title_short Partial identification in the statistical matching problem
title_sort partial identification in the statistical matching problem
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5423529/
https://www.ncbi.nlm.nih.gov/pubmed/28496285
http://dx.doi.org/10.1016/j.csda.2016.06.005
work_keys_str_mv AT ahfockdaniel partialidentificationinthestatisticalmatchingproblem
AT pynesaumyadipta partialidentificationinthestatisticalmatchingproblem
AT leesharonx partialidentificationinthestatisticalmatchingproblem
AT mclachlangeoffreyj partialidentificationinthestatisticalmatchingproblem