Cargando…

A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario

Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Hindberg, Kristian, Hannig, Jan, Godtliebsen, Fred
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6342313/
https://www.ncbi.nlm.nih.gov/pubmed/30668596
http://dx.doi.org/10.1371/journal.pone.0211044
_version_ 1783389105691820032
author Hindberg, Kristian
Hannig, Jan
Godtliebsen, Fred
author_facet Hindberg, Kristian
Hannig, Jan
Godtliebsen, Fred
author_sort Hindberg, Kristian
collection PubMed
description Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Low Sample Size situation, i.e. when n ≤ p. The output, a significance map, is produced by doing a one-dimensional test for all possible resolution/position pairs. The significance map shows for which resolution/position pairs the null hypothesis is rejected. For the testing of multinormality, the Anderson-Darling test is utilized to detect potential departures from multinormality at different combinations of resolutions and positions. In the k-sample case, it is tested whether k data sets can be said to originate from the same unspecified discrete or continuous multivariate distribution. This is done by testing the k vectors corresponding to the same resolution/position pair of the k different data sets through the k-sample Anderson-Darling test. Successful demonstrations of the new methodology on artificial and real data sets are presented, and a feature selection scheme is demonstrated.
format Online
Article
Text
id pubmed-6342313
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-63423132019-02-02 A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario Hindberg, Kristian Hannig, Jan Godtliebsen, Fred PLoS One Research Article Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Low Sample Size situation, i.e. when n ≤ p. The output, a significance map, is produced by doing a one-dimensional test for all possible resolution/position pairs. The significance map shows for which resolution/position pairs the null hypothesis is rejected. For the testing of multinormality, the Anderson-Darling test is utilized to detect potential departures from multinormality at different combinations of resolutions and positions. In the k-sample case, it is tested whether k data sets can be said to originate from the same unspecified discrete or continuous multivariate distribution. This is done by testing the k vectors corresponding to the same resolution/position pair of the k different data sets through the k-sample Anderson-Darling test. Successful demonstrations of the new methodology on artificial and real data sets are presented, and a feature selection scheme is demonstrated. Public Library of Science 2019-01-22 /pmc/articles/PMC6342313/ /pubmed/30668596 http://dx.doi.org/10.1371/journal.pone.0211044 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Hindberg, Kristian
Hannig, Jan
Godtliebsen, Fred
A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
title A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
title_full A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
title_fullStr A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
title_full_unstemmed A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
title_short A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
title_sort novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6342313/
https://www.ncbi.nlm.nih.gov/pubmed/30668596
http://dx.doi.org/10.1371/journal.pone.0211044
work_keys_str_mv AT hindbergkristian anovelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario
AT hannigjan anovelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario
AT godtliebsenfred anovelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario
AT hindbergkristian novelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario
AT hannigjan novelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario
AT godtliebsenfred novelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario