Cargando…
A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario
Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Lo...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6342313/ https://www.ncbi.nlm.nih.gov/pubmed/30668596 http://dx.doi.org/10.1371/journal.pone.0211044 |
_version_ | 1783389105691820032 |
---|---|
author | Hindberg, Kristian Hannig, Jan Godtliebsen, Fred |
author_facet | Hindberg, Kristian Hannig, Jan Godtliebsen, Fred |
author_sort | Hindberg, Kristian |
collection | PubMed |
description | Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Low Sample Size situation, i.e. when n ≤ p. The output, a significance map, is produced by doing a one-dimensional test for all possible resolution/position pairs. The significance map shows for which resolution/position pairs the null hypothesis is rejected. For the testing of multinormality, the Anderson-Darling test is utilized to detect potential departures from multinormality at different combinations of resolutions and positions. In the k-sample case, it is tested whether k data sets can be said to originate from the same unspecified discrete or continuous multivariate distribution. This is done by testing the k vectors corresponding to the same resolution/position pair of the k different data sets through the k-sample Anderson-Darling test. Successful demonstrations of the new methodology on artificial and real data sets are presented, and a feature selection scheme is demonstrated. |
format | Online Article Text |
id | pubmed-6342313 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-63423132019-02-02 A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario Hindberg, Kristian Hannig, Jan Godtliebsen, Fred PLoS One Research Article Two classical multivariate statistical problems, testing of multivariate normality and the k-sample problem, are explored by a novel analysis on several resolutions simultaneously. The presented methods do not invert any estimated covariance matrix. Thereby, the methods work in the High Dimension Low Sample Size situation, i.e. when n ≤ p. The output, a significance map, is produced by doing a one-dimensional test for all possible resolution/position pairs. The significance map shows for which resolution/position pairs the null hypothesis is rejected. For the testing of multinormality, the Anderson-Darling test is utilized to detect potential departures from multinormality at different combinations of resolutions and positions. In the k-sample case, it is tested whether k data sets can be said to originate from the same unspecified discrete or continuous multivariate distribution. This is done by testing the k vectors corresponding to the same resolution/position pair of the k different data sets through the k-sample Anderson-Darling test. Successful demonstrations of the new methodology on artificial and real data sets are presented, and a feature selection scheme is demonstrated. Public Library of Science 2019-01-22 /pmc/articles/PMC6342313/ /pubmed/30668596 http://dx.doi.org/10.1371/journal.pone.0211044 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article Hindberg, Kristian Hannig, Jan Godtliebsen, Fred A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
title | A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
title_full | A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
title_fullStr | A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
title_full_unstemmed | A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
title_short | A novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
title_sort | novel scale-space approach for multinormality testing and the k-sample problem in the high dimension low sample size scenario |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6342313/ https://www.ncbi.nlm.nih.gov/pubmed/30668596 http://dx.doi.org/10.1371/journal.pone.0211044 |
work_keys_str_mv | AT hindbergkristian anovelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario AT hannigjan anovelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario AT godtliebsenfred anovelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario AT hindbergkristian novelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario AT hannigjan novelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario AT godtliebsenfred novelscalespaceapproachformultinormalitytestingandtheksampleprobleminthehighdimensionlowsamplesizescenario |