Cargando…
Identification of Significant Features by the Global Mean Rank Test
With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such a...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4132091/ https://www.ncbi.nlm.nih.gov/pubmed/25119995 http://dx.doi.org/10.1371/journal.pone.0104504 |
_version_ | 1782330566615498752 |
---|---|
author | Klammer, Martin Dybowski, J. Nikolaj Hoffmann, Daniel Schaab, Christoph |
author_facet | Klammer, Martin Dybowski, J. Nikolaj Hoffmann, Daniel Schaab, Christoph |
author_sort | Klammer, Martin |
collection | PubMed |
description | With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data is accounted for without the need of imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similar or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use. |
format | Online Article Text |
id | pubmed-4132091 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41320912014-08-19 Identification of Significant Features by the Global Mean Rank Test Klammer, Martin Dybowski, J. Nikolaj Hoffmann, Daniel Schaab, Christoph PLoS One Research Article With the introduction of omics-technologies such as transcriptomics and proteomics, numerous methods for the reliable identification of significantly regulated features (genes, proteins, etc.) have been developed. Experimental practice requires these tests to successfully deal with conditions such as small numbers of replicates, missing values, non-normally distributed expression levels, and non-identical distributions of features. With the MeanRank test we aimed at developing a test that performs robustly under these conditions, while favorably scaling with the number of replicates. The test proposed here is a global one-sample location test, which is based on the mean ranks across replicates, and internally estimates and controls the false discovery rate. Furthermore, missing data is accounted for without the need of imputation. In extensive simulations comparing MeanRank to other frequently used methods, we found that it performs well with small and large numbers of replicates, feature dependent variance between replicates, and variable regulation across features on simulation data and a recent two-color microarray spike-in dataset. The tests were then used to identify significant changes in the phosphoproteomes of cancer cells induced by the kinase inhibitors erlotinib and 3-MB-PP1 in two independently published mass spectrometry-based studies. MeanRank outperformed the other global rank-based methods applied in this study. Compared to the popular Significance Analysis of Microarrays and Linear Models for Microarray methods, MeanRank performed similar or better. Furthermore, MeanRank exhibits more consistent behavior regarding the degree of regulation and is robust against the choice of preprocessing methods. MeanRank does not require any imputation of missing values, is easy to understand, and yields results that are easy to interpret. The software implementing the algorithm is freely available for academic and commercial use. Public Library of Science 2014-08-13 /pmc/articles/PMC4132091/ /pubmed/25119995 http://dx.doi.org/10.1371/journal.pone.0104504 Text en © 2014 Klammer et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Klammer, Martin Dybowski, J. Nikolaj Hoffmann, Daniel Schaab, Christoph Identification of Significant Features by the Global Mean Rank Test |
title | Identification of Significant Features by the Global Mean Rank Test |
title_full | Identification of Significant Features by the Global Mean Rank Test |
title_fullStr | Identification of Significant Features by the Global Mean Rank Test |
title_full_unstemmed | Identification of Significant Features by the Global Mean Rank Test |
title_short | Identification of Significant Features by the Global Mean Rank Test |
title_sort | identification of significant features by the global mean rank test |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4132091/ https://www.ncbi.nlm.nih.gov/pubmed/25119995 http://dx.doi.org/10.1371/journal.pone.0104504 |
work_keys_str_mv | AT klammermartin identificationofsignificantfeaturesbytheglobalmeanranktest AT dybowskijnikolaj identificationofsignificantfeaturesbytheglobalmeanranktest AT hoffmanndaniel identificationofsignificantfeaturesbytheglobalmeanranktest AT schaabchristoph identificationofsignificantfeaturesbytheglobalmeanranktest |