Cargando…
SGI: automatic clinical subgroup identification in omics datasets
SUMMARY: The ‘Subgroup Identification’ (SGI) toolbox provides an algorithm to automatically detect clinical subgroups of samples in large-scale omics datasets. It is based on hierarchical clustering trees in combination with a specifically designed association testing and visualization framework tha...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8723155/ https://www.ncbi.nlm.nih.gov/pubmed/34529048 http://dx.doi.org/10.1093/bioinformatics/btab656 |
_version_ | 1784625652527267840 |
---|---|
author | Buyukozkan, Mustafa Suhre, Karsten Krumsiek, Jan |
author_facet | Buyukozkan, Mustafa Suhre, Karsten Krumsiek, Jan |
author_sort | Buyukozkan, Mustafa |
collection | PubMed |
description | SUMMARY: The ‘Subgroup Identification’ (SGI) toolbox provides an algorithm to automatically detect clinical subgroups of samples in large-scale omics datasets. It is based on hierarchical clustering trees in combination with a specifically designed association testing and visualization framework that can process an arbitrary number of clinical parameters and outcomes in a systematic fashion. A multi-block extension allows for the simultaneous use of multiple omics datasets on the same samples. In this article, we first describe the functionality of the toolbox and then demonstrate its capabilities through application examples on a type 2 diabetes metabolomics study as well as two copy number variation datasets from The Cancer Genome Atlas. AVAILABILITY AND IMPLEMENTATION: SGI is an open-source package implemented in R. Package source codes and hands-on tutorials are available at https://github.com/krumsieklab/sgi. The QMdiab metabolomics data is included in the package and can be downloaded from https://doi.org/10.6084/m9.figshare.5904022. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8723155 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87231552022-01-05 SGI: automatic clinical subgroup identification in omics datasets Buyukozkan, Mustafa Suhre, Karsten Krumsiek, Jan Bioinformatics Applications Notes SUMMARY: The ‘Subgroup Identification’ (SGI) toolbox provides an algorithm to automatically detect clinical subgroups of samples in large-scale omics datasets. It is based on hierarchical clustering trees in combination with a specifically designed association testing and visualization framework that can process an arbitrary number of clinical parameters and outcomes in a systematic fashion. A multi-block extension allows for the simultaneous use of multiple omics datasets on the same samples. In this article, we first describe the functionality of the toolbox and then demonstrate its capabilities through application examples on a type 2 diabetes metabolomics study as well as two copy number variation datasets from The Cancer Genome Atlas. AVAILABILITY AND IMPLEMENTATION: SGI is an open-source package implemented in R. Package source codes and hands-on tutorials are available at https://github.com/krumsieklab/sgi. The QMdiab metabolomics data is included in the package and can be downloaded from https://doi.org/10.6084/m9.figshare.5904022. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-09-16 /pmc/articles/PMC8723155/ /pubmed/34529048 http://dx.doi.org/10.1093/bioinformatics/btab656 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Buyukozkan, Mustafa Suhre, Karsten Krumsiek, Jan SGI: automatic clinical subgroup identification in omics datasets |
title | SGI: automatic clinical subgroup identification in omics datasets |
title_full | SGI: automatic clinical subgroup identification in omics datasets |
title_fullStr | SGI: automatic clinical subgroup identification in omics datasets |
title_full_unstemmed | SGI: automatic clinical subgroup identification in omics datasets |
title_short | SGI: automatic clinical subgroup identification in omics datasets |
title_sort | sgi: automatic clinical subgroup identification in omics datasets |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8723155/ https://www.ncbi.nlm.nih.gov/pubmed/34529048 http://dx.doi.org/10.1093/bioinformatics/btab656 |
work_keys_str_mv | AT buyukozkanmustafa sgiautomaticclinicalsubgroupidentificationinomicsdatasets AT suhrekarsten sgiautomaticclinicalsubgroupidentificationinomicsdatasets AT krumsiekjan sgiautomaticclinicalsubgroupidentificationinomicsdatasets |