Cargando…

MoSBi: Automated signature mining for molecular stratification and subtyping

The improving access to increasing amounts of biomedical data provides completely new chances for advanced patient stratification and disease subtyping strategies. This requires computational tools that produce uniformly robust results across highly heterogeneous molecular data. Unsupervised machine...

Descripción completa

Detalles Bibliográficos
Autores principales: Rose, Tim Daniel, Bechtler, Thibault, Ciora, Octavia-Andreea, Anh Lilian Le, Kim, Molnar, Florian, Köhler, Nikolai, Baumbach, Jan, Röttger, Richard, Pauling, Josch Konstantin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9169782/
https://www.ncbi.nlm.nih.gov/pubmed/35412913
http://dx.doi.org/10.1073/pnas.2118210119
_version_ 1784721273307267072
author Rose, Tim Daniel
Bechtler, Thibault
Ciora, Octavia-Andreea
Anh Lilian Le, Kim
Molnar, Florian
Köhler, Nikolai
Baumbach, Jan
Röttger, Richard
Pauling, Josch Konstantin
author_facet Rose, Tim Daniel
Bechtler, Thibault
Ciora, Octavia-Andreea
Anh Lilian Le, Kim
Molnar, Florian
Köhler, Nikolai
Baumbach, Jan
Röttger, Richard
Pauling, Josch Konstantin
author_sort Rose, Tim Daniel
collection PubMed
description The improving access to increasing amounts of biomedical data provides completely new chances for advanced patient stratification and disease subtyping strategies. This requires computational tools that produce uniformly robust results across highly heterogeneous molecular data. Unsupervised machine learning methodologies are able to discover de novo patterns in such data. Biclustering is especially suited by simultaneously identifying sample groups and corresponding feature sets across heterogeneous omics data. The performance of available biclustering algorithms heavily depends on individual parameterization and varies with their application. Here, we developed MoSBi (molecular signature identification using biclustering), an automated multialgorithm ensemble approach that integrates results utilizing an error model-supported similarity network. We systematically evaluated the performance of 11 available and established biclustering algorithms together with MoSBi. For this, we used transcriptomics, proteomics, and metabolomics data, as well as synthetic datasets covering various data properties. Profiting from multialgorithm integration, MoSBi identified robust group and disease-specific signatures across all scenarios, overcoming single algorithm specificities. Furthermore, we developed a scalable network-based visualization of bicluster communities that supports biological hypothesis generation. MoSBi is available as an R package and web service to make automated biclustering analysis accessible for application in molecular sample stratification.
format Online
Article
Text
id pubmed-9169782
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-91697822022-06-07 MoSBi: Automated signature mining for molecular stratification and subtyping Rose, Tim Daniel Bechtler, Thibault Ciora, Octavia-Andreea Anh Lilian Le, Kim Molnar, Florian Köhler, Nikolai Baumbach, Jan Röttger, Richard Pauling, Josch Konstantin Proc Natl Acad Sci U S A Biological Sciences The improving access to increasing amounts of biomedical data provides completely new chances for advanced patient stratification and disease subtyping strategies. This requires computational tools that produce uniformly robust results across highly heterogeneous molecular data. Unsupervised machine learning methodologies are able to discover de novo patterns in such data. Biclustering is especially suited by simultaneously identifying sample groups and corresponding feature sets across heterogeneous omics data. The performance of available biclustering algorithms heavily depends on individual parameterization and varies with their application. Here, we developed MoSBi (molecular signature identification using biclustering), an automated multialgorithm ensemble approach that integrates results utilizing an error model-supported similarity network. We systematically evaluated the performance of 11 available and established biclustering algorithms together with MoSBi. For this, we used transcriptomics, proteomics, and metabolomics data, as well as synthetic datasets covering various data properties. Profiting from multialgorithm integration, MoSBi identified robust group and disease-specific signatures across all scenarios, overcoming single algorithm specificities. Furthermore, we developed a scalable network-based visualization of bicluster communities that supports biological hypothesis generation. MoSBi is available as an R package and web service to make automated biclustering analysis accessible for application in molecular sample stratification. National Academy of Sciences 2022-04-11 2022-04-19 /pmc/articles/PMC9169782/ /pubmed/35412913 http://dx.doi.org/10.1073/pnas.2118210119 Text en Copyright © 2022 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Rose, Tim Daniel
Bechtler, Thibault
Ciora, Octavia-Andreea
Anh Lilian Le, Kim
Molnar, Florian
Köhler, Nikolai
Baumbach, Jan
Röttger, Richard
Pauling, Josch Konstantin
MoSBi: Automated signature mining for molecular stratification and subtyping
title MoSBi: Automated signature mining for molecular stratification and subtyping
title_full MoSBi: Automated signature mining for molecular stratification and subtyping
title_fullStr MoSBi: Automated signature mining for molecular stratification and subtyping
title_full_unstemmed MoSBi: Automated signature mining for molecular stratification and subtyping
title_short MoSBi: Automated signature mining for molecular stratification and subtyping
title_sort mosbi: automated signature mining for molecular stratification and subtyping
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9169782/
https://www.ncbi.nlm.nih.gov/pubmed/35412913
http://dx.doi.org/10.1073/pnas.2118210119
work_keys_str_mv AT rosetimdaniel mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT bechtlerthibault mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT cioraoctaviaandreea mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT anhlilianlekim mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT molnarflorian mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT kohlernikolai mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT baumbachjan mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT rottgerrichard mosbiautomatedsignatureminingformolecularstratificationandsubtyping
AT paulingjoschkonstantin mosbiautomatedsignatureminingformolecularstratificationandsubtyping