Cargando…

Similarity-driven multi-view embeddings from high-dimensional biomedical data

Diverse, high-dimensional modalities collected in large cohorts present new opportunities for the formulation and testing of integrative scientific hypotheses. Similarity-driven multi-view linear reconstruction (SiMLR) is an algorithm that exploits inter-modality relationships to transform large sci...

Descripción completa

Detalles Bibliográficos
Autores principales: Avants, Brian B., Tustison, Nicholas J., Stone, James R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009088/
https://www.ncbi.nlm.nih.gov/pubmed/33796865
http://dx.doi.org/10.1038/s43588-021-00029-8
_version_ 1783672813634191360
author Avants, Brian B.
Tustison, Nicholas J.
Stone, James R.
author_facet Avants, Brian B.
Tustison, Nicholas J.
Stone, James R.
author_sort Avants, Brian B.
collection PubMed
description Diverse, high-dimensional modalities collected in large cohorts present new opportunities for the formulation and testing of integrative scientific hypotheses. Similarity-driven multi-view linear reconstruction (SiMLR) is an algorithm that exploits inter-modality relationships to transform large scientific datasets into smaller, more well-powered and interpretable low-dimensional spaces. SiMLR contributes an objective function for identifying joint signal, regularization based on sparse matrices representing prior within-modality relationships and an implementation that permits application to joint reduction of large data matrices. We demonstrate that SiMLR outperforms closely related methods on supervised learning problems in simulation data, a multi-omics cancer survival prediction dataset and multiple modality neuroimaging datasets. Taken together, this collection of results shows that SiMLR may be applied to joint signal estimation from disparate modalities and may yield practically useful results in a variety of application domains.
format Online
Article
Text
id pubmed-8009088
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-80090882021-08-01 Similarity-driven multi-view embeddings from high-dimensional biomedical data Avants, Brian B. Tustison, Nicholas J. Stone, James R. Nat Comput Sci Article Diverse, high-dimensional modalities collected in large cohorts present new opportunities for the formulation and testing of integrative scientific hypotheses. Similarity-driven multi-view linear reconstruction (SiMLR) is an algorithm that exploits inter-modality relationships to transform large scientific datasets into smaller, more well-powered and interpretable low-dimensional spaces. SiMLR contributes an objective function for identifying joint signal, regularization based on sparse matrices representing prior within-modality relationships and an implementation that permits application to joint reduction of large data matrices. We demonstrate that SiMLR outperforms closely related methods on supervised learning problems in simulation data, a multi-omics cancer survival prediction dataset and multiple modality neuroimaging datasets. Taken together, this collection of results shows that SiMLR may be applied to joint signal estimation from disparate modalities and may yield practically useful results in a variety of application domains. 2021-02-22 2021-02 /pmc/articles/PMC8009088/ /pubmed/33796865 http://dx.doi.org/10.1038/s43588-021-00029-8 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Avants, Brian B.
Tustison, Nicholas J.
Stone, James R.
Similarity-driven multi-view embeddings from high-dimensional biomedical data
title Similarity-driven multi-view embeddings from high-dimensional biomedical data
title_full Similarity-driven multi-view embeddings from high-dimensional biomedical data
title_fullStr Similarity-driven multi-view embeddings from high-dimensional biomedical data
title_full_unstemmed Similarity-driven multi-view embeddings from high-dimensional biomedical data
title_short Similarity-driven multi-view embeddings from high-dimensional biomedical data
title_sort similarity-driven multi-view embeddings from high-dimensional biomedical data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8009088/
https://www.ncbi.nlm.nih.gov/pubmed/33796865
http://dx.doi.org/10.1038/s43588-021-00029-8
work_keys_str_mv AT avantsbrianb similaritydrivenmultiviewembeddingsfromhighdimensionalbiomedicaldata
AT tustisonnicholasj similaritydrivenmultiviewembeddingsfromhighdimensionalbiomedicaldata
AT stonejamesr similaritydrivenmultiviewembeddingsfromhighdimensionalbiomedicaldata