Cargando…

Scalable analysis of multi-modal biomedical data

BACKGROUND: Targeted diagnosis and treatment options are dependent on insights drawn from multi-modal analysis of large-scale biomedical datasets. Advances in genomics sequencing, image processing, and medical data management have supported data collection and management within medical institutions....

Descripción completa

Detalles Bibliográficos
Autores principales:	Smith, Jaclyn, Shi, Yao, Benedikt, Michael, Nikolic, Milos
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2021
Materias:	Technical Note
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434767/ https://www.ncbi.nlm.nih.gov/pubmed/34508579 http://dx.doi.org/10.1093/gigascience/giab058

_version_	1783751674449362944
author	Smith, Jaclyn Shi, Yao Benedikt, Michael Nikolic, Milos
author_facet	Smith, Jaclyn Shi, Yao Benedikt, Michael Nikolic, Milos
author_sort	Smith, Jaclyn
collection	PubMed
description	BACKGROUND: Targeted diagnosis and treatment options are dependent on insights drawn from multi-modal analysis of large-scale biomedical datasets. Advances in genomics sequencing, image processing, and medical data management have supported data collection and management within medical institutions. These efforts have produced large-scale datasets and have enabled integrative analyses that provide a more thorough look of the impact of a disease on the underlying system. The integration of large-scale biomedical data commonly involves several complex data transformation steps, such as combining datasets to build feature vectors for learning analysis. Thus, scalable data integration solutions play a key role in the future of targeted medicine. Though large-scale data processing frameworks have shown promising performance for many domains, they fail to support scalable processing of complex datatypes. SOLUTION: To address these issues and achieve scalable processing of multi-modal biomedical data, we present TraNCE, a framework that automates the difficulties of designing distributed analyses with complex biomedical data types. PERFORMANCE: We outline research and clinical applications for the platform, including data integration support for building feature sets for classification. We show that the system is capable of outperforming the common alternative, based on “flattening” complex data structures, and runs efficiently when alternative approaches are unable to perform at all.
format	Online Article Text
id	pubmed-8434767
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-84347672021-09-13 Scalable analysis of multi-modal biomedical data Smith, Jaclyn Shi, Yao Benedikt, Michael Nikolic, Milos Gigascience Technical Note BACKGROUND: Targeted diagnosis and treatment options are dependent on insights drawn from multi-modal analysis of large-scale biomedical datasets. Advances in genomics sequencing, image processing, and medical data management have supported data collection and management within medical institutions. These efforts have produced large-scale datasets and have enabled integrative analyses that provide a more thorough look of the impact of a disease on the underlying system. The integration of large-scale biomedical data commonly involves several complex data transformation steps, such as combining datasets to build feature vectors for learning analysis. Thus, scalable data integration solutions play a key role in the future of targeted medicine. Though large-scale data processing frameworks have shown promising performance for many domains, they fail to support scalable processing of complex datatypes. SOLUTION: To address these issues and achieve scalable processing of multi-modal biomedical data, we present TraNCE, a framework that automates the difficulties of designing distributed analyses with complex biomedical data types. PERFORMANCE: We outline research and clinical applications for the platform, including data integration support for building feature sets for classification. We show that the system is capable of outperforming the common alternative, based on “flattening” complex data structures, and runs efficiently when alternative approaches are unable to perform at all. Oxford University Press 2021-09-11 /pmc/articles/PMC8434767/ /pubmed/34508579 http://dx.doi.org/10.1093/gigascience/giab058 Text en © The Author(s) 2021. Published by Oxford University Press GigaScience. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Technical Note Smith, Jaclyn Shi, Yao Benedikt, Michael Nikolic, Milos Scalable analysis of multi-modal biomedical data
title	Scalable analysis of multi-modal biomedical data
title_full	Scalable analysis of multi-modal biomedical data
title_fullStr	Scalable analysis of multi-modal biomedical data
title_full_unstemmed	Scalable analysis of multi-modal biomedical data
title_short	Scalable analysis of multi-modal biomedical data
title_sort	scalable analysis of multi-modal biomedical data
topic	Technical Note
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8434767/ https://www.ncbi.nlm.nih.gov/pubmed/34508579 http://dx.doi.org/10.1093/gigascience/giab058
work_keys_str_mv	AT smithjaclyn scalableanalysisofmultimodalbiomedicaldata AT shiyao scalableanalysisofmultimodalbiomedicaldata AT benediktmichael scalableanalysisofmultimodalbiomedicaldata AT nikolicmilos scalableanalysisofmultimodalbiomedicaldata

Scalable analysis of multi-modal biomedical data

Ejemplares similares