Cargando…

ANIMA: Association network integration for multiscale analysis

Contextual functional interpretation of -omics data derived from clinical samples is a classical and difficult problem in computational systems biology. The measurement of thousands of data points on single samples has become routine but relating ‘big data’ datasets to the complexities of human path...

Descripción completa

Detalles Bibliográficos
Autores principales: Deffur, Armin, Wilkinson, Robert J., Mayosi, Bongani M., Mulder, Nicola M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134339/
https://www.ncbi.nlm.nih.gov/pubmed/30271886
http://dx.doi.org/10.12688/wellcomeopenres.14073.3
_version_ 1783354649973096448
author Deffur, Armin
Wilkinson, Robert J.
Mayosi, Bongani M.
Mulder, Nicola M.
author_facet Deffur, Armin
Wilkinson, Robert J.
Mayosi, Bongani M.
Mulder, Nicola M.
author_sort Deffur, Armin
collection PubMed
description Contextual functional interpretation of -omics data derived from clinical samples is a classical and difficult problem in computational systems biology. The measurement of thousands of data points on single samples has become routine but relating ‘big data’ datasets to the complexities of human pathobiology is an area of ongoing research. Complicating this is the fact that many publicly available datasets use bulk transcriptomics data from complex tissues like blood. The most prevalent analytic approaches derive molecular ‘signatures’ of disease states or apply modular analysis frameworks to the data. Here we describe ANIMA (association network integration for multiscale analysis), a network-based data integration method using clinical phenotype and microarray data as inputs. ANIMA is implemented in R and Neo4j and runs in Docker containers. In short, the build algorithm iterates over one or more transcriptomics datasets to generate a large, multipartite association network by executing multiple independent analytic steps (differential expression, deconvolution, modular analysis based on co-expression, pathway analysis) and integrating the results. Once the network is built, it can be queried directly using Cypher (a graph query language), or by custom functions that communicate with the graph database via language-specific APIs. We developed a web application using Shiny, which provides fully interactive, multiscale views of the data. Using our approach, we show that we can reconstruct multiple features of disease states at various scales of organization, from transcript abundance patterns of individual genes through co-expression patterns of groups of genes to patterns of cellular behaviour in whole blood samples, both in single experiments as well in meta-analyses of multiple datasets.
format Online
Article
Text
id pubmed-6134339
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-61343392018-09-27 ANIMA: Association network integration for multiscale analysis Deffur, Armin Wilkinson, Robert J. Mayosi, Bongani M. Mulder, Nicola M. Wellcome Open Res Method Article Contextual functional interpretation of -omics data derived from clinical samples is a classical and difficult problem in computational systems biology. The measurement of thousands of data points on single samples has become routine but relating ‘big data’ datasets to the complexities of human pathobiology is an area of ongoing research. Complicating this is the fact that many publicly available datasets use bulk transcriptomics data from complex tissues like blood. The most prevalent analytic approaches derive molecular ‘signatures’ of disease states or apply modular analysis frameworks to the data. Here we describe ANIMA (association network integration for multiscale analysis), a network-based data integration method using clinical phenotype and microarray data as inputs. ANIMA is implemented in R and Neo4j and runs in Docker containers. In short, the build algorithm iterates over one or more transcriptomics datasets to generate a large, multipartite association network by executing multiple independent analytic steps (differential expression, deconvolution, modular analysis based on co-expression, pathway analysis) and integrating the results. Once the network is built, it can be queried directly using Cypher (a graph query language), or by custom functions that communicate with the graph database via language-specific APIs. We developed a web application using Shiny, which provides fully interactive, multiscale views of the data. Using our approach, we show that we can reconstruct multiple features of disease states at various scales of organization, from transcript abundance patterns of individual genes through co-expression patterns of groups of genes to patterns of cellular behaviour in whole blood samples, both in single experiments as well in meta-analyses of multiple datasets. F1000 Research Limited 2018-11-14 /pmc/articles/PMC6134339/ /pubmed/30271886 http://dx.doi.org/10.12688/wellcomeopenres.14073.3 Text en Copyright: © 2018 Deffur A et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Deffur, Armin
Wilkinson, Robert J.
Mayosi, Bongani M.
Mulder, Nicola M.
ANIMA: Association network integration for multiscale analysis
title ANIMA: Association network integration for multiscale analysis
title_full ANIMA: Association network integration for multiscale analysis
title_fullStr ANIMA: Association network integration for multiscale analysis
title_full_unstemmed ANIMA: Association network integration for multiscale analysis
title_short ANIMA: Association network integration for multiscale analysis
title_sort anima: association network integration for multiscale analysis
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6134339/
https://www.ncbi.nlm.nih.gov/pubmed/30271886
http://dx.doi.org/10.12688/wellcomeopenres.14073.3
work_keys_str_mv AT deffurarmin animaassociationnetworkintegrationformultiscaleanalysis
AT wilkinsonrobertj animaassociationnetworkintegrationformultiscaleanalysis
AT mayosibonganim animaassociationnetworkintegrationformultiscaleanalysis
AT muldernicolam animaassociationnetworkintegrationformultiscaleanalysis