Cargando…
Aristotle: stratified causal discovery for omics data
BACKGROUND: There has been a simultaneous increase in demand and accessibility across genomics, transcriptomics, proteomics and metabolomics data, known as omics data. This has encouraged widespread application of omics data in life sciences, from personalized medicine to the discovery of underlying...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760642/ https://www.ncbi.nlm.nih.gov/pubmed/35033007 http://dx.doi.org/10.1186/s12859-021-04521-w |
_version_ | 1784633364750270464 |
---|---|
author | Mansouri, Mehrdad Khakabimamaghani, Sahand Chindelevitch, Leonid Ester, Martin |
author_facet | Mansouri, Mehrdad Khakabimamaghani, Sahand Chindelevitch, Leonid Ester, Martin |
author_sort | Mansouri, Mehrdad |
collection | PubMed |
description | BACKGROUND: There has been a simultaneous increase in demand and accessibility across genomics, transcriptomics, proteomics and metabolomics data, known as omics data. This has encouraged widespread application of omics data in life sciences, from personalized medicine to the discovery of underlying pathophysiology of diseases. Causal analysis of omics data may provide important insight into the underlying biological mechanisms. Existing causal analysis methods yield promising results when identifying potential general causes of an observed outcome based on omics data. However, they may fail to discover the causes specific to a particular stratum of individuals and missing from others. METHODS: To fill this gap, we introduce the problem of stratified causal discovery and propose a method, Aristotle, for solving it. Aristotle addresses the two challenges intrinsic to omics data: high dimensionality and hidden stratification. It employs existing biological knowledge and a state-of-the-art patient stratification method to tackle the above challenges and applies a quasi-experimental design method to each stratum to find stratum-specific potential causes. RESULTS: Evaluation based on synthetic data shows better performance for Aristotle in discovering true causes under different conditions compared to existing causal discovery methods. Experiments on a real dataset on Anthracycline Cardiotoxicity indicate that Aristotle’s predictions are consistent with the existing literature. Moreover, Aristotle makes additional predictions that suggest further investigations. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04521-w. |
format | Online Article Text |
id | pubmed-8760642 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-87606422022-01-18 Aristotle: stratified causal discovery for omics data Mansouri, Mehrdad Khakabimamaghani, Sahand Chindelevitch, Leonid Ester, Martin BMC Bioinformatics Methodology Article BACKGROUND: There has been a simultaneous increase in demand and accessibility across genomics, transcriptomics, proteomics and metabolomics data, known as omics data. This has encouraged widespread application of omics data in life sciences, from personalized medicine to the discovery of underlying pathophysiology of diseases. Causal analysis of omics data may provide important insight into the underlying biological mechanisms. Existing causal analysis methods yield promising results when identifying potential general causes of an observed outcome based on omics data. However, they may fail to discover the causes specific to a particular stratum of individuals and missing from others. METHODS: To fill this gap, we introduce the problem of stratified causal discovery and propose a method, Aristotle, for solving it. Aristotle addresses the two challenges intrinsic to omics data: high dimensionality and hidden stratification. It employs existing biological knowledge and a state-of-the-art patient stratification method to tackle the above challenges and applies a quasi-experimental design method to each stratum to find stratum-specific potential causes. RESULTS: Evaluation based on synthetic data shows better performance for Aristotle in discovering true causes under different conditions compared to existing causal discovery methods. Experiments on a real dataset on Anthracycline Cardiotoxicity indicate that Aristotle’s predictions are consistent with the existing literature. Moreover, Aristotle makes additional predictions that suggest further investigations. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04521-w. BioMed Central 2022-01-15 /pmc/articles/PMC8760642/ /pubmed/35033007 http://dx.doi.org/10.1186/s12859-021-04521-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Mansouri, Mehrdad Khakabimamaghani, Sahand Chindelevitch, Leonid Ester, Martin Aristotle: stratified causal discovery for omics data |
title | Aristotle: stratified causal discovery for omics data |
title_full | Aristotle: stratified causal discovery for omics data |
title_fullStr | Aristotle: stratified causal discovery for omics data |
title_full_unstemmed | Aristotle: stratified causal discovery for omics data |
title_short | Aristotle: stratified causal discovery for omics data |
title_sort | aristotle: stratified causal discovery for omics data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8760642/ https://www.ncbi.nlm.nih.gov/pubmed/35033007 http://dx.doi.org/10.1186/s12859-021-04521-w |
work_keys_str_mv | AT mansourimehrdad aristotlestratifiedcausaldiscoveryforomicsdata AT khakabimamaghanisahand aristotlestratifiedcausaldiscoveryforomicsdata AT chindelevitchleonid aristotlestratifiedcausaldiscoveryforomicsdata AT estermartin aristotlestratifiedcausaldiscoveryforomicsdata |