Cargando…
Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
Distinct gene expression patterns within cells are foundational for the diversity of functions and unique characteristics observed in specific contexts, such as human tissues and cell types. Though some biological processes commonly occur across contexts, by harnessing the vast amounts of available...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9202106/ https://www.ncbi.nlm.nih.gov/pubmed/35705903 http://dx.doi.org/10.1186/s12859-022-04765-0 |
_version_ | 1784728460752584704 |
---|---|
author | Figueiredo, Rebeca Queiroz del Ser, Sara Díaz Raschka, Tamara Hofmann-Apitius, Martin Kodamullil, Alpha Tom Mubeen, Sarah Domingo-Fernández, Daniel |
author_facet | Figueiredo, Rebeca Queiroz del Ser, Sara Díaz Raschka, Tamara Hofmann-Apitius, Martin Kodamullil, Alpha Tom Mubeen, Sarah Domingo-Fernández, Daniel |
author_sort | Figueiredo, Rebeca Queiroz |
collection | PubMed |
description | Distinct gene expression patterns within cells are foundational for the diversity of functions and unique characteristics observed in specific contexts, such as human tissues and cell types. Though some biological processes commonly occur across contexts, by harnessing the vast amounts of available gene expression data, we can decipher the processes that are unique to a specific context. Therefore, with the goal of developing a portrait of context-specific patterns to better elucidate how they govern distinct biological processes, this work presents a large-scale exploration of transcriptomic signatures across three different contexts (i.e., tissues, cell types, and cell lines) by leveraging over 600 gene expression datasets categorized into 98 subcontexts. The strongest pairwise correlations between genes from these subcontexts are used for the construction of co-expression networks. Using a network-based approach, we then pinpoint patterns that are unique and common across these subcontexts. First, we focused on patterns at the level of individual nodes and evaluated their functional roles using a human protein–protein interactome as a referential network. Next, within each context, we systematically overlaid the co-expression networks to identify specific and shared correlations as well as relations already described in scientific literature. Additionally, in a pathway-level analysis, we overlaid node and edge sets from co-expression networks against pathway knowledge to identify biological processes that are related to specific subcontexts or groups of them. Finally, we have released our data and scripts at https://zenodo.org/record/5831786 and https://github.com/ContNeXt/, respectively and developed ContNeXt (https://contnext.scai.fraunhofer.de/), a web application to explore the networks generated in this work. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04765-0. |
format | Online Article Text |
id | pubmed-9202106 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92021062022-06-17 Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets Figueiredo, Rebeca Queiroz del Ser, Sara Díaz Raschka, Tamara Hofmann-Apitius, Martin Kodamullil, Alpha Tom Mubeen, Sarah Domingo-Fernández, Daniel BMC Bioinformatics Research Distinct gene expression patterns within cells are foundational for the diversity of functions and unique characteristics observed in specific contexts, such as human tissues and cell types. Though some biological processes commonly occur across contexts, by harnessing the vast amounts of available gene expression data, we can decipher the processes that are unique to a specific context. Therefore, with the goal of developing a portrait of context-specific patterns to better elucidate how they govern distinct biological processes, this work presents a large-scale exploration of transcriptomic signatures across three different contexts (i.e., tissues, cell types, and cell lines) by leveraging over 600 gene expression datasets categorized into 98 subcontexts. The strongest pairwise correlations between genes from these subcontexts are used for the construction of co-expression networks. Using a network-based approach, we then pinpoint patterns that are unique and common across these subcontexts. First, we focused on patterns at the level of individual nodes and evaluated their functional roles using a human protein–protein interactome as a referential network. Next, within each context, we systematically overlaid the co-expression networks to identify specific and shared correlations as well as relations already described in scientific literature. Additionally, in a pathway-level analysis, we overlaid node and edge sets from co-expression networks against pathway knowledge to identify biological processes that are related to specific subcontexts or groups of them. Finally, we have released our data and scripts at https://zenodo.org/record/5831786 and https://github.com/ContNeXt/, respectively and developed ContNeXt (https://contnext.scai.fraunhofer.de/), a web application to explore the networks generated in this work. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04765-0. BioMed Central 2022-06-15 /pmc/articles/PMC9202106/ /pubmed/35705903 http://dx.doi.org/10.1186/s12859-022-04765-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Figueiredo, Rebeca Queiroz del Ser, Sara Díaz Raschka, Tamara Hofmann-Apitius, Martin Kodamullil, Alpha Tom Mubeen, Sarah Domingo-Fernández, Daniel Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
title | Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
title_full | Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
title_fullStr | Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
title_full_unstemmed | Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
title_short | Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
title_sort | elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9202106/ https://www.ncbi.nlm.nih.gov/pubmed/35705903 http://dx.doi.org/10.1186/s12859-022-04765-0 |
work_keys_str_mv | AT figueiredorebecaqueiroz elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets AT delsersaradiaz elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets AT raschkatamara elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets AT hofmannapitiusmartin elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets AT kodamullilalphatom elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets AT mubeensarah elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets AT domingofernandezdaniel elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets |