Cargando…

Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets

Distinct gene expression patterns within cells are foundational for the diversity of functions and unique characteristics observed in specific contexts, such as human tissues and cell types. Though some biological processes commonly occur across contexts, by harnessing the vast amounts of available...

Descripción completa

Detalles Bibliográficos
Autores principales: Figueiredo, Rebeca Queiroz, del Ser, Sara Díaz, Raschka, Tamara, Hofmann-Apitius, Martin, Kodamullil, Alpha Tom, Mubeen, Sarah, Domingo-Fernández, Daniel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9202106/
https://www.ncbi.nlm.nih.gov/pubmed/35705903
http://dx.doi.org/10.1186/s12859-022-04765-0
_version_ 1784728460752584704
author Figueiredo, Rebeca Queiroz
del Ser, Sara Díaz
Raschka, Tamara
Hofmann-Apitius, Martin
Kodamullil, Alpha Tom
Mubeen, Sarah
Domingo-Fernández, Daniel
author_facet Figueiredo, Rebeca Queiroz
del Ser, Sara Díaz
Raschka, Tamara
Hofmann-Apitius, Martin
Kodamullil, Alpha Tom
Mubeen, Sarah
Domingo-Fernández, Daniel
author_sort Figueiredo, Rebeca Queiroz
collection PubMed
description Distinct gene expression patterns within cells are foundational for the diversity of functions and unique characteristics observed in specific contexts, such as human tissues and cell types. Though some biological processes commonly occur across contexts, by harnessing the vast amounts of available gene expression data, we can decipher the processes that are unique to a specific context. Therefore, with the goal of developing a portrait of context-specific patterns to better elucidate how they govern distinct biological processes, this work presents a large-scale exploration of transcriptomic signatures across three different contexts (i.e., tissues, cell types, and cell lines) by leveraging over 600 gene expression datasets categorized into 98 subcontexts. The strongest pairwise correlations between genes from these subcontexts are used for the construction of co-expression networks. Using a network-based approach, we then pinpoint patterns that are unique and common across these subcontexts. First, we focused on patterns at the level of individual nodes and evaluated their functional roles using a human protein–protein interactome as a referential network. Next, within each context, we systematically overlaid the co-expression networks to identify specific and shared correlations as well as relations already described in scientific literature. Additionally, in a pathway-level analysis, we overlaid node and edge sets from co-expression networks against pathway knowledge to identify biological processes that are related to specific subcontexts or groups of them. Finally, we have released our data and scripts at https://zenodo.org/record/5831786 and https://github.com/ContNeXt/, respectively and developed ContNeXt (https://contnext.scai.fraunhofer.de/), a web application to explore the networks generated in this work. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04765-0.
format Online
Article
Text
id pubmed-9202106
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92021062022-06-17 Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets Figueiredo, Rebeca Queiroz del Ser, Sara Díaz Raschka, Tamara Hofmann-Apitius, Martin Kodamullil, Alpha Tom Mubeen, Sarah Domingo-Fernández, Daniel BMC Bioinformatics Research Distinct gene expression patterns within cells are foundational for the diversity of functions and unique characteristics observed in specific contexts, such as human tissues and cell types. Though some biological processes commonly occur across contexts, by harnessing the vast amounts of available gene expression data, we can decipher the processes that are unique to a specific context. Therefore, with the goal of developing a portrait of context-specific patterns to better elucidate how they govern distinct biological processes, this work presents a large-scale exploration of transcriptomic signatures across three different contexts (i.e., tissues, cell types, and cell lines) by leveraging over 600 gene expression datasets categorized into 98 subcontexts. The strongest pairwise correlations between genes from these subcontexts are used for the construction of co-expression networks. Using a network-based approach, we then pinpoint patterns that are unique and common across these subcontexts. First, we focused on patterns at the level of individual nodes and evaluated their functional roles using a human protein–protein interactome as a referential network. Next, within each context, we systematically overlaid the co-expression networks to identify specific and shared correlations as well as relations already described in scientific literature. Additionally, in a pathway-level analysis, we overlaid node and edge sets from co-expression networks against pathway knowledge to identify biological processes that are related to specific subcontexts or groups of them. Finally, we have released our data and scripts at https://zenodo.org/record/5831786 and https://github.com/ContNeXt/, respectively and developed ContNeXt (https://contnext.scai.fraunhofer.de/), a web application to explore the networks generated in this work. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-022-04765-0. BioMed Central 2022-06-15 /pmc/articles/PMC9202106/ /pubmed/35705903 http://dx.doi.org/10.1186/s12859-022-04765-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Figueiredo, Rebeca Queiroz
del Ser, Sara Díaz
Raschka, Tamara
Hofmann-Apitius, Martin
Kodamullil, Alpha Tom
Mubeen, Sarah
Domingo-Fernández, Daniel
Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
title Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
title_full Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
title_fullStr Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
title_full_unstemmed Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
title_short Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
title_sort elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9202106/
https://www.ncbi.nlm.nih.gov/pubmed/35705903
http://dx.doi.org/10.1186/s12859-022-04765-0
work_keys_str_mv AT figueiredorebecaqueiroz elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets
AT delsersaradiaz elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets
AT raschkatamara elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets
AT hofmannapitiusmartin elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets
AT kodamullilalphatom elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets
AT mubeensarah elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets
AT domingofernandezdaniel elucidatinggeneexpressionpatternsacrossmultiplebiologicalcontextsthroughalargescaleinvestigationoftranscriptomicdatasets