Cargando…

Harnessing the biological complexity of Big Data from LINCS gene expression signatures

Gene expression profiling using transcriptional drug perturbations are useful for many biomedical discovery studies including drug repurposing and elucidation of drug mechanisms (MoA) and many other pharmacogenomic applications. However, limited data availability across cell types has severely hinde...

Descripción completa

Detalles Bibliográficos
Autores principales: Musa, Aliyu, Tripathi, Shailesh, Kandhavelu, Meenakshisundaram, Dehmer, Matthias, Emmert-Streib, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6114505/
https://www.ncbi.nlm.nih.gov/pubmed/30157183
http://dx.doi.org/10.1371/journal.pone.0201937
_version_ 1783351205109432320
author Musa, Aliyu
Tripathi, Shailesh
Kandhavelu, Meenakshisundaram
Dehmer, Matthias
Emmert-Streib, Frank
author_facet Musa, Aliyu
Tripathi, Shailesh
Kandhavelu, Meenakshisundaram
Dehmer, Matthias
Emmert-Streib, Frank
author_sort Musa, Aliyu
collection PubMed
description Gene expression profiling using transcriptional drug perturbations are useful for many biomedical discovery studies including drug repurposing and elucidation of drug mechanisms (MoA) and many other pharmacogenomic applications. However, limited data availability across cell types has severely hindered our capacity to progress in these areas. To fill this gap, recently, the LINCS program generated almost 1.3 million profiles for over 40,000 drug and genetic perturbations for over 70 different human cell types, including meta information about the experimental conditions and cell lines. Unfortunately, Big Data like the ones generated from the ongoing LINCS program do not enable easy insights from the data but possess considerable challenges toward their analysis. In this paper, we address some of these challenges. Specifically, first, we study the gene expression signature profiles from all cell lines and their perturbagents in order to obtain insights in the distributional characteristics of available conditions. Second, we investigate the differential expression of genes for all cell lines obtaining an understanding of condition dependent differential expression manifesting the biological complexity of perturbagents. As a result, our analysis helps the experimental design of follow-up studies, e.g., by selecting appropriate cell lines.
format Online
Article
Text
id pubmed-6114505
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61145052018-09-17 Harnessing the biological complexity of Big Data from LINCS gene expression signatures Musa, Aliyu Tripathi, Shailesh Kandhavelu, Meenakshisundaram Dehmer, Matthias Emmert-Streib, Frank PLoS One Research Article Gene expression profiling using transcriptional drug perturbations are useful for many biomedical discovery studies including drug repurposing and elucidation of drug mechanisms (MoA) and many other pharmacogenomic applications. However, limited data availability across cell types has severely hindered our capacity to progress in these areas. To fill this gap, recently, the LINCS program generated almost 1.3 million profiles for over 40,000 drug and genetic perturbations for over 70 different human cell types, including meta information about the experimental conditions and cell lines. Unfortunately, Big Data like the ones generated from the ongoing LINCS program do not enable easy insights from the data but possess considerable challenges toward their analysis. In this paper, we address some of these challenges. Specifically, first, we study the gene expression signature profiles from all cell lines and their perturbagents in order to obtain insights in the distributional characteristics of available conditions. Second, we investigate the differential expression of genes for all cell lines obtaining an understanding of condition dependent differential expression manifesting the biological complexity of perturbagents. As a result, our analysis helps the experimental design of follow-up studies, e.g., by selecting appropriate cell lines. Public Library of Science 2018-08-29 /pmc/articles/PMC6114505/ /pubmed/30157183 http://dx.doi.org/10.1371/journal.pone.0201937 Text en © 2018 Musa et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Musa, Aliyu
Tripathi, Shailesh
Kandhavelu, Meenakshisundaram
Dehmer, Matthias
Emmert-Streib, Frank
Harnessing the biological complexity of Big Data from LINCS gene expression signatures
title Harnessing the biological complexity of Big Data from LINCS gene expression signatures
title_full Harnessing the biological complexity of Big Data from LINCS gene expression signatures
title_fullStr Harnessing the biological complexity of Big Data from LINCS gene expression signatures
title_full_unstemmed Harnessing the biological complexity of Big Data from LINCS gene expression signatures
title_short Harnessing the biological complexity of Big Data from LINCS gene expression signatures
title_sort harnessing the biological complexity of big data from lincs gene expression signatures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6114505/
https://www.ncbi.nlm.nih.gov/pubmed/30157183
http://dx.doi.org/10.1371/journal.pone.0201937
work_keys_str_mv AT musaaliyu harnessingthebiologicalcomplexityofbigdatafromlincsgeneexpressionsignatures
AT tripathishailesh harnessingthebiologicalcomplexityofbigdatafromlincsgeneexpressionsignatures
AT kandhavelumeenakshisundaram harnessingthebiologicalcomplexityofbigdatafromlincsgeneexpressionsignatures
AT dehmermatthias harnessingthebiologicalcomplexityofbigdatafromlincsgeneexpressionsignatures
AT emmertstreibfrank harnessingthebiologicalcomplexityofbigdatafromlincsgeneexpressionsignatures