Cargando…

Representing high throughput expression profiles via perturbation barcodes reveals compound targets

High throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a...

Descripción completa

Detalles Bibliográficos
Autores principales: Filzen, Tracey M., Kutchukian, Peter S., Hermes, Jeffrey D., Li, Jing, Tudor, Matthew
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300121/
https://www.ncbi.nlm.nih.gov/pubmed/28182661
http://dx.doi.org/10.1371/journal.pcbi.1005335
_version_ 1782506128707420160
author Filzen, Tracey M.
Kutchukian, Peter S.
Hermes, Jeffrey D.
Li, Jing
Tudor, Matthew
author_facet Filzen, Tracey M.
Kutchukian, Peter S.
Hermes, Jeffrey D.
Li, Jing
Tudor, Matthew
author_sort Filzen, Tracey M.
collection PubMed
description High throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a manner that captures significant biological signals in spite of various noise sources such as batch effects and stochastic variation. We used the L1000 platform for large-scale profiling of 978 representative genes across thousands of compound treatments. Here, a method is described that uses deep learning techniques to convert the expression changes of the landmark genes into a perturbation barcode that reveals important features of the underlying data, performing better than the raw data in revealing important biological insights. The barcode captures compound structure and target information, and predicts a compound’s high throughput screening promiscuity, to a higher degree than the original data measurements, indicating that the approach uncovers underlying factors of the expression data that are otherwise entangled or masked by noise. Furthermore, we demonstrate that visualizations derived from the perturbation barcode can be used to more sensitively assign functions to unknown compounds through a guilt-by-association approach, which we use to predict and experimentally validate the activity of compounds on the MAPK pathway. The demonstrated application of deep metric learning to large-scale chemical genetics projects highlights the utility of this and related approaches to the extraction of insights and testable hypotheses from big, sometimes noisy data.
format Online
Article
Text
id pubmed-5300121
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53001212017-02-28 Representing high throughput expression profiles via perturbation barcodes reveals compound targets Filzen, Tracey M. Kutchukian, Peter S. Hermes, Jeffrey D. Li, Jing Tudor, Matthew PLoS Comput Biol Research Article High throughput mRNA expression profiling can be used to characterize the response of cell culture models to perturbations such as pharmacologic modulators and genetic perturbations. As profiling campaigns expand in scope, it is important to homogenize, summarize, and analyze the resulting data in a manner that captures significant biological signals in spite of various noise sources such as batch effects and stochastic variation. We used the L1000 platform for large-scale profiling of 978 representative genes across thousands of compound treatments. Here, a method is described that uses deep learning techniques to convert the expression changes of the landmark genes into a perturbation barcode that reveals important features of the underlying data, performing better than the raw data in revealing important biological insights. The barcode captures compound structure and target information, and predicts a compound’s high throughput screening promiscuity, to a higher degree than the original data measurements, indicating that the approach uncovers underlying factors of the expression data that are otherwise entangled or masked by noise. Furthermore, we demonstrate that visualizations derived from the perturbation barcode can be used to more sensitively assign functions to unknown compounds through a guilt-by-association approach, which we use to predict and experimentally validate the activity of compounds on the MAPK pathway. The demonstrated application of deep metric learning to large-scale chemical genetics projects highlights the utility of this and related approaches to the extraction of insights and testable hypotheses from big, sometimes noisy data. Public Library of Science 2017-02-09 /pmc/articles/PMC5300121/ /pubmed/28182661 http://dx.doi.org/10.1371/journal.pcbi.1005335 Text en © 2017 Filzen et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Filzen, Tracey M.
Kutchukian, Peter S.
Hermes, Jeffrey D.
Li, Jing
Tudor, Matthew
Representing high throughput expression profiles via perturbation barcodes reveals compound targets
title Representing high throughput expression profiles via perturbation barcodes reveals compound targets
title_full Representing high throughput expression profiles via perturbation barcodes reveals compound targets
title_fullStr Representing high throughput expression profiles via perturbation barcodes reveals compound targets
title_full_unstemmed Representing high throughput expression profiles via perturbation barcodes reveals compound targets
title_short Representing high throughput expression profiles via perturbation barcodes reveals compound targets
title_sort representing high throughput expression profiles via perturbation barcodes reveals compound targets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5300121/
https://www.ncbi.nlm.nih.gov/pubmed/28182661
http://dx.doi.org/10.1371/journal.pcbi.1005335
work_keys_str_mv AT filzentraceym representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT kutchukianpeters representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT hermesjeffreyd representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT lijing representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets
AT tudormatthew representinghighthroughputexpressionprofilesviaperturbationbarcodesrevealscompoundtargets