Cargando…

An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations

BACKGROUND: Chemical genomics is an interdisciplinary field that combines small molecule perturbation with traditional genomics to understand gene function and to study the mode(s) of drug action. A benefit of chemical genomic screens is their breadth; each screen can capture the sensitivity of comp...

Descripción completa

Detalles Bibliográficos
Autores principales:	Shabtai, Daniel, Giaever, Guri, Nislow, Corey
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Methodology Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3780717/ https://www.ncbi.nlm.nih.gov/pubmed/23009392 http://dx.doi.org/10.1186/1471-2105-13-245

_version_	1782285333352677376
author	Shabtai, Daniel Giaever, Guri Nislow, Corey
author_facet	Shabtai, Daniel Giaever, Guri Nislow, Corey
author_sort	Shabtai, Daniel
collection	PubMed
description	BACKGROUND: Chemical genomics is an interdisciplinary field that combines small molecule perturbation with traditional genomics to understand gene function and to study the mode(s) of drug action. A benefit of chemical genomic screens is their breadth; each screen can capture the sensitivity of comprehensive collections of mutants or, in the case of mammalian cells, gene knock-downs, simultaneously. As with other large-scale experimental platforms, to compare and contrast such profiles, e.g. for clustering known compounds with uncharacterized compounds, a robust means to compare a large cohort of profiles is required. Existing methods for correlating different chemical profiles include diverse statistical discriminant analysis-based methods and specific gene filtering or normalization methods. Though powerful, none are ideal because they typically require one to define the disrupting effects, commonly known as batch effects, to detect true signal from experimental variation. These effects are not always known, and they can mask true biological differences. We present a method, Bucket Evaluations (BE) that surmounts many of these problems and is extensible to other datasets such as those obtained via gene expression profiling and which is platform independent. RESULTS: We designed an algorithm to analyse chemogenomic profiles to identify potential targets of known drugs and new chemical compounds. We used levelled rank comparisons to identify drugs/compounds with similar profiles that minimizes batch effects and avoids the requirement of pre-defining the disrupting effects. This algorithm was also tested on gene expression microarray data and high throughput sequencing chemogenomic screens and found the method is applicable to a variety of dataset types. CONCLUSIONS: BE, along with various correlation methods on a collection of datasets proved to be highly accurate for locating similarity between experiments. BE is a non-parametric correlation approach, which is suitable for locating correlations in somewhat perturbed datasets such as chemical genomic profiles. We created software and a user interface for using BE, which is publically available.
format	Online Article Text
id	pubmed-3780717
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-37807172013-09-24 An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations Shabtai, Daniel Giaever, Guri Nislow, Corey BMC Bioinformatics Methodology Article BACKGROUND: Chemical genomics is an interdisciplinary field that combines small molecule perturbation with traditional genomics to understand gene function and to study the mode(s) of drug action. A benefit of chemical genomic screens is their breadth; each screen can capture the sensitivity of comprehensive collections of mutants or, in the case of mammalian cells, gene knock-downs, simultaneously. As with other large-scale experimental platforms, to compare and contrast such profiles, e.g. for clustering known compounds with uncharacterized compounds, a robust means to compare a large cohort of profiles is required. Existing methods for correlating different chemical profiles include diverse statistical discriminant analysis-based methods and specific gene filtering or normalization methods. Though powerful, none are ideal because they typically require one to define the disrupting effects, commonly known as batch effects, to detect true signal from experimental variation. These effects are not always known, and they can mask true biological differences. We present a method, Bucket Evaluations (BE) that surmounts many of these problems and is extensible to other datasets such as those obtained via gene expression profiling and which is platform independent. RESULTS: We designed an algorithm to analyse chemogenomic profiles to identify potential targets of known drugs and new chemical compounds. We used levelled rank comparisons to identify drugs/compounds with similar profiles that minimizes batch effects and avoids the requirement of pre-defining the disrupting effects. This algorithm was also tested on gene expression microarray data and high throughput sequencing chemogenomic screens and found the method is applicable to a variety of dataset types. CONCLUSIONS: BE, along with various correlation methods on a collection of datasets proved to be highly accurate for locating similarity between experiments. BE is a non-parametric correlation approach, which is suitable for locating correlations in somewhat perturbed datasets such as chemical genomic profiles. We created software and a user interface for using BE, which is publically available. BioMed Central 2012-09-25 /pmc/articles/PMC3780717/ /pubmed/23009392 http://dx.doi.org/10.1186/1471-2105-13-245 Text en Copyright © 2012 Shabtai et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Methodology Article Shabtai, Daniel Giaever, Guri Nislow, Corey An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
title	An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
title_full	An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
title_fullStr	An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
title_full_unstemmed	An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
title_short	An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
title_sort	algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations
topic	Methodology Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3780717/ https://www.ncbi.nlm.nih.gov/pubmed/23009392 http://dx.doi.org/10.1186/1471-2105-13-245
work_keys_str_mv	AT shabtaidaniel analgorithmforchemicalgenomicprofilingthatminimizesbatcheffectsbucketevaluations AT giaeverguri analgorithmforchemicalgenomicprofilingthatminimizesbatcheffectsbucketevaluations AT nislowcorey analgorithmforchemicalgenomicprofilingthatminimizesbatcheffectsbucketevaluations AT shabtaidaniel algorithmforchemicalgenomicprofilingthatminimizesbatcheffectsbucketevaluations AT giaeverguri algorithmforchemicalgenomicprofilingthatminimizesbatcheffectsbucketevaluations AT nislowcorey algorithmforchemicalgenomicprofilingthatminimizesbatcheffectsbucketevaluations

An algorithm for chemical genomic profiling that minimizes batch effects: bucket evaluations

Ejemplares similares