Cargando…

Evaluating batch correction methods for image-based cell profiling

High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, s...

Descripción completa

Detalles Bibliográficos
Autores principales: Arevalo, John, van Dijk, Robert, Carpenter, Anne E., Singh, Shantanu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516049/
https://www.ncbi.nlm.nih.gov/pubmed/37745478
http://dx.doi.org/10.1101/2023.09.15.558001
_version_ 1785109061969117184
author Arevalo, John
van Dijk, Robert
Carpenter, Anne E.
Singh, Shantanu
author_facet Arevalo, John
van Dijk, Robert
Carpenter, Anne E.
Singh, Shantanu
author_sort Arevalo, John
collection PubMed
description High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we evaluated seven top-ranked batch correction strategies for mRNA profiles in the context of a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different use scenarios with varying complexity, and found that Harmony, a nonlinear method, consistently outperformed the other tested methods. Furthermore, we provide a framework, benchmark, and metrics for the future assessment of new batch correction methods. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery.
format Online
Article
Text
id pubmed-10516049
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-105160492023-09-23 Evaluating batch correction methods for image-based cell profiling Arevalo, John van Dijk, Robert Carpenter, Anne E. Singh, Shantanu bioRxiv Article High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we evaluated seven top-ranked batch correction strategies for mRNA profiles in the context of a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different use scenarios with varying complexity, and found that Harmony, a nonlinear method, consistently outperformed the other tested methods. Furthermore, we provide a framework, benchmark, and metrics for the future assessment of new batch correction methods. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery. Cold Spring Harbor Laboratory 2023-09-17 /pmc/articles/PMC10516049/ /pubmed/37745478 http://dx.doi.org/10.1101/2023.09.15.558001 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Arevalo, John
van Dijk, Robert
Carpenter, Anne E.
Singh, Shantanu
Evaluating batch correction methods for image-based cell profiling
title Evaluating batch correction methods for image-based cell profiling
title_full Evaluating batch correction methods for image-based cell profiling
title_fullStr Evaluating batch correction methods for image-based cell profiling
title_full_unstemmed Evaluating batch correction methods for image-based cell profiling
title_short Evaluating batch correction methods for image-based cell profiling
title_sort evaluating batch correction methods for image-based cell profiling
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516049/
https://www.ncbi.nlm.nih.gov/pubmed/37745478
http://dx.doi.org/10.1101/2023.09.15.558001
work_keys_str_mv AT arevalojohn evaluatingbatchcorrectionmethodsforimagebasedcellprofiling
AT vandijkrobert evaluatingbatchcorrectionmethodsforimagebasedcellprofiling
AT carpenterannee evaluatingbatchcorrectionmethodsforimagebasedcellprofiling
AT singhshantanu evaluatingbatchcorrectionmethodsforimagebasedcellprofiling