Cargando…
Evaluating batch correction methods for image-based cell profiling
High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, s...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516049/ https://www.ncbi.nlm.nih.gov/pubmed/37745478 http://dx.doi.org/10.1101/2023.09.15.558001 |
_version_ | 1785109061969117184 |
---|---|
author | Arevalo, John van Dijk, Robert Carpenter, Anne E. Singh, Shantanu |
author_facet | Arevalo, John van Dijk, Robert Carpenter, Anne E. Singh, Shantanu |
author_sort | Arevalo, John |
collection | PubMed |
description | High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we evaluated seven top-ranked batch correction strategies for mRNA profiles in the context of a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different use scenarios with varying complexity, and found that Harmony, a nonlinear method, consistently outperformed the other tested methods. Furthermore, we provide a framework, benchmark, and metrics for the future assessment of new batch correction methods. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery. |
format | Online Article Text |
id | pubmed-10516049 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-105160492023-09-23 Evaluating batch correction methods for image-based cell profiling Arevalo, John van Dijk, Robert Carpenter, Anne E. Singh, Shantanu bioRxiv Article High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we evaluated seven top-ranked batch correction strategies for mRNA profiles in the context of a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different use scenarios with varying complexity, and found that Harmony, a nonlinear method, consistently outperformed the other tested methods. Furthermore, we provide a framework, benchmark, and metrics for the future assessment of new batch correction methods. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery. Cold Spring Harbor Laboratory 2023-09-17 /pmc/articles/PMC10516049/ /pubmed/37745478 http://dx.doi.org/10.1101/2023.09.15.558001 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Arevalo, John van Dijk, Robert Carpenter, Anne E. Singh, Shantanu Evaluating batch correction methods for image-based cell profiling |
title | Evaluating batch correction methods for image-based cell profiling |
title_full | Evaluating batch correction methods for image-based cell profiling |
title_fullStr | Evaluating batch correction methods for image-based cell profiling |
title_full_unstemmed | Evaluating batch correction methods for image-based cell profiling |
title_short | Evaluating batch correction methods for image-based cell profiling |
title_sort | evaluating batch correction methods for image-based cell profiling |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10516049/ https://www.ncbi.nlm.nih.gov/pubmed/37745478 http://dx.doi.org/10.1101/2023.09.15.558001 |
work_keys_str_mv | AT arevalojohn evaluatingbatchcorrectionmethodsforimagebasedcellprofiling AT vandijkrobert evaluatingbatchcorrectionmethodsforimagebasedcellprofiling AT carpenterannee evaluatingbatchcorrectionmethodsforimagebasedcellprofiling AT singhshantanu evaluatingbatchcorrectionmethodsforimagebasedcellprofiling |