Cargando…

Normalization of Large-Scale Transcriptome Data Using Heuristic Methods

In this study, we introduce an artificial intelligent method for addressing the batch effect of a transcriptome data. The method has several clear advantages in comparison with the alternative methods presently in use. Batch effect refers to the discrepancy in gene expression data series, measured u...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yosef, Arthur, Shnaider, Eli, Schneider, Moti, Gurevich, Michael
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	SAGE Publications 2023
Materias:	Original Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068970/ https://www.ncbi.nlm.nih.gov/pubmed/37020503 http://dx.doi.org/10.1177/11779322231160397

_version_	1785018766440005632
author	Yosef, Arthur Shnaider, Eli Schneider, Moti Gurevich, Michael
author_facet	Yosef, Arthur Shnaider, Eli Schneider, Moti Gurevich, Michael
author_sort	Yosef, Arthur
collection	PubMed
description	In this study, we introduce an artificial intelligent method for addressing the batch effect of a transcriptome data. The method has several clear advantages in comparison with the alternative methods presently in use. Batch effect refers to the discrepancy in gene expression data series, measured under different conditions. While the data from the same batch (measurements performed under the same conditions) are compatible, combining various batches into 1 data set is problematic because of incompatible measurements. Therefore, it is necessary to perform correction of the combined data (normalization), before performing biological analysis. There are numerous methods attempting to correct data set for batch effect. These methods rely on various assumptions regarding the distribution of the measurements. Forcing the data elements into pre-supposed distribution can severely distort biological signals, thus leading to incorrect results and conclusions. As the discrepancy between the assumptions regarding the data distribution and the actual distribution is wider, the biases introduced by such “correction methods” are greater. We introduce a heuristic method to reduce batch effect. The method does not rely on any assumptions regarding the distribution and the behavior of data elements. Hence, it does not introduce any new biases in the process of correcting the batch effect. It strictly maintains the integrity of measurements within the original batches.
format	Online Article Text
id	pubmed-10068970
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	SAGE Publications
record_format	MEDLINE/PubMed
spelling	pubmed-100689702023-04-04 Normalization of Large-Scale Transcriptome Data Using Heuristic Methods Yosef, Arthur Shnaider, Eli Schneider, Moti Gurevich, Michael Bioinform Biol Insights Original Research Article In this study, we introduce an artificial intelligent method for addressing the batch effect of a transcriptome data. The method has several clear advantages in comparison with the alternative methods presently in use. Batch effect refers to the discrepancy in gene expression data series, measured under different conditions. While the data from the same batch (measurements performed under the same conditions) are compatible, combining various batches into 1 data set is problematic because of incompatible measurements. Therefore, it is necessary to perform correction of the combined data (normalization), before performing biological analysis. There are numerous methods attempting to correct data set for batch effect. These methods rely on various assumptions regarding the distribution of the measurements. Forcing the data elements into pre-supposed distribution can severely distort biological signals, thus leading to incorrect results and conclusions. As the discrepancy between the assumptions regarding the data distribution and the actual distribution is wider, the biases introduced by such “correction methods” are greater. We introduce a heuristic method to reduce batch effect. The method does not rely on any assumptions regarding the distribution and the behavior of data elements. Hence, it does not introduce any new biases in the process of correcting the batch effect. It strictly maintains the integrity of measurements within the original batches. SAGE Publications 2023-03-31 /pmc/articles/PMC10068970/ /pubmed/37020503 http://dx.doi.org/10.1177/11779322231160397 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle	Original Research Article Yosef, Arthur Shnaider, Eli Schneider, Moti Gurevich, Michael Normalization of Large-Scale Transcriptome Data Using Heuristic Methods
title	Normalization of Large-Scale Transcriptome Data Using Heuristic Methods
title_full	Normalization of Large-Scale Transcriptome Data Using Heuristic Methods
title_fullStr	Normalization of Large-Scale Transcriptome Data Using Heuristic Methods
title_full_unstemmed	Normalization of Large-Scale Transcriptome Data Using Heuristic Methods
title_short	Normalization of Large-Scale Transcriptome Data Using Heuristic Methods
title_sort	normalization of large-scale transcriptome data using heuristic methods
topic	Original Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068970/ https://www.ncbi.nlm.nih.gov/pubmed/37020503 http://dx.doi.org/10.1177/11779322231160397
work_keys_str_mv	AT yosefarthur normalizationoflargescaletranscriptomedatausingheuristicmethods AT shnaidereli normalizationoflargescaletranscriptomedatausingheuristicmethods AT schneidermoti normalizationoflargescaletranscriptomedatausingheuristicmethods AT gurevichmichael normalizationoflargescaletranscriptomedatausingheuristicmethods

Normalization of Large-Scale Transcriptome Data Using Heuristic Methods

Ejemplares similares