Cargando…

A protocol to evaluate RNA sequencing normalization methods

BACKGROUND: RNA sequencing technologies have allowed researchers to gain a better understanding of how the transcriptome affects disease. However, sequencing technologies often unintentionally introduce experimental error into RNA sequencing data. To counteract this, normalization methods are standa...

Descripción completa

Detalles Bibliográficos
Autores principales: Abrams, Zachary B., Johnson, Travis S., Huang, Kun, Payne, Philip R. O., Coombes, Kevin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923842/
https://www.ncbi.nlm.nih.gov/pubmed/31861985
http://dx.doi.org/10.1186/s12859-019-3247-x
_version_ 1783481603658350592
author Abrams, Zachary B.
Johnson, Travis S.
Huang, Kun
Payne, Philip R. O.
Coombes, Kevin
author_facet Abrams, Zachary B.
Johnson, Travis S.
Huang, Kun
Payne, Philip R. O.
Coombes, Kevin
author_sort Abrams, Zachary B.
collection PubMed
description BACKGROUND: RNA sequencing technologies have allowed researchers to gain a better understanding of how the transcriptome affects disease. However, sequencing technologies often unintentionally introduce experimental error into RNA sequencing data. To counteract this, normalization methods are standardly applied with the intent of reducing the non-biologically derived variability inherent in transcriptomic measurements. However, the comparative efficacy of the various normalization techniques has not been tested in a standardized manner. Here we propose tests that evaluate numerous normalization techniques and applied them to a large-scale standard data set. These tests comprise a protocol that allows researchers to measure the amount of non-biological variability which is present in any data set after normalization has been performed, a crucial step to assessing the biological validity of data following normalization. RESULTS: In this study we present two tests to assess the validity of normalization methods applied to a large-scale data set collected for systematic evaluation purposes. We tested various RNASeq normalization procedures and concluded that transcripts per million (TPM) was the best performing normalization method based on its preservation of biological signal as compared to the other methods tested. CONCLUSION: Normalization is of vital importance to accurately interpret the results of genomic and transcriptomic experiments. More work, however, needs to be performed to optimize normalization methods for RNASeq data. The present effort helps pave the way for more systematic evaluations of normalization methods across different platforms. With our proposed schema researchers can evaluate their own or future normalization methods to further improve the field of RNASeq normalization.
format Online
Article
Text
id pubmed-6923842
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69238422019-12-30 A protocol to evaluate RNA sequencing normalization methods Abrams, Zachary B. Johnson, Travis S. Huang, Kun Payne, Philip R. O. Coombes, Kevin BMC Bioinformatics Research BACKGROUND: RNA sequencing technologies have allowed researchers to gain a better understanding of how the transcriptome affects disease. However, sequencing technologies often unintentionally introduce experimental error into RNA sequencing data. To counteract this, normalization methods are standardly applied with the intent of reducing the non-biologically derived variability inherent in transcriptomic measurements. However, the comparative efficacy of the various normalization techniques has not been tested in a standardized manner. Here we propose tests that evaluate numerous normalization techniques and applied them to a large-scale standard data set. These tests comprise a protocol that allows researchers to measure the amount of non-biological variability which is present in any data set after normalization has been performed, a crucial step to assessing the biological validity of data following normalization. RESULTS: In this study we present two tests to assess the validity of normalization methods applied to a large-scale data set collected for systematic evaluation purposes. We tested various RNASeq normalization procedures and concluded that transcripts per million (TPM) was the best performing normalization method based on its preservation of biological signal as compared to the other methods tested. CONCLUSION: Normalization is of vital importance to accurately interpret the results of genomic and transcriptomic experiments. More work, however, needs to be performed to optimize normalization methods for RNASeq data. The present effort helps pave the way for more systematic evaluations of normalization methods across different platforms. With our proposed schema researchers can evaluate their own or future normalization methods to further improve the field of RNASeq normalization. BioMed Central 2019-12-20 /pmc/articles/PMC6923842/ /pubmed/31861985 http://dx.doi.org/10.1186/s12859-019-3247-x Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Abrams, Zachary B.
Johnson, Travis S.
Huang, Kun
Payne, Philip R. O.
Coombes, Kevin
A protocol to evaluate RNA sequencing normalization methods
title A protocol to evaluate RNA sequencing normalization methods
title_full A protocol to evaluate RNA sequencing normalization methods
title_fullStr A protocol to evaluate RNA sequencing normalization methods
title_full_unstemmed A protocol to evaluate RNA sequencing normalization methods
title_short A protocol to evaluate RNA sequencing normalization methods
title_sort protocol to evaluate rna sequencing normalization methods
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6923842/
https://www.ncbi.nlm.nih.gov/pubmed/31861985
http://dx.doi.org/10.1186/s12859-019-3247-x
work_keys_str_mv AT abramszacharyb aprotocoltoevaluaternasequencingnormalizationmethods
AT johnsontraviss aprotocoltoevaluaternasequencingnormalizationmethods
AT huangkun aprotocoltoevaluaternasequencingnormalizationmethods
AT paynephilipro aprotocoltoevaluaternasequencingnormalizationmethods
AT coombeskevin aprotocoltoevaluaternasequencingnormalizationmethods
AT abramszacharyb protocoltoevaluaternasequencingnormalizationmethods
AT johnsontraviss protocoltoevaluaternasequencingnormalizationmethods
AT huangkun protocoltoevaluaternasequencingnormalizationmethods
AT paynephilipro protocoltoevaluaternasequencingnormalizationmethods
AT coombeskevin protocoltoevaluaternasequencingnormalizationmethods