Cargando…

Measure transcript integrity using RNA-seq data

BACKGROUND: Stored biological samples with pathology information and medical records are invaluable resources for translational medical research. However, RNAs extracted from the archived clinical tissues are often substantially degraded. RNA degradation distorts the RNA-seq read coverage in a gene-...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Liguo, Nie, Jinfu, Sicotte, Hugues, Li, Ying, Eckel-Passow, Jeanette E., Dasari, Surendra, Vedell, Peter T., Barman, Poulami, Wang, Liewei, Weinshiboum, Richard, Jen, Jin, Huang, Haojie, Kohli, Manish, Kocher, Jean-Pierre A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4739097/
https://www.ncbi.nlm.nih.gov/pubmed/26842848
http://dx.doi.org/10.1186/s12859-016-0922-z
_version_ 1782413700172349440
author Wang, Liguo
Nie, Jinfu
Sicotte, Hugues
Li, Ying
Eckel-Passow, Jeanette E.
Dasari, Surendra
Vedell, Peter T.
Barman, Poulami
Wang, Liewei
Weinshiboum, Richard
Jen, Jin
Huang, Haojie
Kohli, Manish
Kocher, Jean-Pierre A.
author_facet Wang, Liguo
Nie, Jinfu
Sicotte, Hugues
Li, Ying
Eckel-Passow, Jeanette E.
Dasari, Surendra
Vedell, Peter T.
Barman, Poulami
Wang, Liewei
Weinshiboum, Richard
Jen, Jin
Huang, Haojie
Kohli, Manish
Kocher, Jean-Pierre A.
author_sort Wang, Liguo
collection PubMed
description BACKGROUND: Stored biological samples with pathology information and medical records are invaluable resources for translational medical research. However, RNAs extracted from the archived clinical tissues are often substantially degraded. RNA degradation distorts the RNA-seq read coverage in a gene-specific manner, and has profound influences on whole-genome gene expression profiling. RESULT: We developed the transcript integrity number (TIN) to measure RNA degradation. When applied to 3 independent RNA-seq datasets, we demonstrated TIN is a reliable and sensitive measure of the RNA degradation at both transcript and sample level. Through comparing 10 prostate cancer clinical samples with lower RNA integrity to 10 samples with higher RNA quality, we demonstrated that calibrating gene expression counts with TIN scores could effectively neutralize RNA degradation effects by reducing false positives and recovering biologically meaningful pathways. When further evaluating the performance of TIN correction using spike-in transcripts in RNA-seq data generated from the Sequencing Quality Control consortium, we found TIN adjustment had better control of false positives and false negatives (sensitivity = 0.89, specificity = 0.91, accuracy = 0.90), as compared to gene expression analysis results without TIN correction (sensitivity = 0.98, specificity = 0.50, accuracy = 0.86). CONCLUSION: TIN is a reliable measurement of RNA integrity and a valuable approach used to neutralize in vitro RNA degradation effect and improve differential gene expression analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0922-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4739097
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47390972016-02-04 Measure transcript integrity using RNA-seq data Wang, Liguo Nie, Jinfu Sicotte, Hugues Li, Ying Eckel-Passow, Jeanette E. Dasari, Surendra Vedell, Peter T. Barman, Poulami Wang, Liewei Weinshiboum, Richard Jen, Jin Huang, Haojie Kohli, Manish Kocher, Jean-Pierre A. BMC Bioinformatics Methodology Article BACKGROUND: Stored biological samples with pathology information and medical records are invaluable resources for translational medical research. However, RNAs extracted from the archived clinical tissues are often substantially degraded. RNA degradation distorts the RNA-seq read coverage in a gene-specific manner, and has profound influences on whole-genome gene expression profiling. RESULT: We developed the transcript integrity number (TIN) to measure RNA degradation. When applied to 3 independent RNA-seq datasets, we demonstrated TIN is a reliable and sensitive measure of the RNA degradation at both transcript and sample level. Through comparing 10 prostate cancer clinical samples with lower RNA integrity to 10 samples with higher RNA quality, we demonstrated that calibrating gene expression counts with TIN scores could effectively neutralize RNA degradation effects by reducing false positives and recovering biologically meaningful pathways. When further evaluating the performance of TIN correction using spike-in transcripts in RNA-seq data generated from the Sequencing Quality Control consortium, we found TIN adjustment had better control of false positives and false negatives (sensitivity = 0.89, specificity = 0.91, accuracy = 0.90), as compared to gene expression analysis results without TIN correction (sensitivity = 0.98, specificity = 0.50, accuracy = 0.86). CONCLUSION: TIN is a reliable measurement of RNA integrity and a valuable approach used to neutralize in vitro RNA degradation effect and improve differential gene expression analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0922-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-03 /pmc/articles/PMC4739097/ /pubmed/26842848 http://dx.doi.org/10.1186/s12859-016-0922-z Text en © Wang et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Wang, Liguo
Nie, Jinfu
Sicotte, Hugues
Li, Ying
Eckel-Passow, Jeanette E.
Dasari, Surendra
Vedell, Peter T.
Barman, Poulami
Wang, Liewei
Weinshiboum, Richard
Jen, Jin
Huang, Haojie
Kohli, Manish
Kocher, Jean-Pierre A.
Measure transcript integrity using RNA-seq data
title Measure transcript integrity using RNA-seq data
title_full Measure transcript integrity using RNA-seq data
title_fullStr Measure transcript integrity using RNA-seq data
title_full_unstemmed Measure transcript integrity using RNA-seq data
title_short Measure transcript integrity using RNA-seq data
title_sort measure transcript integrity using rna-seq data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4739097/
https://www.ncbi.nlm.nih.gov/pubmed/26842848
http://dx.doi.org/10.1186/s12859-016-0922-z
work_keys_str_mv AT wangliguo measuretranscriptintegrityusingrnaseqdata
AT niejinfu measuretranscriptintegrityusingrnaseqdata
AT sicottehugues measuretranscriptintegrityusingrnaseqdata
AT liying measuretranscriptintegrityusingrnaseqdata
AT eckelpassowjeanettee measuretranscriptintegrityusingrnaseqdata
AT dasarisurendra measuretranscriptintegrityusingrnaseqdata
AT vedellpetert measuretranscriptintegrityusingrnaseqdata
AT barmanpoulami measuretranscriptintegrityusingrnaseqdata
AT wangliewei measuretranscriptintegrityusingrnaseqdata
AT weinshiboumrichard measuretranscriptintegrityusingrnaseqdata
AT jenjin measuretranscriptintegrityusingrnaseqdata
AT huanghaojie measuretranscriptintegrityusingrnaseqdata
AT kohlimanish measuretranscriptintegrityusingrnaseqdata
AT kocherjeanpierrea measuretranscriptintegrityusingrnaseqdata