Cargando…

Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage

The informational content of RNA sequencing is currently far from being completely explored. Most of the analyses focus on processing tables of counts or finding isoform deconvolution via exon junctions. This article presents a comparison of several techniques that can be used to estimate differenti...

Descripción completa

Detalles Bibliográficos
Autores principales: Okoniewski, Michał J., Leśniewska, Anna, Szabelska, Alicja, Zyprych-Walczak, Joanna, Ryan, Martin, Wachtel, Marco, Morzy, Tadeusz, Schäfer, Beat, Schlapbach, Ralph
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3351146/
https://www.ncbi.nlm.nih.gov/pubmed/22210855
http://dx.doi.org/10.1093/nar/gkr1249
_version_ 1782232734719016960
author Okoniewski, Michał J.
Leśniewska, Anna
Szabelska, Alicja
Zyprych-Walczak, Joanna
Ryan, Martin
Wachtel, Marco
Morzy, Tadeusz
Schäfer, Beat
Schlapbach, Ralph
author_facet Okoniewski, Michał J.
Leśniewska, Anna
Szabelska, Alicja
Zyprych-Walczak, Joanna
Ryan, Martin
Wachtel, Marco
Morzy, Tadeusz
Schäfer, Beat
Schlapbach, Ralph
author_sort Okoniewski, Michał J.
collection PubMed
description The informational content of RNA sequencing is currently far from being completely explored. Most of the analyses focus on processing tables of counts or finding isoform deconvolution via exon junctions. This article presents a comparison of several techniques that can be used to estimate differential expression of exons or small genomic regions of expression, based on their coverage function shapes. The problem is defined as finding the differentially expressed exons between two samples using local expression profile normalization and statistical measures to spot the differences between two profile shapes. Initial experiments have been done using synthetic data, and real data modified with synthetically created differential patterns. Then, 160 pipelines (5 types of generator × 4 normalizations × 8 difference measures) are compared. As a result, the best analysis pipelines are selected based on linearity of the differential expression estimation and the area under the ROC curve. These platform-independent techniques have been implemented in the Bioconductor package rnaSeqMap. They point out the exons with differential expression or internal splicing, even if the counts of reads may not show this. The areas of application include significant difference searches, splicing identification algorithms and finding suitable regions for QPCR primers.
format Online
Article
Text
id pubmed-3351146
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33511462012-05-14 Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage Okoniewski, Michał J. Leśniewska, Anna Szabelska, Alicja Zyprych-Walczak, Joanna Ryan, Martin Wachtel, Marco Morzy, Tadeusz Schäfer, Beat Schlapbach, Ralph Nucleic Acids Res Methods Online The informational content of RNA sequencing is currently far from being completely explored. Most of the analyses focus on processing tables of counts or finding isoform deconvolution via exon junctions. This article presents a comparison of several techniques that can be used to estimate differential expression of exons or small genomic regions of expression, based on their coverage function shapes. The problem is defined as finding the differentially expressed exons between two samples using local expression profile normalization and statistical measures to spot the differences between two profile shapes. Initial experiments have been done using synthetic data, and real data modified with synthetically created differential patterns. Then, 160 pipelines (5 types of generator × 4 normalizations × 8 difference measures) are compared. As a result, the best analysis pipelines are selected based on linearity of the differential expression estimation and the area under the ROC curve. These platform-independent techniques have been implemented in the Bioconductor package rnaSeqMap. They point out the exons with differential expression or internal splicing, even if the counts of reads may not show this. The areas of application include significant difference searches, splicing identification algorithms and finding suitable regions for QPCR primers. Oxford University Press 2012-05 2011-12-29 /pmc/articles/PMC3351146/ /pubmed/22210855 http://dx.doi.org/10.1093/nar/gkr1249 Text en © The Author(s) 2011. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Okoniewski, Michał J.
Leśniewska, Anna
Szabelska, Alicja
Zyprych-Walczak, Joanna
Ryan, Martin
Wachtel, Marco
Morzy, Tadeusz
Schäfer, Beat
Schlapbach, Ralph
Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage
title Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage
title_full Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage
title_fullStr Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage
title_full_unstemmed Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage
title_short Preferred analysis methods for single genomic regions in RNA sequencing revealed by processing the shape of coverage
title_sort preferred analysis methods for single genomic regions in rna sequencing revealed by processing the shape of coverage
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3351146/
https://www.ncbi.nlm.nih.gov/pubmed/22210855
http://dx.doi.org/10.1093/nar/gkr1249
work_keys_str_mv AT okoniewskimichałj preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT lesniewskaanna preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT szabelskaalicja preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT zyprychwalczakjoanna preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT ryanmartin preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT wachtelmarco preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT morzytadeusz preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT schaferbeat preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage
AT schlapbachralph preferredanalysismethodsforsinglegenomicregionsinrnasequencingrevealedbyprocessingtheshapeofcoverage