Cargando…

Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis

BACKGROUND: Massive Parallel Sequencing methods (MPS) can extend and improve the knowledge obtained by conventional microarray technology, both for mRNAs and short non-coding RNAs, e.g. miRNAs. The processing methods used to extract and interpret the information are an important aspect of dealing wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Cordero, Francesca, Beccuti, Marco, Arigoni, Maddalena, Donatelli, Susanna, Calogero, Raffaele A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3282730/
https://www.ncbi.nlm.nih.gov/pubmed/22363693
http://dx.doi.org/10.1371/journal.pone.0031630
_version_ 1782224113234870272
author Cordero, Francesca
Beccuti, Marco
Arigoni, Maddalena
Donatelli, Susanna
Calogero, Raffaele A.
author_facet Cordero, Francesca
Beccuti, Marco
Arigoni, Maddalena
Donatelli, Susanna
Calogero, Raffaele A.
author_sort Cordero, Francesca
collection PubMed
description BACKGROUND: Massive Parallel Sequencing methods (MPS) can extend and improve the knowledge obtained by conventional microarray technology, both for mRNAs and short non-coding RNAs, e.g. miRNAs. The processing methods used to extract and interpret the information are an important aspect of dealing with the vast amounts of data generated from short read sequencing. Although the number of computational tools for MPS data analysis is constantly growing, their strengths and weaknesses as part of a complex analytical pipe-line have not yet been well investigated. PRIMARY FINDINGS: A benchmark MPS miRNA dataset, resembling a situation in which miRNAs are spiked in biological replication experiments was assembled by merging a publicly available MPS spike-in miRNAs data set with MPS data derived from healthy donor peripheral blood mononuclear cells. Using this data set we observed that short reads counts estimation is strongly under estimated in case of duplicates miRNAs, if whole genome is used as reference. Furthermore, the sensitivity of miRNAs detection is strongly dependent by the primary tool used in the analysis. Within the six aligners tested, specifically devoted to miRNA detection, SHRiMP and MicroRazerS show the highest sensitivity. Differential expression estimation is quite efficient. Within the five tools investigated, two of them (DESseq, baySeq) show a very good specificity and sensitivity in the detection of differential expression. CONCLUSIONS: The results provided by our analysis allow the definition of a clear and simple analytical optimized workflow for miRNAs digital quantitative analysis.
format Online
Article
Text
id pubmed-3282730
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32827302012-02-23 Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis Cordero, Francesca Beccuti, Marco Arigoni, Maddalena Donatelli, Susanna Calogero, Raffaele A. PLoS One Research Article BACKGROUND: Massive Parallel Sequencing methods (MPS) can extend and improve the knowledge obtained by conventional microarray technology, both for mRNAs and short non-coding RNAs, e.g. miRNAs. The processing methods used to extract and interpret the information are an important aspect of dealing with the vast amounts of data generated from short read sequencing. Although the number of computational tools for MPS data analysis is constantly growing, their strengths and weaknesses as part of a complex analytical pipe-line have not yet been well investigated. PRIMARY FINDINGS: A benchmark MPS miRNA dataset, resembling a situation in which miRNAs are spiked in biological replication experiments was assembled by merging a publicly available MPS spike-in miRNAs data set with MPS data derived from healthy donor peripheral blood mononuclear cells. Using this data set we observed that short reads counts estimation is strongly under estimated in case of duplicates miRNAs, if whole genome is used as reference. Furthermore, the sensitivity of miRNAs detection is strongly dependent by the primary tool used in the analysis. Within the six aligners tested, specifically devoted to miRNA detection, SHRiMP and MicroRazerS show the highest sensitivity. Differential expression estimation is quite efficient. Within the five tools investigated, two of them (DESseq, baySeq) show a very good specificity and sensitivity in the detection of differential expression. CONCLUSIONS: The results provided by our analysis allow the definition of a clear and simple analytical optimized workflow for miRNAs digital quantitative analysis. Public Library of Science 2012-02-20 /pmc/articles/PMC3282730/ /pubmed/22363693 http://dx.doi.org/10.1371/journal.pone.0031630 Text en Cordero et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Cordero, Francesca
Beccuti, Marco
Arigoni, Maddalena
Donatelli, Susanna
Calogero, Raffaele A.
Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis
title Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis
title_full Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis
title_fullStr Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis
title_full_unstemmed Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis
title_short Optimizing a Massive Parallel Sequencing Workflow for Quantitative miRNA Expression Analysis
title_sort optimizing a massive parallel sequencing workflow for quantitative mirna expression analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3282730/
https://www.ncbi.nlm.nih.gov/pubmed/22363693
http://dx.doi.org/10.1371/journal.pone.0031630
work_keys_str_mv AT corderofrancesca optimizingamassiveparallelsequencingworkflowforquantitativemirnaexpressionanalysis
AT beccutimarco optimizingamassiveparallelsequencingworkflowforquantitativemirnaexpressionanalysis
AT arigonimaddalena optimizingamassiveparallelsequencingworkflowforquantitativemirnaexpressionanalysis
AT donatellisusanna optimizingamassiveparallelsequencingworkflowforquantitativemirnaexpressionanalysis
AT calogeroraffaelea optimizingamassiveparallelsequencingworkflowforquantitativemirnaexpressionanalysis