Cargando…

QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing

BACKGROUND: Genome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively. Although many open-source tools for microRNA (miRNA)-seq data analyses are available, challenges remain in accurate miRNA quantification from large-scale miRNA-seq dataset. We implemented a pipeli...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Shanrong, Gordon, William, Du, Sarah, Zhang, Chi, He, Wen, Xi, Li, Mathur, Sachin, Agostino, Michael, Paradis, Theresa, von Schack, David, Vincent, Michael, Zhang, Baohong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5359966/
https://www.ncbi.nlm.nih.gov/pubmed/28320324
http://dx.doi.org/10.1186/s12859-017-1601-4
_version_ 1782516497144348672
author Zhao, Shanrong
Gordon, William
Du, Sarah
Zhang, Chi
He, Wen
Xi, Li
Mathur, Sachin
Agostino, Michael
Paradis, Theresa
von Schack, David
Vincent, Michael
Zhang, Baohong
author_facet Zhao, Shanrong
Gordon, William
Du, Sarah
Zhang, Chi
He, Wen
Xi, Li
Mathur, Sachin
Agostino, Michael
Paradis, Theresa
von Schack, David
Vincent, Michael
Zhang, Baohong
author_sort Zhao, Shanrong
collection PubMed
description BACKGROUND: Genome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively. Although many open-source tools for microRNA (miRNA)-seq data analyses are available, challenges remain in accurate miRNA quantification from large-scale miRNA-seq dataset. We implemented a pipeline called QuickMIRSeq for accurate quantification of known miRNAs and miRNA isoforms (isomiRs) from multiple samples simultaneously. RESULTS: QuickMIRSeq considers the unique nature of miRNAs and combines many important features into its implementation. First, it takes advantage of high redundancy of miRNA reads and introduces joint mapping of multiple samples to reduce computational time. Second, it incorporates the strand information in the alignment step for more accurate quantification. Third, reads potentially arising from background noise are filtered out to improve the reliability of miRNA detection. Fourth, sequences aligned to miRNAs with mismatches are remapped to a reference genome to further reduce false positives. Finally, QuickMIRSeq generates a rich set of QC metrics and publication-ready plots. CONCLUSIONS: The rich visualization features implemented allow end users to interactively explore the results and gain more insights into miRNA-seq data analyses. The high degree of automation and interactivity in QuickMIRSeq leads to a substantial reduction in the time and effort required for miRNA-seq data analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1601-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5359966
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-53599662017-03-22 QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing Zhao, Shanrong Gordon, William Du, Sarah Zhang, Chi He, Wen Xi, Li Mathur, Sachin Agostino, Michael Paradis, Theresa von Schack, David Vincent, Michael Zhang, Baohong BMC Bioinformatics Software BACKGROUND: Genome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively. Although many open-source tools for microRNA (miRNA)-seq data analyses are available, challenges remain in accurate miRNA quantification from large-scale miRNA-seq dataset. We implemented a pipeline called QuickMIRSeq for accurate quantification of known miRNAs and miRNA isoforms (isomiRs) from multiple samples simultaneously. RESULTS: QuickMIRSeq considers the unique nature of miRNAs and combines many important features into its implementation. First, it takes advantage of high redundancy of miRNA reads and introduces joint mapping of multiple samples to reduce computational time. Second, it incorporates the strand information in the alignment step for more accurate quantification. Third, reads potentially arising from background noise are filtered out to improve the reliability of miRNA detection. Fourth, sequences aligned to miRNAs with mismatches are remapped to a reference genome to further reduce false positives. Finally, QuickMIRSeq generates a rich set of QC metrics and publication-ready plots. CONCLUSIONS: The rich visualization features implemented allow end users to interactively explore the results and gain more insights into miRNA-seq data analyses. The high degree of automation and interactivity in QuickMIRSeq leads to a substantial reduction in the time and effort required for miRNA-seq data analysis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1601-4) contains supplementary material, which is available to authorized users. BioMed Central 2017-03-20 /pmc/articles/PMC5359966/ /pubmed/28320324 http://dx.doi.org/10.1186/s12859-017-1601-4 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Zhao, Shanrong
Gordon, William
Du, Sarah
Zhang, Chi
He, Wen
Xi, Li
Mathur, Sachin
Agostino, Michael
Paradis, Theresa
von Schack, David
Vincent, Michael
Zhang, Baohong
QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing
title QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing
title_full QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing
title_fullStr QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing
title_full_unstemmed QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing
title_short QuickMIRSeq: a pipeline for quick and accurate quantification of both known miRNAs and isomiRs by jointly processing multiple samples from microRNA sequencing
title_sort quickmirseq: a pipeline for quick and accurate quantification of both known mirnas and isomirs by jointly processing multiple samples from microrna sequencing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5359966/
https://www.ncbi.nlm.nih.gov/pubmed/28320324
http://dx.doi.org/10.1186/s12859-017-1601-4
work_keys_str_mv AT zhaoshanrong quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT gordonwilliam quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT dusarah quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT zhangchi quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT hewen quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT xili quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT mathursachin quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT agostinomichael quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT paradistheresa quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT vonschackdavid quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT vincentmichael quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing
AT zhangbaohong quickmirseqapipelineforquickandaccuratequantificationofbothknownmirnasandisomirsbyjointlyprocessingmultiplesamplesfrommicrornasequencing