Cargando…

SePIA: RNA and small RNA sequence processing, integration, and analysis

BACKGROUND: Large-scale sequencing experiments are complex and require a wide spectrum of computational tools to extract and interpret relevant biological information. This is especially true in projects where individual processing and integrated analysis of both small RNA and complementary RNA data...

Descripción completa

Detalles Bibliográficos
Autores principales: Icay, Katherine, Chen, Ping, Cervera, Alejandra, Rantanen, Ville, Lehtonen, Rainer, Hautaniemi, Sampsa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4875694/
https://www.ncbi.nlm.nih.gov/pubmed/27213017
http://dx.doi.org/10.1186/s13040-016-0099-z
_version_ 1782433139790970880
author Icay, Katherine
Chen, Ping
Cervera, Alejandra
Rantanen, Ville
Lehtonen, Rainer
Hautaniemi, Sampsa
author_facet Icay, Katherine
Chen, Ping
Cervera, Alejandra
Rantanen, Ville
Lehtonen, Rainer
Hautaniemi, Sampsa
author_sort Icay, Katherine
collection PubMed
description BACKGROUND: Large-scale sequencing experiments are complex and require a wide spectrum of computational tools to extract and interpret relevant biological information. This is especially true in projects where individual processing and integrated analysis of both small RNA and complementary RNA data is needed. Such studies would benefit from a computational workflow that is easy to implement and standardizes the processing and analysis of both sequenced data types. RESULTS: We developed SePIA (Sequence Processing, Integration, and Analysis), a comprehensive small RNA and RNA workflow. It provides ready execution for over 20 commonly known RNA-seq tools on top of an established workflow engine and provides dynamic pipeline architecture to manage, individually analyze, and integrate both small RNA and RNA data. Implementation with Docker makes SePIA portable and easy to run. We demonstrate the workflow’s extensive utility with two case studies involving three breast cancer datasets. SePIA is straightforward to configure and organizes results into a perusable HTML report. Furthermore, the underlying pipeline engine supports computational resource management for optimal performance. CONCLUSION: SePIA is an open-source workflow introducing standardized processing and analysis of RNA and small RNA data. SePIA’s modular design enables robust customization to a given experiment while maintaining overall workflow structure. It is available at http://anduril.org/sepia. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-016-0099-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4875694
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48756942016-05-22 SePIA: RNA and small RNA sequence processing, integration, and analysis Icay, Katherine Chen, Ping Cervera, Alejandra Rantanen, Ville Lehtonen, Rainer Hautaniemi, Sampsa BioData Min Software Article BACKGROUND: Large-scale sequencing experiments are complex and require a wide spectrum of computational tools to extract and interpret relevant biological information. This is especially true in projects where individual processing and integrated analysis of both small RNA and complementary RNA data is needed. Such studies would benefit from a computational workflow that is easy to implement and standardizes the processing and analysis of both sequenced data types. RESULTS: We developed SePIA (Sequence Processing, Integration, and Analysis), a comprehensive small RNA and RNA workflow. It provides ready execution for over 20 commonly known RNA-seq tools on top of an established workflow engine and provides dynamic pipeline architecture to manage, individually analyze, and integrate both small RNA and RNA data. Implementation with Docker makes SePIA portable and easy to run. We demonstrate the workflow’s extensive utility with two case studies involving three breast cancer datasets. SePIA is straightforward to configure and organizes results into a perusable HTML report. Furthermore, the underlying pipeline engine supports computational resource management for optimal performance. CONCLUSION: SePIA is an open-source workflow introducing standardized processing and analysis of RNA and small RNA data. SePIA’s modular design enables robust customization to a given experiment while maintaining overall workflow structure. It is available at http://anduril.org/sepia. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13040-016-0099-z) contains supplementary material, which is available to authorized users. BioMed Central 2016-05-20 /pmc/articles/PMC4875694/ /pubmed/27213017 http://dx.doi.org/10.1186/s13040-016-0099-z Text en © Icay et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software Article
Icay, Katherine
Chen, Ping
Cervera, Alejandra
Rantanen, Ville
Lehtonen, Rainer
Hautaniemi, Sampsa
SePIA: RNA and small RNA sequence processing, integration, and analysis
title SePIA: RNA and small RNA sequence processing, integration, and analysis
title_full SePIA: RNA and small RNA sequence processing, integration, and analysis
title_fullStr SePIA: RNA and small RNA sequence processing, integration, and analysis
title_full_unstemmed SePIA: RNA and small RNA sequence processing, integration, and analysis
title_short SePIA: RNA and small RNA sequence processing, integration, and analysis
title_sort sepia: rna and small rna sequence processing, integration, and analysis
topic Software Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4875694/
https://www.ncbi.nlm.nih.gov/pubmed/27213017
http://dx.doi.org/10.1186/s13040-016-0099-z
work_keys_str_mv AT icaykatherine sepiarnaandsmallrnasequenceprocessingintegrationandanalysis
AT chenping sepiarnaandsmallrnasequenceprocessingintegrationandanalysis
AT cerveraalejandra sepiarnaandsmallrnasequenceprocessingintegrationandanalysis
AT rantanenville sepiarnaandsmallrnasequenceprocessingintegrationandanalysis
AT lehtonenrainer sepiarnaandsmallrnasequenceprocessingintegrationandanalysis
AT hautaniemisampsa sepiarnaandsmallrnasequenceprocessingintegrationandanalysis