Cargando…

Pincho: A Modular Approach to High Quality De Novo Transcriptomics

Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit...

Descripción completa

Detalles Bibliográficos
Autores principales: Ortiz, Randy, Gera, Priyanka, Rivera, Christopher, Santos, Juan C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8304035/
https://www.ncbi.nlm.nih.gov/pubmed/34206353
http://dx.doi.org/10.3390/genes12070953
_version_ 1783727235990028288
author Ortiz, Randy
Gera, Priyanka
Rivera, Christopher
Santos, Juan C.
author_facet Ortiz, Randy
Gera, Priyanka
Rivera, Christopher
Santos, Juan C.
author_sort Ortiz, Randy
collection PubMed
description Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit limited standardization and customization. The increasing number of transcriptome assembly software continues to provide little room for standardization which is exacerbated by the lack of studies on modularity that compare the effects of assembler synergy. We developed a customizable management workflow for de novo transcriptomics that includes modular units for short read cleaning, assembly, validation, annotation, and expression analysis by connecting twenty-five individual bioinformatic tools. With our software tool, we were able to compare the assessment scores based on 129 distinct single-, bi- and tri-assembler combinations with diverse k-mer size selections. Our results demonstrate a drastic increase in the quality of transcriptome assemblies with bi- and tri- assembler combinations. We aim for our software to improve de novo transcriptome reconstructions for the ever-growing landscape of RNA-seq data derived from non-model systems. We offer guidance to ensure the most complete transcriptomic reconstructions via the inclusion of modular multi-assembly software controlled from a single master console.
format Online
Article
Text
id pubmed-8304035
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83040352021-07-25 Pincho: A Modular Approach to High Quality De Novo Transcriptomics Ortiz, Randy Gera, Priyanka Rivera, Christopher Santos, Juan C. Genes (Basel) Technical Note Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit limited standardization and customization. The increasing number of transcriptome assembly software continues to provide little room for standardization which is exacerbated by the lack of studies on modularity that compare the effects of assembler synergy. We developed a customizable management workflow for de novo transcriptomics that includes modular units for short read cleaning, assembly, validation, annotation, and expression analysis by connecting twenty-five individual bioinformatic tools. With our software tool, we were able to compare the assessment scores based on 129 distinct single-, bi- and tri-assembler combinations with diverse k-mer size selections. Our results demonstrate a drastic increase in the quality of transcriptome assemblies with bi- and tri- assembler combinations. We aim for our software to improve de novo transcriptome reconstructions for the ever-growing landscape of RNA-seq data derived from non-model systems. We offer guidance to ensure the most complete transcriptomic reconstructions via the inclusion of modular multi-assembly software controlled from a single master console. MDPI 2021-06-22 /pmc/articles/PMC8304035/ /pubmed/34206353 http://dx.doi.org/10.3390/genes12070953 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Technical Note
Ortiz, Randy
Gera, Priyanka
Rivera, Christopher
Santos, Juan C.
Pincho: A Modular Approach to High Quality De Novo Transcriptomics
title Pincho: A Modular Approach to High Quality De Novo Transcriptomics
title_full Pincho: A Modular Approach to High Quality De Novo Transcriptomics
title_fullStr Pincho: A Modular Approach to High Quality De Novo Transcriptomics
title_full_unstemmed Pincho: A Modular Approach to High Quality De Novo Transcriptomics
title_short Pincho: A Modular Approach to High Quality De Novo Transcriptomics
title_sort pincho: a modular approach to high quality de novo transcriptomics
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8304035/
https://www.ncbi.nlm.nih.gov/pubmed/34206353
http://dx.doi.org/10.3390/genes12070953
work_keys_str_mv AT ortizrandy pinchoamodularapproachtohighqualitydenovotranscriptomics
AT gerapriyanka pinchoamodularapproachtohighqualitydenovotranscriptomics
AT riverachristopher pinchoamodularapproachtohighqualitydenovotranscriptomics
AT santosjuanc pinchoamodularapproachtohighqualitydenovotranscriptomics