Cargando…
Pincho: A Modular Approach to High Quality De Novo Transcriptomics
Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8304035/ https://www.ncbi.nlm.nih.gov/pubmed/34206353 http://dx.doi.org/10.3390/genes12070953 |
_version_ | 1783727235990028288 |
---|---|
author | Ortiz, Randy Gera, Priyanka Rivera, Christopher Santos, Juan C. |
author_facet | Ortiz, Randy Gera, Priyanka Rivera, Christopher Santos, Juan C. |
author_sort | Ortiz, Randy |
collection | PubMed |
description | Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit limited standardization and customization. The increasing number of transcriptome assembly software continues to provide little room for standardization which is exacerbated by the lack of studies on modularity that compare the effects of assembler synergy. We developed a customizable management workflow for de novo transcriptomics that includes modular units for short read cleaning, assembly, validation, annotation, and expression analysis by connecting twenty-five individual bioinformatic tools. With our software tool, we were able to compare the assessment scores based on 129 distinct single-, bi- and tri-assembler combinations with diverse k-mer size selections. Our results demonstrate a drastic increase in the quality of transcriptome assemblies with bi- and tri- assembler combinations. We aim for our software to improve de novo transcriptome reconstructions for the ever-growing landscape of RNA-seq data derived from non-model systems. We offer guidance to ensure the most complete transcriptomic reconstructions via the inclusion of modular multi-assembly software controlled from a single master console. |
format | Online Article Text |
id | pubmed-8304035 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83040352021-07-25 Pincho: A Modular Approach to High Quality De Novo Transcriptomics Ortiz, Randy Gera, Priyanka Rivera, Christopher Santos, Juan C. Genes (Basel) Technical Note Transcriptomic reconstructions without reference (i.e., de novo) are common for data samples derived from non-model biological systems. These assemblies involve massive parallel short read sequence reconstructions from experiments, but they usually employ ad-hoc bioinformatic workflows that exhibit limited standardization and customization. The increasing number of transcriptome assembly software continues to provide little room for standardization which is exacerbated by the lack of studies on modularity that compare the effects of assembler synergy. We developed a customizable management workflow for de novo transcriptomics that includes modular units for short read cleaning, assembly, validation, annotation, and expression analysis by connecting twenty-five individual bioinformatic tools. With our software tool, we were able to compare the assessment scores based on 129 distinct single-, bi- and tri-assembler combinations with diverse k-mer size selections. Our results demonstrate a drastic increase in the quality of transcriptome assemblies with bi- and tri- assembler combinations. We aim for our software to improve de novo transcriptome reconstructions for the ever-growing landscape of RNA-seq data derived from non-model systems. We offer guidance to ensure the most complete transcriptomic reconstructions via the inclusion of modular multi-assembly software controlled from a single master console. MDPI 2021-06-22 /pmc/articles/PMC8304035/ /pubmed/34206353 http://dx.doi.org/10.3390/genes12070953 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Technical Note Ortiz, Randy Gera, Priyanka Rivera, Christopher Santos, Juan C. Pincho: A Modular Approach to High Quality De Novo Transcriptomics |
title | Pincho: A Modular Approach to High Quality De Novo Transcriptomics |
title_full | Pincho: A Modular Approach to High Quality De Novo Transcriptomics |
title_fullStr | Pincho: A Modular Approach to High Quality De Novo Transcriptomics |
title_full_unstemmed | Pincho: A Modular Approach to High Quality De Novo Transcriptomics |
title_short | Pincho: A Modular Approach to High Quality De Novo Transcriptomics |
title_sort | pincho: a modular approach to high quality de novo transcriptomics |
topic | Technical Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8304035/ https://www.ncbi.nlm.nih.gov/pubmed/34206353 http://dx.doi.org/10.3390/genes12070953 |
work_keys_str_mv | AT ortizrandy pinchoamodularapproachtohighqualitydenovotranscriptomics AT gerapriyanka pinchoamodularapproachtohighqualitydenovotranscriptomics AT riverachristopher pinchoamodularapproachtohighqualitydenovotranscriptomics AT santosjuanc pinchoamodularapproachtohighqualitydenovotranscriptomics |