CAPRG: Sequence Assembling Pipeline for Next Generation Sequencing of Non-Model Organisms

Our goal is to introduce and describe the utility of a new pipeline “Contigs Assembly Pipeline using Reference Genome” (CAPRG), which has been developed to assemble “long sequence reads” for non-model organisms by leveraging a reference genome of a closely related phylogenetic relative. To facilitat...

Bibliographic Details
Main Authors: Rawat, Arun; Elasri, Mohamed O.; Gust, Kurt A.; George, Glover; Pham, Don; Scanlan, Leona D.; Vulpe, Chris; Perkins, Edward J.
Format: Online Article Text
Language: English
Published: Public Library of Science 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3272009/
https://www.ncbi.nlm.nih.gov/pubmed/22319566
http://dx.doi.org/10.1371/journal.pone.0030370
_version_ 1782222773470363648
author Rawat, Arun
Elasri, Mohamed O.
Gust, Kurt A.
George, Glover
Pham, Don
Scanlan, Leona D.
Vulpe, Chris
Perkins, Edward J.
author_sort Rawat, Arun
collection PubMed
description Our goal is to introduce and describe the utility of a new pipeline “Contigs Assembly Pipeline using Reference Genome” (CAPRG), which has been developed to assemble “long sequence reads” for non-model organisms by leveraging a reference genome of a closely related phylogenetic relative. To facilitate this effort, we utilized two avian transcriptomic datasets generated using ROCHE/454 technology as test cases for CAPRG assembly. We compared the results of CAPRG assembly using a reference genome with the results of existing methods that utilize de novo strategies such as VELVET, PAVE, and MIRA by employing parameter space comparisons (intra-assembling comparison). CAPRG performed as well or better than the existing assembly methods based on various benchmarks for “gene-hunting.” Further, CAPRG completed the assemblies in a fraction of the time required by the existing assembly algorithms. Additional advantages of CAPRG included reduced contig inflation resulting in lower computational resources for annotation, and functional identification for contigs that may be categorized as “unknowns” by de novo methods. In addition to providing evaluation of CAPRG performance, we observed that the different assembly (inter-assembly) results could be integrated to enhance the putative gene coverage for any transcriptomics study.
format Online
Article
Text
id pubmed-3272009
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-32720092012-02-08 CAPRG: Sequence Assembling Pipeline for Next Generation Sequencing of Non-Model Organisms Rawat, Arun Elasri, Mohamed O. Gust, Kurt A. George, Glover Pham, Don Scanlan, Leona D. Vulpe, Chris Perkins, Edward J. PLoS One Research Article Our goal is to introduce and describe the utility of a new pipeline “Contigs Assembly Pipeline using Reference Genome” (CAPRG), which has been developed to assemble “long sequence reads” for non-model organisms by leveraging a reference genome of a closely related phylogenetic relative. To facilitate this effort, we utilized two avian transcriptomic datasets generated using ROCHE/454 technology as test cases for CAPRG assembly. We compared the results of CAPRG assembly using a reference genome with the results of existing methods that utilize de novo strategies such as VELVET, PAVE, and MIRA by employing parameter space comparisons (intra-assembling comparison). CAPRG performed as well or better than the existing assembly methods based on various benchmarks for “gene-hunting.” Further, CAPRG completed the assemblies in a fraction of the time required by the existing assembly algorithms. Additional advantages of CAPRG included reduced contig inflation resulting in lower computational resources for annotation, and functional identification for contigs that may be categorized as “unknowns” by de novo methods. In addition to providing evaluation of CAPRG performance, we observed that the different assembly (inter-assembly) results could be integrated to enhance the putative gene coverage for any transcriptomics study. Public Library of Science 2012-02-03 /pmc/articles/PMC3272009/ /pubmed/22319566 http://dx.doi.org/10.1371/journal.pone.0030370 Text en Rawat et al. 
http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title CAPRG: Sequence Assembling Pipeline for Next Generation Sequencing of Non-Model Organisms
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3272009/
https://www.ncbi.nlm.nih.gov/pubmed/22319566
http://dx.doi.org/10.1371/journal.pone.0030370