
In Silico Whole Genome Sequencer and Analyzer (iWGS): a Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies

The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding it...


Bibliographic Details
Main authors: Zhou, Xiaofan, Peris, David, Kominek, Jacek, Kurtzman, Cletus P., Hittinger, Chris Todd, Rokas, Antonis
Format: Online Article Text
Language: English
Published: Genetics Society of America 2016
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5100864/
https://www.ncbi.nlm.nih.gov/pubmed/27638685
http://dx.doi.org/10.1534/g3.116.034249
_version_ 1782466204928049152
author Zhou, Xiaofan
Peris, David
Kominek, Jacek
Kurtzman, Cletus P.
Hittinger, Chris Todd
Rokas, Antonis
author_facet Zhou, Xiaofan
Peris, David
Kominek, Jacek
Kurtzman, Cletus P.
Hittinger, Chris Todd
Rokas, Antonis
author_sort Zhou, Xiaofan
collection PubMed
description The availability of genomes across the tree of life is highly biased toward vertebrates, pathogens, human disease models, and organisms with relatively small and simple genomes. Recent progress in genomics has enabled the de novo decoding of the genome of virtually any organism, greatly expanding its potential for understanding the biology and evolution of the full spectrum of biodiversity. The increasing diversity of sequencing technologies, assays, and de novo assembly algorithms has augmented the complexity of de novo genome sequencing projects in nonmodel organisms. To reduce the costs and challenges in de novo genome sequencing projects and streamline their experimental design and analysis, we developed iWGS (in silico Whole Genome Sequencer and Analyzer), an automated pipeline for guiding the choice of appropriate sequencing strategy and assembly protocols. iWGS seamlessly integrates the four key steps of a de novo genome sequencing project: data generation (through simulation), data quality control, de novo assembly, and assembly evaluation and validation. The last three steps can also be applied to the analysis of real data. iWGS is designed to give the user great flexibility in testing the range of experimental designs available for genome sequencing projects, and supports all major sequencing technologies and popular assembly tools. Three case studies illustrate how iWGS can guide the design of de novo genome sequencing projects, and evaluate the performance of a wide variety of user-specified sequencing strategies and assembly protocols on genomes of differing architectures. iWGS, along with detailed documentation, is freely available at https://github.com/zhouxiaofan1983/iWGS.
format Online
Article
Text
id pubmed-5100864
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-5100864 2016-11-09 In Silico Whole Genome Sequencer and Analyzer (iWGS): a Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies Zhou, Xiaofan Peris, David Kominek, Jacek Kurtzman, Cletus P. Hittinger, Chris Todd Rokas, Antonis G3 (Bethesda) Investigations Genetics Society of America 2016-09-15 /pmc/articles/PMC5100864/ /pubmed/27638685 http://dx.doi.org/10.1534/g3.116.034249 Text en Copyright © 2016 Zhou et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Zhou, Xiaofan
Peris, David
Kominek, Jacek
Kurtzman, Cletus P.
Hittinger, Chris Todd
Rokas, Antonis
In Silico Whole Genome Sequencer and Analyzer (iWGS): a Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies
title In Silico Whole Genome Sequencer and Analyzer (iWGS): a Computational Pipeline to Guide the Design and Analysis of de novo Genome Sequencing Studies
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5100864/
https://www.ncbi.nlm.nih.gov/pubmed/27638685
http://dx.doi.org/10.1534/g3.116.034249