Cargando…

BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data

The emergence of next-generation sequencing (NGS) technologies has significantly improved sequencing throughput and reduced costs. However, the short read length, duplicate reads and massive volume of data make the data processing much more difficult and complicated than the first-generation sequenc...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhang, Tongwu, Luo, Yingfeng, Liu, Kan, Pan, Linlin, Zhang, Bing, Yu, Jun, Hu, Songnian
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2011
Materias:	Application Note
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054156/ https://www.ncbi.nlm.nih.gov/pubmed/22289480 http://dx.doi.org/10.1016/S1672-0229(11)60027-2

_version_	1782458539207294976
author	Zhang, Tongwu Luo, Yingfeng Liu, Kan Pan, Linlin Zhang, Bing Yu, Jun Hu, Songnian
author_facet	Zhang, Tongwu Luo, Yingfeng Liu, Kan Pan, Linlin Zhang, Bing Yu, Jun Hu, Songnian
author_sort	Zhang, Tongwu
collection	PubMed
description	The emergence of next-generation sequencing (NGS) technologies has significantly improved sequencing throughput and reduced costs. However, the short read length, duplicate reads and massive volume of data make the data processing much more difficult and complicated than the first-generation sequencing technology. Although there are some software packages developed to assess the data quality, those packages either are not easily available to users or require bioinformatics skills and computer resources. Moreover, almost all the quality assessment software currently available didn’t taken into account the sequencing errors when dealing with the duplicate assessment in NGS data. Here, we present a new user-friendly quality assessment software package called BIGpre, which works for both Illumina and 454 platforms. BIGpre contains all the functions of other quality assessment software, such as the correlation between forward and reverse reads, read GC-content distribution, and base Ns quality. More importantly, BIGpre incorporates associated programs to detect and remove duplicate reads after taking sequencing errors into account and trimming low quality reads from raw data as well. BIGpre is primarily written in Perl and integrates graphical capability from the statistics package R. This package produces both tabular and graphical summaries of data quality for sequencing datasets from Illumina and 454 platforms. Processing hundreds of millions reads within minutes, this package provides immediate diagnostic information for user to manipulate sequencing data for downstream analyses. BIGpre is freely available at http://bigpre.sourceforge.net/.
format	Online Article Text
id	pubmed-5054156
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-50541562016-10-14 BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data Zhang, Tongwu Luo, Yingfeng Liu, Kan Pan, Linlin Zhang, Bing Yu, Jun Hu, Songnian Genomics Proteomics Bioinformatics Application Note The emergence of next-generation sequencing (NGS) technologies has significantly improved sequencing throughput and reduced costs. However, the short read length, duplicate reads and massive volume of data make the data processing much more difficult and complicated than the first-generation sequencing technology. Although there are some software packages developed to assess the data quality, those packages either are not easily available to users or require bioinformatics skills and computer resources. Moreover, almost all the quality assessment software currently available didn’t taken into account the sequencing errors when dealing with the duplicate assessment in NGS data. Here, we present a new user-friendly quality assessment software package called BIGpre, which works for both Illumina and 454 platforms. BIGpre contains all the functions of other quality assessment software, such as the correlation between forward and reverse reads, read GC-content distribution, and base Ns quality. More importantly, BIGpre incorporates associated programs to detect and remove duplicate reads after taking sequencing errors into account and trimming low quality reads from raw data as well. BIGpre is primarily written in Perl and integrates graphical capability from the statistics package R. This package produces both tabular and graphical summaries of data quality for sequencing datasets from Illumina and 454 platforms. Processing hundreds of millions reads within minutes, this package provides immediate diagnostic information for user to manipulate sequencing data for downstream analyses. BIGpre is freely available at http://bigpre.sourceforge.net/. Elsevier 2011-12 2012-01-28 /pmc/articles/PMC5054156/ /pubmed/22289480 http://dx.doi.org/10.1016/S1672-0229(11)60027-2 Text en © 2011 Beijing Institute of Genomics http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle	Application Note Zhang, Tongwu Luo, Yingfeng Liu, Kan Pan, Linlin Zhang, Bing Yu, Jun Hu, Songnian BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data
title	BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data
title_full	BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data
title_fullStr	BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data
title_full_unstemmed	BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data
title_short	BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data
title_sort	bigpre: a quality assessment package for next-generation sequencing data
topic	Application Note
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5054156/ https://www.ncbi.nlm.nih.gov/pubmed/22289480 http://dx.doi.org/10.1016/S1672-0229(11)60027-2
work_keys_str_mv	AT zhangtongwu bigpreaqualityassessmentpackagefornextgenerationsequencingdata AT luoyingfeng bigpreaqualityassessmentpackagefornextgenerationsequencingdata AT liukan bigpreaqualityassessmentpackagefornextgenerationsequencingdata AT panlinlin bigpreaqualityassessmentpackagefornextgenerationsequencingdata AT zhangbing bigpreaqualityassessmentpackagefornextgenerationsequencingdata AT yujun bigpreaqualityassessmentpackagefornextgenerationsequencingdata AT husongnian bigpreaqualityassessmentpackagefornextgenerationsequencingdata

BIGpre: A Quality Assessment Package for Next-Generation Sequencing Data

Ejemplares similares