
SRAdb: query and use public next-generation sequencing data from within R

BACKGROUND: The Sequence Read Archive (SRA) is the largest public repository of sequencing data from the next generation of sequencing platforms including Illumina (Genome Analyzer, HiSeq, MiSeq, etc.), Roche 454 GS System, Applied Biosystems SOLiD System, Helicos Heliscope, PacBio RS, and others. R...


Bibliographic Details
Main Authors:	Zhu, Yuelin, Stephens, Robert M, Meltzer, Paul S, Davis, Sean R
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2013
Subjects:	Software
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3560148/ https://www.ncbi.nlm.nih.gov/pubmed/23323543 http://dx.doi.org/10.1186/1471-2105-14-19

_version_	1782257742940995584
author	Zhu, Yuelin Stephens, Robert M Meltzer, Paul S Davis, Sean R
author_facet	Zhu, Yuelin Stephens, Robert M Meltzer, Paul S Davis, Sean R
author_sort	Zhu, Yuelin
collection	PubMed
description	BACKGROUND: The Sequence Read Archive (SRA) is the largest public repository of sequencing data from the next generation of sequencing platforms including Illumina (Genome Analyzer, HiSeq, MiSeq, etc.), Roche 454 GS System, Applied Biosystems SOLiD System, Helicos Heliscope, PacBio RS, and others. RESULTS: SRAdb is an attempt to make queries of the metadata associated with SRA submission, study, sample, experiment and run more robust and precise, and make access to sequencing data in the SRA easier. We have parsed all the SRA metadata into a SQLite database that is routinely updated and can be easily distributed. The SRAdb R/Bioconductor package then utilizes this SQLite database for querying and accessing metadata. Full text search functionality makes querying metadata very flexible and powerful. FASTQ files associated with query results can be downloaded easily for local analysis. The package also includes an interface from R to a popular genome browser, the Integrative Genomics Viewer. CONCLUSIONS: The SRAdb Bioconductor package provides a convenient and integrated framework to query and access SRA metadata quickly and powerfully from within R.
format	Online Article Text
id	pubmed-3560148
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-3560148 2013-02-04 SRAdb: query and use public next-generation sequencing data from within R Zhu, Yuelin Stephens, Robert M Meltzer, Paul S Davis, Sean R BMC Bioinformatics Software BACKGROUND: The Sequence Read Archive (SRA) is the largest public repository of sequencing data from the next generation of sequencing platforms including Illumina (Genome Analyzer, HiSeq, MiSeq, etc.), Roche 454 GS System, Applied Biosystems SOLiD System, Helicos Heliscope, PacBio RS, and others. RESULTS: SRAdb is an attempt to make queries of the metadata associated with SRA submission, study, sample, experiment and run more robust and precise, and make access to sequencing data in the SRA easier. We have parsed all the SRA metadata into a SQLite database that is routinely updated and can be easily distributed. The SRAdb R/Bioconductor package then utilizes this SQLite database for querying and accessing metadata. Full text search functionality makes querying metadata very flexible and powerful. FASTQ files associated with query results can be downloaded easily for local analysis. The package also includes an interface from R to a popular genome browser, the Integrative Genomics Viewer. CONCLUSIONS: The SRAdb Bioconductor package provides a convenient and integrated framework to query and access SRA metadata quickly and powerfully from within R. BioMed Central 2013-01-17 /pmc/articles/PMC3560148/ /pubmed/23323543 http://dx.doi.org/10.1186/1471-2105-14-19 Text en Copyright ©2013 Zhu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Software Zhu, Yuelin Stephens, Robert M Meltzer, Paul S Davis, Sean R SRAdb: query and use public next-generation sequencing data from within R
title	SRAdb: query and use public next-generation sequencing data from within R
topic	Software
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3560148/ https://www.ncbi.nlm.nih.gov/pubmed/23323543 http://dx.doi.org/10.1186/1471-2105-14-19


Similar Items