Cargando…

Software for Computing and Annotating Genomic Ranges

We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Lawrence, Michael, Huber, Wolfgang, Pagès, Hervé, Aboyoun, Patrick, Carlson, Marc, Gentleman, Robert, Morgan, Martin T., Carey, Vincent J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3738458/
https://www.ncbi.nlm.nih.gov/pubmed/23950696
http://dx.doi.org/10.1371/journal.pcbi.1003118
Descripción
Sumario:We describe Bioconductor infrastructure for representing and computing on annotated genomic ranges and integrating genomic data with the statistical computing features of R and its extensions. At the core of the infrastructure are three packages: IRanges, GenomicRanges, and GenomicFeatures. These packages provide scalable data structures for representing annotated ranges on the genome, with special support for transcript structures, read alignments and coverage vectors. Computational facilities include efficient algorithms for overlap and nearest neighbor detection, coverage calculation and other range operations. This infrastructure directly supports more than 80 other Bioconductor packages, including those for sequence analysis, differential expression analysis and visualization.