Cargando…

Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing

BACKGROUND: Massively parallel sequencing readouts of epigenomic assays are enabling integrative genome-wide analyses of genomic and epigenomic variation. Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including...

Descripción completa

Detalles Bibliográficos
Autores principales: Coarfa, Cristian, Yu, Fuli, Miller, Christopher A, Chen, Zuozhou, Harris, R Alan, Milosavljevic, Aleksandar
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001746/
https://www.ncbi.nlm.nih.gov/pubmed/21092284
http://dx.doi.org/10.1186/1471-2105-11-572
_version_ 1782193659701100544
author Coarfa, Cristian
Yu, Fuli
Miller, Christopher A
Chen, Zuozhou
Harris, R Alan
Milosavljevic, Aleksandar
author_facet Coarfa, Cristian
Yu, Fuli
Miller, Christopher A
Chen, Zuozhou
Harris, R Alan
Milosavljevic, Aleksandar
author_sort Coarfa, Cristian
collection PubMed
description BACKGROUND: Massively parallel sequencing readouts of epigenomic assays are enabling integrative genome-wide analyses of genomic and epigenomic variation. Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including ChIP-Seq and methylome mapping by whole-genome bisulfite sequencing. RESULTS: Pash 3.0 generally matches the accuracy and speed of niche programs for fast mapping of short reads, and exceeds their performance on longer reads generated by a new generation of massively parallel sequencing technologies. By exploiting longer read lengths, Pash 3.0 maps reads onto the large fraction of genomic DNA that contains repetitive elements and polymorphic sites, including indel polymorphisms. CONCLUSIONS: We demonstrate the versatility of Pash 3.0 by analyzing the interaction between CpG methylation, CpG SNPs, and imprinting based on publicly available whole-genome shotgun bisulfite sequencing data. Pash 3.0 makes use of gapped k-mer alignment, a non-seed based comparison method, which is implemented using multi-positional hash tables. This allows Pash 3.0 to run on diverse hardware platforms, including individual computers with standard RAM capacity, multi-core hardware architectures and large clusters.
format Text
id pubmed-3001746
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30017462010-12-15 Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing Coarfa, Cristian Yu, Fuli Miller, Christopher A Chen, Zuozhou Harris, R Alan Milosavljevic, Aleksandar BMC Bioinformatics Software BACKGROUND: Massively parallel sequencing readouts of epigenomic assays are enabling integrative genome-wide analyses of genomic and epigenomic variation. Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including ChIP-Seq and methylome mapping by whole-genome bisulfite sequencing. RESULTS: Pash 3.0 generally matches the accuracy and speed of niche programs for fast mapping of short reads, and exceeds their performance on longer reads generated by a new generation of massively parallel sequencing technologies. By exploiting longer read lengths, Pash 3.0 maps reads onto the large fraction of genomic DNA that contains repetitive elements and polymorphic sites, including indel polymorphisms. CONCLUSIONS: We demonstrate the versatility of Pash 3.0 by analyzing the interaction between CpG methylation, CpG SNPs, and imprinting based on publicly available whole-genome shotgun bisulfite sequencing data. Pash 3.0 makes use of gapped k-mer alignment, a non-seed based comparison method, which is implemented using multi-positional hash tables. This allows Pash 3.0 to run on diverse hardware platforms, including individual computers with standard RAM capacity, multi-core hardware architectures and large clusters. BioMed Central 2010-11-23 /pmc/articles/PMC3001746/ /pubmed/21092284 http://dx.doi.org/10.1186/1471-2105-11-572 Text en Copyright ©2010 Coarfa et al; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Coarfa, Cristian
Yu, Fuli
Miller, Christopher A
Chen, Zuozhou
Harris, R Alan
Milosavljevic, Aleksandar
Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
title Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
title_full Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
title_fullStr Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
title_full_unstemmed Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
title_short Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
title_sort pash 3.0: a versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel dna sequencing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001746/
https://www.ncbi.nlm.nih.gov/pubmed/21092284
http://dx.doi.org/10.1186/1471-2105-11-572
work_keys_str_mv AT coarfacristian pash30aversatilesoftwarepackageforreadmappingandintegrativeanalysisofgenomicandepigenomicvariationusingmassivelyparalleldnasequencing
AT yufuli pash30aversatilesoftwarepackageforreadmappingandintegrativeanalysisofgenomicandepigenomicvariationusingmassivelyparalleldnasequencing
AT millerchristophera pash30aversatilesoftwarepackageforreadmappingandintegrativeanalysisofgenomicandepigenomicvariationusingmassivelyparalleldnasequencing
AT chenzuozhou pash30aversatilesoftwarepackageforreadmappingandintegrativeanalysisofgenomicandepigenomicvariationusingmassivelyparalleldnasequencing
AT harrisralan pash30aversatilesoftwarepackageforreadmappingandintegrativeanalysisofgenomicandepigenomicvariationusingmassivelyparalleldnasequencing
AT milosavljevicaleksandar pash30aversatilesoftwarepackageforreadmappingandintegrativeanalysisofgenomicandepigenomicvariationusingmassivelyparalleldnasequencing