Cargando…

cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing

BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMto...

Descripción completa

Detalles Bibliográficos
Autores principales: Takeuchi, Toshiki, Yamada, Atsuo, Aoki, Takashi, Nishimura, Kunihiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987990/
https://www.ncbi.nlm.nih.gov/pubmed/27536334
http://dx.doi.org/10.1186/s13029-016-0058-6
_version_ 1782448385197867008
author Takeuchi, Toshiki
Yamada, Atsuo
Aoki, Takashi
Nishimura, Kunihiro
author_facet Takeuchi, Toshiki
Yamada, Atsuo
Aoki, Takashi
Nishimura, Kunihiro
author_sort Takeuchi, Toshiki
collection PubMed
description BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required. RESULTS: We have developed cljam using the Clojure programming language, which simplifies parallel programming, to handle SAM/BAM data. Cljam can run in a Java runtime environment (e.g., Windows, Linux, Mac OS X) with Clojure. CONCLUSIONS: Cljam can process and analyze SAM/BAM files in parallel and at high speed. The execution time with cljam is almost the same as with SAMtools. The cljam code is written in Clojure and has fewer lines than other similar tools.
format Online
Article
Text
id pubmed-4987990
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49879902016-08-18 cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing Takeuchi, Toshiki Yamada, Atsuo Aoki, Takashi Nishimura, Kunihiro Source Code Biol Med Software BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required. RESULTS: We have developed cljam using the Clojure programming language, which simplifies parallel programming, to handle SAM/BAM data. Cljam can run in a Java runtime environment (e.g., Windows, Linux, Mac OS X) with Clojure. CONCLUSIONS: Cljam can process and analyze SAM/BAM files in parallel and at high speed. The execution time with cljam is almost the same as with SAMtools. The cljam code is written in Clojure and has fewer lines than other similar tools. BioMed Central 2016-08-17 /pmc/articles/PMC4987990/ /pubmed/27536334 http://dx.doi.org/10.1186/s13029-016-0058-6 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Takeuchi, Toshiki
Yamada, Atsuo
Aoki, Takashi
Nishimura, Kunihiro
cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
title cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
title_full cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
title_fullStr cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
title_full_unstemmed cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
title_short cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
title_sort cljam: a library for handling dna sequence alignment/map (sam) with parallel processing
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987990/
https://www.ncbi.nlm.nih.gov/pubmed/27536334
http://dx.doi.org/10.1186/s13029-016-0058-6
work_keys_str_mv AT takeuchitoshiki cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing
AT yamadaatsuo cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing
AT aokitakashi cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing
AT nishimurakunihiro cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing