Cargando…
cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing
BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMto...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987990/ https://www.ncbi.nlm.nih.gov/pubmed/27536334 http://dx.doi.org/10.1186/s13029-016-0058-6 |
_version_ | 1782448385197867008 |
---|---|
author | Takeuchi, Toshiki Yamada, Atsuo Aoki, Takashi Nishimura, Kunihiro |
author_facet | Takeuchi, Toshiki Yamada, Atsuo Aoki, Takashi Nishimura, Kunihiro |
author_sort | Takeuchi, Toshiki |
collection | PubMed |
description | BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required. RESULTS: We have developed cljam using the Clojure programming language, which simplifies parallel programming, to handle SAM/BAM data. Cljam can run in a Java runtime environment (e.g., Windows, Linux, Mac OS X) with Clojure. CONCLUSIONS: Cljam can process and analyze SAM/BAM files in parallel and at high speed. The execution time with cljam is almost the same as with SAMtools. The cljam code is written in Clojure and has fewer lines than other similar tools. |
format | Online Article Text |
id | pubmed-4987990 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49879902016-08-18 cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing Takeuchi, Toshiki Yamada, Atsuo Aoki, Takashi Nishimura, Kunihiro Source Code Biol Med Software BACKGROUND: Next-generation sequencing can determine DNA bases and the results of sequence alignments are generally stored in files in the Sequence Alignment/Map (SAM) format and the compressed binary version (BAM) of it. SAMtools is a typical tool for dealing with files in the SAM/BAM format. SAMtools has various functions, including detection of variants, visualization of alignments, indexing, extraction of parts of the data and loci, and conversion of file formats. It is written in C and can execute fast. However, SAMtools requires an additional implementation to be used in parallel with, for example, OpenMP (Open Multi-Processing) libraries. For the accumulation of next-generation sequencing data, a simple parallelization program, which can support cloud and PC cluster environments, is required. RESULTS: We have developed cljam using the Clojure programming language, which simplifies parallel programming, to handle SAM/BAM data. Cljam can run in a Java runtime environment (e.g., Windows, Linux, Mac OS X) with Clojure. CONCLUSIONS: Cljam can process and analyze SAM/BAM files in parallel and at high speed. The execution time with cljam is almost the same as with SAMtools. The cljam code is written in Clojure and has fewer lines than other similar tools. BioMed Central 2016-08-17 /pmc/articles/PMC4987990/ /pubmed/27536334 http://dx.doi.org/10.1186/s13029-016-0058-6 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Takeuchi, Toshiki Yamada, Atsuo Aoki, Takashi Nishimura, Kunihiro cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing |
title | cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing |
title_full | cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing |
title_fullStr | cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing |
title_full_unstemmed | cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing |
title_short | cljam: a library for handling DNA sequence alignment/map (SAM) with parallel processing |
title_sort | cljam: a library for handling dna sequence alignment/map (sam) with parallel processing |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4987990/ https://www.ncbi.nlm.nih.gov/pubmed/27536334 http://dx.doi.org/10.1186/s13029-016-0058-6 |
work_keys_str_mv | AT takeuchitoshiki cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing AT yamadaatsuo cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing AT aokitakashi cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing AT nishimurakunihiro cljamalibraryforhandlingdnasequencealignmentmapsamwithparallelprocessing |