Cargando…

A New Binning Method for Metagenomics by One-Dimensional Cellular Automata

More and more developed and inexpensive next-generation sequencing (NGS) technologies allow us to extract vast sequence data from a sample containing multiple species. Characterizing the taxonomic diversity for the planet-size data plays an important role in the metagenomic studies, while a crucial...

Descripción completa

Detalles Bibliográficos
Autor principal: Lin, Ying-Chih
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4628670/
https://www.ncbi.nlm.nih.gov/pubmed/26557648
http://dx.doi.org/10.1155/2015/197895
_version_ 1782398467159621632
author Lin, Ying-Chih
author_facet Lin, Ying-Chih
author_sort Lin, Ying-Chih
collection PubMed
description More and more developed and inexpensive next-generation sequencing (NGS) technologies allow us to extract vast sequence data from a sample containing multiple species. Characterizing the taxonomic diversity for the planet-size data plays an important role in the metagenomic studies, while a crucial step for doing the study is the binning process to group sequence reads from similar species or taxonomic classes. The metagenomic binning remains a challenge work because of not only the various read noises but also the tremendous data volume. In this work, we propose an unsupervised binning method for NGS reads based on the one-dimensional cellular automaton (1D-CA). Our binning method facilities to reduce the memory usage because 1D-CA costs only linear space. Experiments on synthetic dataset exhibit that our method is helpful to identify species of lower abundance compared to the proposed tool.
format Online
Article
Text
id pubmed-4628670
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46286702015-11-09 A New Binning Method for Metagenomics by One-Dimensional Cellular Automata Lin, Ying-Chih Int J Genomics Research Article More and more developed and inexpensive next-generation sequencing (NGS) technologies allow us to extract vast sequence data from a sample containing multiple species. Characterizing the taxonomic diversity for the planet-size data plays an important role in the metagenomic studies, while a crucial step for doing the study is the binning process to group sequence reads from similar species or taxonomic classes. The metagenomic binning remains a challenge work because of not only the various read noises but also the tremendous data volume. In this work, we propose an unsupervised binning method for NGS reads based on the one-dimensional cellular automaton (1D-CA). Our binning method facilities to reduce the memory usage because 1D-CA costs only linear space. Experiments on synthetic dataset exhibit that our method is helpful to identify species of lower abundance compared to the proposed tool. Hindawi Publishing Corporation 2015 2015-10-18 /pmc/articles/PMC4628670/ /pubmed/26557648 http://dx.doi.org/10.1155/2015/197895 Text en Copyright © 2015 Ying-Chih Lin. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lin, Ying-Chih
A New Binning Method for Metagenomics by One-Dimensional Cellular Automata
title A New Binning Method for Metagenomics by One-Dimensional Cellular Automata
title_full A New Binning Method for Metagenomics by One-Dimensional Cellular Automata
title_fullStr A New Binning Method for Metagenomics by One-Dimensional Cellular Automata
title_full_unstemmed A New Binning Method for Metagenomics by One-Dimensional Cellular Automata
title_short A New Binning Method for Metagenomics by One-Dimensional Cellular Automata
title_sort new binning method for metagenomics by one-dimensional cellular automata
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4628670/
https://www.ncbi.nlm.nih.gov/pubmed/26557648
http://dx.doi.org/10.1155/2015/197895
work_keys_str_mv AT linyingchih anewbinningmethodformetagenomicsbyonedimensionalcellularautomata
AT linyingchih newbinningmethodformetagenomicsbyonedimensionalcellularautomata