Cargando…

Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences

MOTIVATION: Paired-end sequencing protocols, offered by next generation sequencing (NGS) platforms like Illumia, generate a pair of reads for every DNA fragment in a sample. Although this protocol has been utilized for several metagenomics studies, most taxonomic binning approaches classify each of...

Descripción completa

Detalles Bibliográficos
Autores principales: Dutta, Anirban, Tandon, Disha, MH, Mohammed, Bose, Tungadri, Mande, Sharmila S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4281075/
https://www.ncbi.nlm.nih.gov/pubmed/25551450
http://dx.doi.org/10.1371/journal.pone.0114814
_version_ 1782350934667427840
author Dutta, Anirban
Tandon, Disha
MH, Mohammed
Bose, Tungadri
Mande, Sharmila S.
author_facet Dutta, Anirban
Tandon, Disha
MH, Mohammed
Bose, Tungadri
Mande, Sharmila S.
author_sort Dutta, Anirban
collection PubMed
description MOTIVATION: Paired-end sequencing protocols, offered by next generation sequencing (NGS) platforms like Illumia, generate a pair of reads for every DNA fragment in a sample. Although this protocol has been utilized for several metagenomics studies, most taxonomic binning approaches classify each of the reads (forming a pair), independently. The present work explores some simple but effective strategies of utilizing pairing-information of Illumina short reads for improving the accuracy of taxonomic binning of metagenomic datasets. The strategies proposed can be used in conjunction with all genres of existing binning methods. RESULTS: Validation results suggest that employment of these “Binpairs” strategies can provide significant improvements in the binning outcome. The quality of the taxonomic assignments thus obtained are often comparable to those that can only be achieved with relatively longer reads obtained using other NGS platforms (such as Roche). AVAILABILITY: An implementation of the proposed strategies of utilizing pairing information is freely available for academic users at https://metagenomics.atc.tcs.com/binning/binpairs.
format Online
Article
Text
id pubmed-4281075
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42810752015-01-07 Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences Dutta, Anirban Tandon, Disha MH, Mohammed Bose, Tungadri Mande, Sharmila S. PLoS One Research Article MOTIVATION: Paired-end sequencing protocols, offered by next generation sequencing (NGS) platforms like Illumia, generate a pair of reads for every DNA fragment in a sample. Although this protocol has been utilized for several metagenomics studies, most taxonomic binning approaches classify each of the reads (forming a pair), independently. The present work explores some simple but effective strategies of utilizing pairing-information of Illumina short reads for improving the accuracy of taxonomic binning of metagenomic datasets. The strategies proposed can be used in conjunction with all genres of existing binning methods. RESULTS: Validation results suggest that employment of these “Binpairs” strategies can provide significant improvements in the binning outcome. The quality of the taxonomic assignments thus obtained are often comparable to those that can only be achieved with relatively longer reads obtained using other NGS platforms (such as Roche). AVAILABILITY: An implementation of the proposed strategies of utilizing pairing information is freely available for academic users at https://metagenomics.atc.tcs.com/binning/binpairs. Public Library of Science 2014-12-31 /pmc/articles/PMC4281075/ /pubmed/25551450 http://dx.doi.org/10.1371/journal.pone.0114814 Text en © 2014 Dutta et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dutta, Anirban
Tandon, Disha
MH, Mohammed
Bose, Tungadri
Mande, Sharmila S.
Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences
title Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences
title_full Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences
title_fullStr Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences
title_full_unstemmed Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences
title_short Binpairs: Utilization of Illumina Paired-End Information for Improving Efficiency of Taxonomic Binning of Metagenomic Sequences
title_sort binpairs: utilization of illumina paired-end information for improving efficiency of taxonomic binning of metagenomic sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4281075/
https://www.ncbi.nlm.nih.gov/pubmed/25551450
http://dx.doi.org/10.1371/journal.pone.0114814
work_keys_str_mv AT duttaanirban binpairsutilizationofilluminapairedendinformationforimprovingefficiencyoftaxonomicbinningofmetagenomicsequences
AT tandondisha binpairsutilizationofilluminapairedendinformationforimprovingefficiencyoftaxonomicbinningofmetagenomicsequences
AT mhmohammed binpairsutilizationofilluminapairedendinformationforimprovingefficiencyoftaxonomicbinningofmetagenomicsequences
AT bosetungadri binpairsutilizationofilluminapairedendinformationforimprovingefficiencyoftaxonomicbinningofmetagenomicsequences
AT mandesharmilas binpairsutilizationofilluminapairedendinformationforimprovingefficiencyoftaxonomicbinningofmetagenomicsequences