Cargando…

deML: robust demultiplexing of Illumina sequences using a likelihood-based approach

Motivation: Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demulti...

Descripción completa

Detalles Bibliográficos
Autores principales: Renaud, Gabriel, Stenzel, Udo, Maricic, Tomislav, Wiebe, Victor, Kelso, Janet
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4341068/
https://www.ncbi.nlm.nih.gov/pubmed/25359895
http://dx.doi.org/10.1093/bioinformatics/btu719
_version_ 1782359112030355456
author Renaud, Gabriel
Stenzel, Udo
Maricic, Tomislav
Wiebe, Victor
Kelso, Janet
author_facet Renaud, Gabriel
Stenzel, Udo
Maricic, Tomislav
Wiebe, Victor
Kelso, Janet
author_sort Renaud, Gabriel
collection PubMed
description Motivation: Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demultiplexing. Demultiplexing software typically identifies the sample of origin using a fixed number of mismatches between the read index and a reference index set. This approach may fail or misassign reads when the sequencing quality of the indices is poor. Results: We introduce deML, a maximum likelihood algorithm that demultiplexes Illumina sequences. deML computes the likelihood of an observed index sequence being derived from a specified sample. A quality score which reflects the probability of the assignment being correct is generated for each read. Using these quality scores, even very problematic datasets can be demultiplexed and an error threshold can be set. Availability and implementation: deML is freely available for use under the GPL (http://bioinf.eva.mpg.de/deml/). Contact: gabriel.reno@gmail.com or kelso@eva.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-4341068
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-43410682015-03-10 deML: robust demultiplexing of Illumina sequences using a likelihood-based approach Renaud, Gabriel Stenzel, Udo Maricic, Tomislav Wiebe, Victor Kelso, Janet Bioinformatics Applications Notes Motivation: Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demultiplexing. Demultiplexing software typically identifies the sample of origin using a fixed number of mismatches between the read index and a reference index set. This approach may fail or misassign reads when the sequencing quality of the indices is poor. Results: We introduce deML, a maximum likelihood algorithm that demultiplexes Illumina sequences. deML computes the likelihood of an observed index sequence being derived from a specified sample. A quality score which reflects the probability of the assignment being correct is generated for each read. Using these quality scores, even very problematic datasets can be demultiplexed and an error threshold can be set. Availability and implementation: deML is freely available for use under the GPL (http://bioinf.eva.mpg.de/deml/). Contact: gabriel.reno@gmail.com or kelso@eva.mpg.de Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2015-03-01 2014-10-30 /pmc/articles/PMC4341068/ /pubmed/25359895 http://dx.doi.org/10.1093/bioinformatics/btu719 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Renaud, Gabriel
Stenzel, Udo
Maricic, Tomislav
Wiebe, Victor
Kelso, Janet
deML: robust demultiplexing of Illumina sequences using a likelihood-based approach
title deML: robust demultiplexing of Illumina sequences using a likelihood-based approach
title_full deML: robust demultiplexing of Illumina sequences using a likelihood-based approach
title_fullStr deML: robust demultiplexing of Illumina sequences using a likelihood-based approach
title_full_unstemmed deML: robust demultiplexing of Illumina sequences using a likelihood-based approach
title_short deML: robust demultiplexing of Illumina sequences using a likelihood-based approach
title_sort deml: robust demultiplexing of illumina sequences using a likelihood-based approach
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4341068/
https://www.ncbi.nlm.nih.gov/pubmed/25359895
http://dx.doi.org/10.1093/bioinformatics/btu719
work_keys_str_mv AT renaudgabriel demlrobustdemultiplexingofilluminasequencesusingalikelihoodbasedapproach
AT stenzeludo demlrobustdemultiplexingofilluminasequencesusingalikelihoodbasedapproach
AT maricictomislav demlrobustdemultiplexingofilluminasequencesusingalikelihoodbasedapproach
AT wiebevictor demlrobustdemultiplexingofilluminasequencesusingalikelihoodbasedapproach
AT kelsojanet demlrobustdemultiplexingofilluminasequencesusingalikelihoodbasedapproach