Cargando…

Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels

Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference s...

Descripción completa

Detalles Bibliográficos
Autores principales: Dmitriev, Dmitry A., Rakitov, Roman A.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2429969/
https://www.ncbi.nlm.nih.gov/pubmed/18654614
http://dx.doi.org/10.1371/journal.pcbi.1000113
_version_ 1782156350284890112
author Dmitriev, Dmitry A.
Rakitov, Roman A.
author_facet Dmitriev, Dmitry A.
Rakitov, Roman A.
author_sort Dmitriev, Dmitry A.
collection PubMed
description Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference sequence or the availability of both direct and reverse mixed sequences of the same template. We describe a simple yet accurate method, which uses dynamic programming optimization to predict superimposed allelic sequences solely from a string of letters representing peaks within an individual mixed trace. We used the method to decode 104 human traces (mean length 294 bp) containing heterozygous indels 5 to 30 bp with a mean of 99.1% bases per allelic sequence reconstructed correctly and unambiguously. Simulations with artificial sequences have demonstrated that the method yields accurate reconstructions when (1) the allelic sequences forming the mixed trace are sufficiently similar, (2) the analyzed fragment is significantly longer than the indel, and (3) multiple indels, if present, are well-spaced. Because these conditions occur in most encountered DNA sequences, the method is widely applicable. It is available as a free Web application Indelligent at http://ctap.inhs.uiuc.edu/dmitriev/indel.asp.
format Text
id pubmed-2429969
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-24299692008-07-25 Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels Dmitriev, Dmitry A. Rakitov, Roman A. PLoS Comput Biol Research Article Direct Sanger sequencing of a diploid template containing a heterozygous insertion or deletion results in a difficult-to-interpret mixed trace formed by two allelic traces superimposed onto each other. Existing computational methods for deconvolution of such traces require knowledge of a reference sequence or the availability of both direct and reverse mixed sequences of the same template. We describe a simple yet accurate method, which uses dynamic programming optimization to predict superimposed allelic sequences solely from a string of letters representing peaks within an individual mixed trace. We used the method to decode 104 human traces (mean length 294 bp) containing heterozygous indels 5 to 30 bp with a mean of 99.1% bases per allelic sequence reconstructed correctly and unambiguously. Simulations with artificial sequences have demonstrated that the method yields accurate reconstructions when (1) the allelic sequences forming the mixed trace are sufficiently similar, (2) the analyzed fragment is significantly longer than the indel, and (3) multiple indels, if present, are well-spaced. Because these conditions occur in most encountered DNA sequences, the method is widely applicable. It is available as a free Web application Indelligent at http://ctap.inhs.uiuc.edu/dmitriev/indel.asp. Public Library of Science 2008-07-25 /pmc/articles/PMC2429969/ /pubmed/18654614 http://dx.doi.org/10.1371/journal.pcbi.1000113 Text en Dmitriev, Rakitov. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dmitriev, Dmitry A.
Rakitov, Roman A.
Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels
title Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels
title_full Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels
title_fullStr Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels
title_full_unstemmed Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels
title_short Decoding of Superimposed Traces Produced by Direct Sequencing of Heterozygous Indels
title_sort decoding of superimposed traces produced by direct sequencing of heterozygous indels
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2429969/
https://www.ncbi.nlm.nih.gov/pubmed/18654614
http://dx.doi.org/10.1371/journal.pcbi.1000113
work_keys_str_mv AT dmitrievdmitrya decodingofsuperimposedtracesproducedbydirectsequencingofheterozygousindels
AT rakitovromana decodingofsuperimposedtracesproducedbydirectsequencingofheterozygousindels