Cargando…

Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing

Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual n...

Descripción completa

Detalles Bibliográficos
Autores principales: Stapleton, James A., Kim, Jeongwoon, Hamilton, John P., Wu, Ming, Irber, Luiz C., Maddamsetti, Rohan, Briney, Bryan, Newton, Linsey, Burton, Dennis R., Brown, C. Titus, Chan, Christina, Buell, C. Robin, Whitehead, Timothy A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4720449/
https://www.ncbi.nlm.nih.gov/pubmed/26789840
http://dx.doi.org/10.1371/journal.pone.0147229
_version_ 1782411085837500416
author Stapleton, James A.
Kim, Jeongwoon
Hamilton, John P.
Wu, Ming
Irber, Luiz C.
Maddamsetti, Rohan
Briney, Bryan
Newton, Linsey
Burton, Dennis R.
Brown, C. Titus
Chan, Christina
Buell, C. Robin
Whitehead, Timothy A.
author_facet Stapleton, James A.
Kim, Jeongwoon
Hamilton, John P.
Wu, Ming
Irber, Luiz C.
Maddamsetti, Rohan
Briney, Bryan
Newton, Linsey
Burton, Dennis R.
Brown, C. Titus
Chan, Christina
Buell, C. Robin
Whitehead, Timothy A.
author_sort Stapleton, James A.
collection PubMed
description Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual nucleic acid molecules up to 11.6 kilobases in length from short (150-bp) reads. We show that our method can construct 99.97%-accurate synthetic reads from bacterial, plant, and animal genomic samples, full-length mRNA sequences from human cancer cell lines, and individual HIV env gene variants from a mixture. The preparation of multiple samples can be multiplexed into a single tube, further reducing effort and cost relative to competing approaches. Our approach generates sequencing libraries in three days from less than one microgram of DNA in a single-tube format without custom equipment or specialized expertise.
format Online
Article
Text
id pubmed-4720449
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-47204492016-01-30 Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing Stapleton, James A. Kim, Jeongwoon Hamilton, John P. Wu, Ming Irber, Luiz C. Maddamsetti, Rohan Briney, Bryan Newton, Linsey Burton, Dennis R. Brown, C. Titus Chan, Christina Buell, C. Robin Whitehead, Timothy A. PLoS One Research Article Next-generation DNA sequencing has revolutionized the study of biology. However, the short read lengths of the dominant instruments complicate assembly of complex genomes and haplotype phasing of mixtures of similar sequences. Here we demonstrate a method to reconstruct the sequences of individual nucleic acid molecules up to 11.6 kilobases in length from short (150-bp) reads. We show that our method can construct 99.97%-accurate synthetic reads from bacterial, plant, and animal genomic samples, full-length mRNA sequences from human cancer cell lines, and individual HIV env gene variants from a mixture. The preparation of multiple samples can be multiplexed into a single tube, further reducing effort and cost relative to competing approaches. Our approach generates sequencing libraries in three days from less than one microgram of DNA in a single-tube format without custom equipment or specialized expertise. Public Library of Science 2016-01-20 /pmc/articles/PMC4720449/ /pubmed/26789840 http://dx.doi.org/10.1371/journal.pone.0147229 Text en © 2016 Stapleton et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Stapleton, James A.
Kim, Jeongwoon
Hamilton, John P.
Wu, Ming
Irber, Luiz C.
Maddamsetti, Rohan
Briney, Bryan
Newton, Linsey
Burton, Dennis R.
Brown, C. Titus
Chan, Christina
Buell, C. Robin
Whitehead, Timothy A.
Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
title Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
title_full Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
title_fullStr Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
title_full_unstemmed Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
title_short Haplotype-Phased Synthetic Long Reads from Short-Read Sequencing
title_sort haplotype-phased synthetic long reads from short-read sequencing
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4720449/
https://www.ncbi.nlm.nih.gov/pubmed/26789840
http://dx.doi.org/10.1371/journal.pone.0147229
work_keys_str_mv AT stapletonjamesa haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT kimjeongwoon haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT hamiltonjohnp haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT wuming haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT irberluizc haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT maddamsettirohan haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT brineybryan haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT newtonlinsey haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT burtondennisr haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT brownctitus haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT chanchristina haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT buellcrobin haplotypephasedsyntheticlongreadsfromshortreadsequencing
AT whiteheadtimothya haplotypephasedsyntheticlongreadsfromshortreadsequencing