Cargando…

Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA

Advances in both high-throughput sequencing and whole-genome amplification (WGA) protocols have allowed genomes to be sequenced from femtograms of DNA, for example from individual cells or from precious clinical and archived samples. Using the highly curated Caenorhabditis elegans genome as a refere...

Descripción completa

Detalles Bibliográficos
Autores principales: Tsai, Isheng J., Hunt, Martin, Holroyd, Nancy, Huckvale, Thomas, Berriman, Matthew, Kikuchi, Taisei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4060946/
https://www.ncbi.nlm.nih.gov/pubmed/24353264
http://dx.doi.org/10.1093/dnares/dst054
_version_ 1782321421128564736
author Tsai, Isheng J.
Hunt, Martin
Holroyd, Nancy
Huckvale, Thomas
Berriman, Matthew
Kikuchi, Taisei
author_facet Tsai, Isheng J.
Hunt, Martin
Holroyd, Nancy
Huckvale, Thomas
Berriman, Matthew
Kikuchi, Taisei
author_sort Tsai, Isheng J.
collection PubMed
description Advances in both high-throughput sequencing and whole-genome amplification (WGA) protocols have allowed genomes to be sequenced from femtograms of DNA, for example from individual cells or from precious clinical and archived samples. Using the highly curated Caenorhabditis elegans genome as a reference, we have sequenced and identified errors and biases associated with Illumina library construction, library insert size, different WGA methods and genome features such as GC bias and simple repeat content. Detailed analysis of the reads from amplified libraries revealed characteristics suggesting that majority of amplified fragment ends are identical but inverted versions of each other. Read coverage in amplified libraries is correlated with both tandem and inverted repeat content, while GC content only influences sequencing in long-insert libraries. Nevertheless, single nucleotide polymorphism (SNP) calls and assembly metrics from reads in amplified libraries show comparable results with unamplified libraries. To utilize the full potential of WGA to reveal the real biological interest, this article highlights the importance of recognizing additional sources of errors from amplified sequence reads and discusses the potential implications in downstream analyses.
format Online
Article
Text
id pubmed-4060946
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-40609462014-06-18 Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA Tsai, Isheng J. Hunt, Martin Holroyd, Nancy Huckvale, Thomas Berriman, Matthew Kikuchi, Taisei DNA Res Full Papers Advances in both high-throughput sequencing and whole-genome amplification (WGA) protocols have allowed genomes to be sequenced from femtograms of DNA, for example from individual cells or from precious clinical and archived samples. Using the highly curated Caenorhabditis elegans genome as a reference, we have sequenced and identified errors and biases associated with Illumina library construction, library insert size, different WGA methods and genome features such as GC bias and simple repeat content. Detailed analysis of the reads from amplified libraries revealed characteristics suggesting that majority of amplified fragment ends are identical but inverted versions of each other. Read coverage in amplified libraries is correlated with both tandem and inverted repeat content, while GC content only influences sequencing in long-insert libraries. Nevertheless, single nucleotide polymorphism (SNP) calls and assembly metrics from reads in amplified libraries show comparable results with unamplified libraries. To utilize the full potential of WGA to reveal the real biological interest, this article highlights the importance of recognizing additional sources of errors from amplified sequence reads and discusses the potential implications in downstream analyses. Oxford University Press 2014-06 2013-12-18 /pmc/articles/PMC4060946/ /pubmed/24353264 http://dx.doi.org/10.1093/dnares/dst054 Text en © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Full Papers
Tsai, Isheng J.
Hunt, Martin
Holroyd, Nancy
Huckvale, Thomas
Berriman, Matthew
Kikuchi, Taisei
Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA
title Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA
title_full Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA
title_fullStr Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA
title_full_unstemmed Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA
title_short Summarizing Specific Profiles in Illumina Sequencing from Whole-Genome Amplified DNA
title_sort summarizing specific profiles in illumina sequencing from whole-genome amplified dna
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4060946/
https://www.ncbi.nlm.nih.gov/pubmed/24353264
http://dx.doi.org/10.1093/dnares/dst054
work_keys_str_mv AT tsaiishengj summarizingspecificprofilesinilluminasequencingfromwholegenomeamplifieddna
AT huntmartin summarizingspecificprofilesinilluminasequencingfromwholegenomeamplifieddna
AT holroydnancy summarizingspecificprofilesinilluminasequencingfromwholegenomeamplifieddna
AT huckvalethomas summarizingspecificprofilesinilluminasequencingfromwholegenomeamplifieddna
AT berrimanmatthew summarizingspecificprofilesinilluminasequencingfromwholegenomeamplifieddna
AT kikuchitaisei summarizingspecificprofilesinilluminasequencingfromwholegenomeamplifieddna