Cargando…

Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster

Short-read sequencing techniques provide the opportunity to capture genome-wide sequence data in a single experiment. A current challenge is to identify questions that shallow-depth genomic data can address successfully and to develop corresponding analytical methods that are statistically sound. He...

Descripción completa

Detalles Bibliográficos
Autores principales: Sackton, Timothy B., Kulathinal, Rob J., Bergman, Casey M., Quinlan, Aaron R., Dopman, Erik B., Carneiro, Mauricio, Marth, Gabor T., Hartl, Daniel L., Clark, Andrew G.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2839279/
https://www.ncbi.nlm.nih.gov/pubmed/20333214
http://dx.doi.org/10.1093/gbe/evp048
_version_ 1782178936024727552
author Sackton, Timothy B.
Kulathinal, Rob J.
Bergman, Casey M.
Quinlan, Aaron R.
Dopman, Erik B.
Carneiro, Mauricio
Marth, Gabor T.
Hartl, Daniel L.
Clark, Andrew G.
author_facet Sackton, Timothy B.
Kulathinal, Rob J.
Bergman, Casey M.
Quinlan, Aaron R.
Dopman, Erik B.
Carneiro, Mauricio
Marth, Gabor T.
Hartl, Daniel L.
Clark, Andrew G.
author_sort Sackton, Timothy B.
collection PubMed
description Short-read sequencing techniques provide the opportunity to capture genome-wide sequence data in a single experiment. A current challenge is to identify questions that shallow-depth genomic data can address successfully and to develop corresponding analytical methods that are statistically sound. Here, we apply the Roche/454 platform to survey natural variation in strains of Drosophila melanogaster from an African (n = 3) and a North American (n = 6) population. Reads were aligned to the reference D. melanogaster genomic assembly, single nucleotide polymorphisms were identified, and nucleotide variation was quantified genome wide. Simulations and empirical results suggest that nucleotide diversity can be accurately estimated from sparse data with as little as 0.2× coverage per line. The unbiased genomic sampling provided by random short-read sequencing also allows insight into distributions of transposable elements and copy number polymorphisms found within populations and demonstrates that short-read sequencing methods provide an efficient means to quantify variation in genome organization and content. Continued development of methods for statistical inference of shallow-depth genome-wide sequencing data will allow such sparse, partial data sets to become the norm in the emerging field of population genomics.
format Text
id pubmed-2839279
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-28392792010-03-22 Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster Sackton, Timothy B. Kulathinal, Rob J. Bergman, Casey M. Quinlan, Aaron R. Dopman, Erik B. Carneiro, Mauricio Marth, Gabor T. Hartl, Daniel L. Clark, Andrew G. Genome Biol Evol Research Articles Short-read sequencing techniques provide the opportunity to capture genome-wide sequence data in a single experiment. A current challenge is to identify questions that shallow-depth genomic data can address successfully and to develop corresponding analytical methods that are statistically sound. Here, we apply the Roche/454 platform to survey natural variation in strains of Drosophila melanogaster from an African (n = 3) and a North American (n = 6) population. Reads were aligned to the reference D. melanogaster genomic assembly, single nucleotide polymorphisms were identified, and nucleotide variation was quantified genome wide. Simulations and empirical results suggest that nucleotide diversity can be accurately estimated from sparse data with as little as 0.2× coverage per line. The unbiased genomic sampling provided by random short-read sequencing also allows insight into distributions of transposable elements and copy number polymorphisms found within populations and demonstrates that short-read sequencing methods provide an efficient means to quantify variation in genome organization and content. Continued development of methods for statistical inference of shallow-depth genome-wide sequencing data will allow such sparse, partial data sets to become the norm in the emerging field of population genomics. Oxford University Press 2009 2009-11-18 /pmc/articles/PMC2839279/ /pubmed/20333214 http://dx.doi.org/10.1093/gbe/evp048 Text en © The Author(s) 2009. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Sackton, Timothy B.
Kulathinal, Rob J.
Bergman, Casey M.
Quinlan, Aaron R.
Dopman, Erik B.
Carneiro, Mauricio
Marth, Gabor T.
Hartl, Daniel L.
Clark, Andrew G.
Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster
title Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster
title_full Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster
title_fullStr Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster
title_full_unstemmed Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster
title_short Population Genomic Inferences from Sparse High-Throughput Sequencing of Two Populations of Drosophila melanogaster
title_sort population genomic inferences from sparse high-throughput sequencing of two populations of drosophila melanogaster
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2839279/
https://www.ncbi.nlm.nih.gov/pubmed/20333214
http://dx.doi.org/10.1093/gbe/evp048
work_keys_str_mv AT sacktontimothyb populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT kulathinalrobj populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT bergmancaseym populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT quinlanaaronr populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT dopmanerikb populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT carneiromauricio populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT marthgabort populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT hartldaniell populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster
AT clarkandrewg populationgenomicinferencesfromsparsehighthroughputsequencingoftwopopulationsofdrosophilamelanogaster