Cargando…

The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population

Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and...

Descripción completa

Detalles Bibliográficos
Autores principales: Lack, Justin B., Cardeno, Charis M., Crepeau, Marc W., Taylor, William, Corbett-Detig, Russell B., Stevens, Kristian A., Langley, Charles H., Pool, John E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391556/
https://www.ncbi.nlm.nih.gov/pubmed/25631317
http://dx.doi.org/10.1534/genetics.115.174664
_version_ 1782365840392323072
author Lack, Justin B.
Cardeno, Charis M.
Crepeau, Marc W.
Taylor, William
Corbett-Detig, Russell B.
Stevens, Kristian A.
Langley, Charles H.
Pool, John E.
author_facet Lack, Justin B.
Cardeno, Charis M.
Crepeau, Marc W.
Taylor, William
Corbett-Detig, Russell B.
Stevens, Kristian A.
Langley, Charles H.
Pool, John E.
author_sort Lack, Justin B.
collection PubMed
description Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets.
format Online
Article
Text
id pubmed-4391556
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-43915562015-04-10 The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population Lack, Justin B. Cardeno, Charis M. Crepeau, Marc W. Taylor, William Corbett-Detig, Russell B. Stevens, Kristian A. Langley, Charles H. Pool, John E. Genetics Investigations Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets. Genetics Society of America 2015-04 2015-01-27 /pmc/articles/PMC4391556/ /pubmed/25631317 http://dx.doi.org/10.1534/genetics.115.174664 Text en Copyright © 2015 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Lack, Justin B.
Cardeno, Charis M.
Crepeau, Marc W.
Taylor, William
Corbett-Detig, Russell B.
Stevens, Kristian A.
Langley, Charles H.
Pool, John E.
The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
title The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
title_full The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
title_fullStr The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
title_full_unstemmed The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
title_short The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
title_sort drosophila genome nexus: a population genomic resource of 623 drosophila melanogaster genomes, including 197 from a single ancestral range population
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391556/
https://www.ncbi.nlm.nih.gov/pubmed/25631317
http://dx.doi.org/10.1534/genetics.115.174664
work_keys_str_mv AT lackjustinb thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT cardenocharism thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT crepeaumarcw thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT taylorwilliam thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT corbettdetigrussellb thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT stevenskristiana thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT langleycharlesh thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT pooljohne thedrosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT lackjustinb drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT cardenocharism drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT crepeaumarcw drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT taylorwilliam drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT corbettdetigrussellb drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT stevenskristiana drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT langleycharlesh drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation
AT pooljohne drosophilagenomenexusapopulationgenomicresourceof623drosophilamelanogastergenomesincluding197fromasingleancestralrangepopulation