The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population
Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and...
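Main Authors: | Lack, Justin B.; Cardeno, Charis M.; Crepeau, Marc W.; Taylor, William; Corbett-Detig, Russell B.; Stevens, Kristian A.; Langley, Charles H.; Pool, John E. |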
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Genetics Society of America 2015 |
Subjects: | Investigations |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391556/ https://www.ncbi.nlm.nih.gov/pubmed/25631317 http://dx.doi.org/10.1534/genetics.115.174664 |
_version_ | 1782365840392323072 |
---|---|
author | Lack, Justin B. Cardeno, Charis M. Crepeau, Marc W. Taylor, William Corbett-Detig, Russell B. Stevens, Kristian A. Langley, Charles H. Pool, John E. |
author_sort | Lack, Justin B. |
collection | PubMed |
description | Hundreds of wild-derived Drosophila melanogaster genomes have been published, but rigorous comparisons across data sets are precluded by differences in alignment methodology. The most common approach to reference-based genome assembly is a single round of alignment followed by quality filtering and variant detection. We evaluated variations and extensions of this approach and settled on an assembly strategy that utilizes two alignment programs and incorporates both substitutions and short indels to construct an updated reference for a second round of mapping prior to final variant detection. Utilizing this approach, we reassembled published D. melanogaster population genomic data sets and added unpublished genomes from several sub-Saharan populations. Most notably, we present aligned data from phase 3 of the Drosophila Population Genomics Project (DPGP3), which provides 197 genomes from a single ancestral range population of D. melanogaster (from Zambia). The large sample size, high genetic diversity, and potentially simpler demographic history of the DPGP3 sample will make this a highly valuable resource for fundamental population genetic research. The complete set of assemblies described here, termed the Drosophila Genome Nexus, presently comprises 623 consistently aligned genomes and is publicly available in multiple formats with supporting documentation and bioinformatic tools. This resource will greatly facilitate population genomic analysis in this model species by reducing the methodological differences between data sets. |
format | Online Article Text |
id | pubmed-4391556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-4391556 2015-04-10 Genetics Investigations Genetics Society of America 2015-04 2015-01-27 /pmc/articles/PMC4391556/ /pubmed/25631317 http://dx.doi.org/10.1534/genetics.115.174664 Text en Copyright © 2015 by the Genetics Society of America. Available freely online through the author-supported open access option. |
title | The Drosophila Genome Nexus: A Population Genomic Resource of 623 Drosophila melanogaster Genomes, Including 197 from a Single Ancestral Range Population |
title_sort | drosophila genome nexus: a population genomic resource of 623 drosophila melanogaster genomes, including 197 from a single ancestral range population |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4391556/ https://www.ncbi.nlm.nih.gov/pubmed/25631317 http://dx.doi.org/10.1534/genetics.115.174664 |