Cargando…

Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection

BACKGROUND: The application of next generation sequencing technologies and bioinformatic scripts to identify high frequency SNPs distributed throughout the peach genome is described. Three peach genomes were sequenced using Roche 454 and Illumina/Solexa technologies to obtain long contigs for alignm...

Descripción completa

Detalles Bibliográficos
Autores principales: Ahmad, Riaz, Parfitt, Dan E, Fass, Joseph, Ogundiwin, Ebenezer, Dhingra, Amit, Gradziel, Thomas M, Lin, Dawei, Joshi, Nikhil A, Martinez-Garcia, Pedro J, Crisosto, Carlos H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3253712/
https://www.ncbi.nlm.nih.gov/pubmed/22108025
http://dx.doi.org/10.1186/1471-2164-12-569
_version_ 1782220765428449280
author Ahmad, Riaz
Parfitt, Dan E
Fass, Joseph
Ogundiwin, Ebenezer
Dhingra, Amit
Gradziel, Thomas M
Lin, Dawei
Joshi, Nikhil A
Martinez-Garcia, Pedro J
Crisosto, Carlos H
author_facet Ahmad, Riaz
Parfitt, Dan E
Fass, Joseph
Ogundiwin, Ebenezer
Dhingra, Amit
Gradziel, Thomas M
Lin, Dawei
Joshi, Nikhil A
Martinez-Garcia, Pedro J
Crisosto, Carlos H
author_sort Ahmad, Riaz
collection PubMed
description BACKGROUND: The application of next generation sequencing technologies and bioinformatic scripts to identify high frequency SNPs distributed throughout the peach genome is described. Three peach genomes were sequenced using Roche 454 and Illumina/Solexa technologies to obtain long contigs for alignment to the draft 'Lovell' peach sequence as well as sufficient depth of coverage for 'in silico' SNP discovery. DESCRIPTION: The sequences were aligned to the 'Lovell' peach genome released April 01, 2010 by the International Peach Genome Initiative (IPGI). 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle' were sequenced to add SNPs segregating in two breeding populations, Pop DF ('Dr. Davis' × 'F8, 1-42') and Pop DG ('Dr. Davis' × 'Georgia Belle'). Roche 454 sequencing produced 980,000 total reads with 236 Mb sequence for 'Dr. Davis' and 735,000 total reads with 172 Mb sequence for 'F8, 1-42'. 84 bp × 84 bp paired end Illumina/Solexa sequences yielded 25.5, 21.4, 25.5 million sequences for 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle', respectively. BWA/SAMtools were used for alignment of raw reads and SNP detection, with custom PERL scripts for SNP filtering. Velvet's Columbus module was used for sequence assembly. Comparison of aligned and overlapping sequences from both Roche 454 and Illumina/Solexa resulted in the selection of 6654 high quality SNPs for 'Dr. Davis' vs. 'F8, 1-42' and 'Georgia Belle', distributed on eight major peach genome scaffolds as defined from the 'Lovell' assembly. CONCLUSION: The eight scaffolds contained about 215-225 Mb of peach genomic sequences with one SNP/~ 40,000 bases. All sequences from Roche 454 and Illumina/Solexa have been submitted to NCBI for public use in the Short Read Archive database. SNPs have been deposited in the NCBI SNP database.
format Online
Article
Text
id pubmed-3253712
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32537122012-01-10 Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection Ahmad, Riaz Parfitt, Dan E Fass, Joseph Ogundiwin, Ebenezer Dhingra, Amit Gradziel, Thomas M Lin, Dawei Joshi, Nikhil A Martinez-Garcia, Pedro J Crisosto, Carlos H BMC Genomics Database BACKGROUND: The application of next generation sequencing technologies and bioinformatic scripts to identify high frequency SNPs distributed throughout the peach genome is described. Three peach genomes were sequenced using Roche 454 and Illumina/Solexa technologies to obtain long contigs for alignment to the draft 'Lovell' peach sequence as well as sufficient depth of coverage for 'in silico' SNP discovery. DESCRIPTION: The sequences were aligned to the 'Lovell' peach genome released April 01, 2010 by the International Peach Genome Initiative (IPGI). 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle' were sequenced to add SNPs segregating in two breeding populations, Pop DF ('Dr. Davis' × 'F8, 1-42') and Pop DG ('Dr. Davis' × 'Georgia Belle'). Roche 454 sequencing produced 980,000 total reads with 236 Mb sequence for 'Dr. Davis' and 735,000 total reads with 172 Mb sequence for 'F8, 1-42'. 84 bp × 84 bp paired end Illumina/Solexa sequences yielded 25.5, 21.4, 25.5 million sequences for 'Dr. Davis', 'F8, 1-42' and 'Georgia Belle', respectively. BWA/SAMtools were used for alignment of raw reads and SNP detection, with custom PERL scripts for SNP filtering. Velvet's Columbus module was used for sequence assembly. Comparison of aligned and overlapping sequences from both Roche 454 and Illumina/Solexa resulted in the selection of 6654 high quality SNPs for 'Dr. Davis' vs. 'F8, 1-42' and 'Georgia Belle', distributed on eight major peach genome scaffolds as defined from the 'Lovell' assembly. CONCLUSION: The eight scaffolds contained about 215-225 Mb of peach genomic sequences with one SNP/~ 40,000 bases. All sequences from Roche 454 and Illumina/Solexa have been submitted to NCBI for public use in the Short Read Archive database. SNPs have been deposited in the NCBI SNP database. BioMed Central 2011-11-22 /pmc/articles/PMC3253712/ /pubmed/22108025 http://dx.doi.org/10.1186/1471-2164-12-569 Text en Copyright ©2011 Ahmad et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database
Ahmad, Riaz
Parfitt, Dan E
Fass, Joseph
Ogundiwin, Ebenezer
Dhingra, Amit
Gradziel, Thomas M
Lin, Dawei
Joshi, Nikhil A
Martinez-Garcia, Pedro J
Crisosto, Carlos H
Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection
title Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection
title_full Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection
title_fullStr Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection
title_full_unstemmed Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection
title_short Whole genome sequencing of peach (Prunus persica L.) for SNP identification and selection
title_sort whole genome sequencing of peach (prunus persica l.) for snp identification and selection
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3253712/
https://www.ncbi.nlm.nih.gov/pubmed/22108025
http://dx.doi.org/10.1186/1471-2164-12-569
work_keys_str_mv AT ahmadriaz wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT parfittdane wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT fassjoseph wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT ogundiwinebenezer wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT dhingraamit wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT gradzielthomasm wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT lindawei wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT joshinikhila wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT martinezgarciapedroj wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection
AT crisostocarlosh wholegenomesequencingofpeachprunuspersicalforsnpidentificationandselection