Cargando…
Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms
BACKGROUND: A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527177/ https://www.ncbi.nlm.nih.gov/pubmed/22849334 http://dx.doi.org/10.1186/1471-2164-13-354 |
_version_ | 1782253667864281088 |
---|---|
author | You, Frank M Deal, Karin R Wang, Jirui Britton, Monica T Fass, Joseph N Lin, Dawei Dandekar, Abhaya M Leslie, Charles A Aradhya, Mallikarjuna Luo, Ming-Cheng Dvorak, Jan |
author_facet | You, Frank M Deal, Karin R Wang, Jirui Britton, Monica T Fass, Joseph N Lin, Dawei Dandekar, Abhaya M Leslie, Charles A Aradhya, Mallikarjuna Luo, Ming-Cheng Dvorak, Jan |
author_sort | You, Frank M |
collection | PubMed |
description | BACKGROUND: A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). RESULTS: The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar ‘Chandler’ were mapped to 48,661 ‘Chandler’ bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium BeadChip, which was used to genotype a walnut mapping population having ‘Chandler’ as one of the parents. Genotyping results were used to adjust the filtering parameters of the updated AGSNP pipeline. With the adjusted filtering criteria, 69.6% of SNPs discovered with the updated pipeline were real and could be mapped on the walnut genetic map. A total of 13,439 SNPs were discovered by BES re-sequencing. BESs harboring SNPs were in 677 FPC contigs covering 98% of the physical map of the walnut genome. CONCLUSION: The updated AGSNP pipeline is a versatile SNP discovery tool for a high-throughput, genome-wide SNP discovery in both autogamous and allogamous species. With this pipeline, a large set of SNPs were identified in a single walnut cultivar. |
format | Online Article Text |
id | pubmed-3527177 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35271772013-01-03 Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms You, Frank M Deal, Karin R Wang, Jirui Britton, Monica T Fass, Joseph N Lin, Dawei Dandekar, Abhaya M Leslie, Charles A Aradhya, Mallikarjuna Luo, Ming-Cheng Dvorak, Jan BMC Genomics Research Article BACKGROUND: A genome-wide set of single nucleotide polymorphisms (SNPs) is a valuable resource in genetic research and breeding and is usually developed by re-sequencing a genome. If a genome sequence is not available, an alternative strategy must be used. We previously reported the development of a pipeline (AGSNP) for genome-wide SNP discovery in coding sequences and other single-copy DNA without a complete genome sequence in self-pollinating (autogamous) plants. Here we updated this pipeline for SNP discovery in outcrossing (allogamous) species and demonstrated its efficacy in SNP discovery in walnut (Juglans regia L.). RESULTS: The first step in the original implementation of the AGSNP pipeline was the construction of a reference sequence and the identification of single-copy sequences in it. To identify single-copy sequences, multiple genome equivalents of short SOLiD reads of another individual were mapped to shallow genome coverage of long Sanger or Roche 454 reads making up the reference sequence. The relative depth of SOLiD reads was used to filter out repeated sequences from single-copy sequences in the reference sequence. The second step was a search for SNPs between SOLiD reads and the reference sequence. Polymorphism within the mapped SOLiD reads would have precluded SNP discovery; hence both individuals had to be homozygous. The AGSNP pipeline was updated here for using SOLiD or other type of short reads of a heterozygous individual for these two principal steps. A total of 32.6X walnut genome equivalents of SOLiD reads of vegetatively propagated walnut scion cultivar ‘Chandler’ were mapped to 48,661 ‘Chandler’ bacterial artificial chromosome (BAC) end sequences (BESs) produced by Sanger sequencing during the construction of a walnut physical map. A total of 22,799 putative SNPs were initially identified. A total of 6,000 Infinium II type SNPs evenly distributed along the walnut physical map were selected for the construction of an Infinium BeadChip, which was used to genotype a walnut mapping population having ‘Chandler’ as one of the parents. Genotyping results were used to adjust the filtering parameters of the updated AGSNP pipeline. With the adjusted filtering criteria, 69.6% of SNPs discovered with the updated pipeline were real and could be mapped on the walnut genetic map. A total of 13,439 SNPs were discovered by BES re-sequencing. BESs harboring SNPs were in 677 FPC contigs covering 98% of the physical map of the walnut genome. CONCLUSION: The updated AGSNP pipeline is a versatile SNP discovery tool for a high-throughput, genome-wide SNP discovery in both autogamous and allogamous species. With this pipeline, a large set of SNPs were identified in a single walnut cultivar. BioMed Central 2012-07-31 /pmc/articles/PMC3527177/ /pubmed/22849334 http://dx.doi.org/10.1186/1471-2164-13-354 Text en Copyright ©2012 You et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article You, Frank M Deal, Karin R Wang, Jirui Britton, Monica T Fass, Joseph N Lin, Dawei Dandekar, Abhaya M Leslie, Charles A Aradhya, Mallikarjuna Luo, Ming-Cheng Dvorak, Jan Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms |
title | Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms |
title_full | Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms |
title_fullStr | Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms |
title_full_unstemmed | Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms |
title_short | Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms |
title_sort | genome-wide snp discovery in walnut with an agsnp pipeline updated for snp discovery in allogamous organisms |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3527177/ https://www.ncbi.nlm.nih.gov/pubmed/22849334 http://dx.doi.org/10.1186/1471-2164-13-354 |
work_keys_str_mv | AT youfrankm genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT dealkarinr genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT wangjirui genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT brittonmonicat genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT fassjosephn genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT lindawei genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT dandekarabhayam genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT lesliecharlesa genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT aradhyamallikarjuna genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT luomingcheng genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms AT dvorakjan genomewidesnpdiscoveryinwalnutwithanagsnppipelineupdatedforsnpdiscoveryinallogamousorganisms |