
A pipeline for high throughput detection and mapping of SNPs from EST databases

Single nucleotide polymorphisms (SNPs) represent the most abundant type of genetic variation that can be used as molecular markers. The SNPs that are hidden in sequence databases can be unlocked using bioinformatic tools. For efficient application of these SNPs, the sequence set should be error-free...


Bibliographic Details
Main authors: Anithakumari, A. M., Tang, Jifeng, van Eck, Herman J., Visser, Richard G. F., Leunissen, Jack A. M., Vosman, Ben, van der Linden, C. Gerard
Format: Text
Language: English
Published: Springer Netherlands 2010
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2869401/
https://www.ncbi.nlm.nih.gov/pubmed/20502512
http://dx.doi.org/10.1007/s11032-009-9377-5
author Anithakumari, A. M.
Tang, Jifeng
van Eck, Herman J.
Visser, Richard G. F.
Leunissen, Jack A. M.
Vosman, Ben
van der Linden, C. Gerard
author_sort Anithakumari, A. M.
collection PubMed
description Single nucleotide polymorphisms (SNPs) represent the most abundant type of genetic variation that can be used as molecular markers. The SNPs that are hidden in sequence databases can be unlocked using bioinformatic tools. For efficient application of these SNPs, the sequence set should be as error-free as possible, target single loci, and be suitable for the SNP scoring platform of choice. We have developed a pipeline to effectively mine SNPs from public EST databases with or without quality information using QualitySNP software, select reliable SNPs, and prepare the loci for analysis on the Illumina GoldenGate genotyping platform. The applicability of the pipeline was demonstrated using publicly available potato EST data, genotyping individuals from two diploid mapping populations and subsequently mapping the SNP markers (putative genes) in both populations. Over 7000 reliable SNPs were identified that met the criteria for genotyping on the GoldenGate platform. Of the 384 SNPs on the SNP array, approximately 12% dropped out. For the two potato mapping populations, 165 and 185 segregating SNP loci could be mapped on the respective genetic maps, illustrating the effectiveness of our pipeline for SNP selection and validation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11032-009-9377-5) contains supplementary material, which is available to authorized users.
format Text
id pubmed-2869401
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-2869401 2010-05-24 A pipeline for high throughput detection and mapping of SNPs from EST databases Anithakumari, A. M. Tang, Jifeng van Eck, Herman J. Visser, Richard G. F. Leunissen, Jack A. M. Vosman, Ben van der Linden, C. Gerard Mol Breed Article Single nucleotide polymorphisms (SNPs) represent the most abundant type of genetic variation that can be used as molecular markers. The SNPs that are hidden in sequence databases can be unlocked using bioinformatic tools. For efficient application of these SNPs, the sequence set should be as error-free as possible, target single loci, and be suitable for the SNP scoring platform of choice. We have developed a pipeline to effectively mine SNPs from public EST databases with or without quality information using QualitySNP software, select reliable SNPs, and prepare the loci for analysis on the Illumina GoldenGate genotyping platform. The applicability of the pipeline was demonstrated using publicly available potato EST data, genotyping individuals from two diploid mapping populations and subsequently mapping the SNP markers (putative genes) in both populations. Over 7000 reliable SNPs were identified that met the criteria for genotyping on the GoldenGate platform. Of the 384 SNPs on the SNP array, approximately 12% dropped out. For the two potato mapping populations, 165 and 185 segregating SNP loci could be mapped on the respective genetic maps, illustrating the effectiveness of our pipeline for SNP selection and validation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s11032-009-9377-5) contains supplementary material, which is available to authorized users.
Springer Netherlands 2010-01-20 2010 /pmc/articles/PMC2869401/ /pubmed/20502512 http://dx.doi.org/10.1007/s11032-009-9377-5 Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
title A pipeline for high throughput detection and mapping of SNPs from EST databases
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2869401/
https://www.ncbi.nlm.nih.gov/pubmed/20502512
http://dx.doi.org/10.1007/s11032-009-9377-5