Cargando…
Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design
BACKGROUND: In moderate-throughput SNP genotyping there was a gap in the workflow, between choosing a set of SNPs and submitting their sequences to proprietary assay design software, which was not met by existing software. Retrieval and formatting of sequences flanking each SNP, prior to assay desig...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2711078/ https://www.ncbi.nlm.nih.gov/pubmed/19523221 http://dx.doi.org/10.1186/1471-2105-10-180 |
_version_ | 1782169406425530368 |
---|---|
author | Field, Helen I Scollen, Serena A Luccarini, Craig Baynes, Caroline Morrison, Jonathan Dunning, Alison M Easton, Douglas F Pharoah, Paul DP |
author_facet | Field, Helen I Scollen, Serena A Luccarini, Craig Baynes, Caroline Morrison, Jonathan Dunning, Alison M Easton, Douglas F Pharoah, Paul DP |
author_sort | Field, Helen I |
collection | PubMed |
description | BACKGROUND: In moderate-throughput SNP genotyping there was a gap in the workflow, between choosing a set of SNPs and submitting their sequences to proprietary assay design software, which was not met by existing software. Retrieval and formatting of sequences flanking each SNP, prior to assay design, becomes rate-limiting for more than about ten SNPs, especially if annotated for repetitive regions and adjacent variations. We routinely process up to 50 SNPs at once. IMPLEMENTATION: We created Seq4SNPs, a web-based, walk-away software that can process one to several hundred SNPs given rs numbers as input. It outputs a file of fully annotated sequences formatted for one of three proprietary design softwares: TaqMan's Primer-By-Design FileBuilder, Sequenom's iPLEX or SNPstream's Autoprimer, as well as unannotated fasta sequences. We found genotyping assays to be inhibited by repetitive sequences or the presence of additional variations flanking the SNP under test, and in multiplexes, repetitive sequence flanking one SNP adversely affects multiple assays. Assay design software programs avoid such regions if the input sequences are appropriately annotated, so we used Seq4SNPs to provide suitably annotated input sequences, and improved our genotyping success rate. Adjacent SNPs can also be avoided, by annotating sequences used as input for primer design. CONCLUSION: The accuracy of annotation by Seq4SNPs is significantly better than manual annotation (P < 1e-5). Using Seq4SNPs to incorporate all annotation for additional SNPs and repetitive elements into sequences, for genotyping assay designer software, minimizes assay failure at the design stage, reducing the cost of genotyping. Seq4SNPs provides a rapid route for replacement of poor test SNP sequences. We routinely use this software for assay sequence preparation. Seq4SNPs is available as a service at and , currently for human SNPs, but easily extended to include any species in dbSNP. |
format | Text |
id | pubmed-2711078 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27110782009-07-16 Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design Field, Helen I Scollen, Serena A Luccarini, Craig Baynes, Caroline Morrison, Jonathan Dunning, Alison M Easton, Douglas F Pharoah, Paul DP BMC Bioinformatics Software BACKGROUND: In moderate-throughput SNP genotyping there was a gap in the workflow, between choosing a set of SNPs and submitting their sequences to proprietary assay design software, which was not met by existing software. Retrieval and formatting of sequences flanking each SNP, prior to assay design, becomes rate-limiting for more than about ten SNPs, especially if annotated for repetitive regions and adjacent variations. We routinely process up to 50 SNPs at once. IMPLEMENTATION: We created Seq4SNPs, a web-based, walk-away software that can process one to several hundred SNPs given rs numbers as input. It outputs a file of fully annotated sequences formatted for one of three proprietary design softwares: TaqMan's Primer-By-Design FileBuilder, Sequenom's iPLEX or SNPstream's Autoprimer, as well as unannotated fasta sequences. We found genotyping assays to be inhibited by repetitive sequences or the presence of additional variations flanking the SNP under test, and in multiplexes, repetitive sequence flanking one SNP adversely affects multiple assays. Assay design software programs avoid such regions if the input sequences are appropriately annotated, so we used Seq4SNPs to provide suitably annotated input sequences, and improved our genotyping success rate. Adjacent SNPs can also be avoided, by annotating sequences used as input for primer design. CONCLUSION: The accuracy of annotation by Seq4SNPs is significantly better than manual annotation (P < 1e-5). Using Seq4SNPs to incorporate all annotation for additional SNPs and repetitive elements into sequences, for genotyping assay designer software, minimizes assay failure at the design stage, reducing the cost of genotyping. Seq4SNPs provides a rapid route for replacement of poor test SNP sequences. We routinely use this software for assay sequence preparation. Seq4SNPs is available as a service at and , currently for human SNPs, but easily extended to include any species in dbSNP. BioMed Central 2009-06-12 /pmc/articles/PMC2711078/ /pubmed/19523221 http://dx.doi.org/10.1186/1471-2105-10-180 Text en Copyright © 2009 Field et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Field, Helen I Scollen, Serena A Luccarini, Craig Baynes, Caroline Morrison, Jonathan Dunning, Alison M Easton, Douglas F Pharoah, Paul DP Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design |
title | Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design |
title_full | Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design |
title_fullStr | Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design |
title_full_unstemmed | Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design |
title_short | Seq4SNPs: new software for retrieval of multiple, accurately annotated DNA sequences, ready formatted for SNP assay design |
title_sort | seq4snps: new software for retrieval of multiple, accurately annotated dna sequences, ready formatted for snp assay design |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2711078/ https://www.ncbi.nlm.nih.gov/pubmed/19523221 http://dx.doi.org/10.1186/1471-2105-10-180 |
work_keys_str_mv | AT fieldheleni seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT scollenserenaa seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT luccarinicraig seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT baynescaroline seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT morrisonjonathan seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT dunningalisonm seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT eastondouglasf seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign AT pharoahpauldp seq4snpsnewsoftwareforretrievalofmultipleaccuratelyannotateddnasequencesreadyformattedforsnpassaydesign |