Cargando…

Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation

BACKGROUND: High-throughput custom designed genotyping arrays are a valuable resource for biologically focused research studies and increasingly for validation of variation predicted by next-generation sequencing (NGS) technologies. We investigate the Illumina GoldenGate chemistry using custom desig...

Descripción completa

Detalles Bibliográficos
Autores principales: Tindall, Elizabeth A, Petersen, Desiree C, Nikolaysen, Stina, Miller, Webb, Schuster, Stephan C, Hayes, Vanessa M
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2848685/
https://www.ncbi.nlm.nih.gov/pubmed/20175893
http://dx.doi.org/10.1186/1756-0500-3-39
_version_ 1782179709703946240
author Tindall, Elizabeth A
Petersen, Desiree C
Nikolaysen, Stina
Miller, Webb
Schuster, Stephan C
Hayes, Vanessa M
author_facet Tindall, Elizabeth A
Petersen, Desiree C
Nikolaysen, Stina
Miller, Webb
Schuster, Stephan C
Hayes, Vanessa M
author_sort Tindall, Elizabeth A
collection PubMed
description BACKGROUND: High-throughput custom designed genotyping arrays are a valuable resource for biologically focused research studies and increasingly for validation of variation predicted by next-generation sequencing (NGS) technologies. We investigate the Illumina GoldenGate chemistry using custom designed VeraCode and sentrix array matrix (SAM) assays for each of these applications, respectively. We highlight applications for interpretation of Illumina generated genotype cluster plots to maximise data inclusion and reduce genotyping errors. FINDINGS: We illustrate the dramatic effect of outliers in genotype calling and data interpretation, as well as suggest simple means to avoid genotyping errors. Furthermore we present this platform as a successful method for two-cluster rare or non-autosomal variant calling. The success of high-throughput technologies to accurately call rare variants will become an essential feature for future association studies. Finally, we highlight additional advantages of the Illumina GoldenGate chemistry in generating unusually segregated cluster plots that identify potential NGS generated sequencing error resulting from minimal coverage. CONCLUSIONS: We demonstrate the importance of visually inspecting genotype cluster plots generated by the Illumina software and issue warnings regarding commonly accepted quality control parameters. In addition to suggesting applications to minimise data exclusion, we propose that the Illumina cluster plots may be helpful in identifying potential in-put sequence errors, particularly important for studies to validate NGS generated variation.
format Text
id pubmed-2848685
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28486852010-04-02 Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation Tindall, Elizabeth A Petersen, Desiree C Nikolaysen, Stina Miller, Webb Schuster, Stephan C Hayes, Vanessa M BMC Res Notes Short Report BACKGROUND: High-throughput custom designed genotyping arrays are a valuable resource for biologically focused research studies and increasingly for validation of variation predicted by next-generation sequencing (NGS) technologies. We investigate the Illumina GoldenGate chemistry using custom designed VeraCode and sentrix array matrix (SAM) assays for each of these applications, respectively. We highlight applications for interpretation of Illumina generated genotype cluster plots to maximise data inclusion and reduce genotyping errors. FINDINGS: We illustrate the dramatic effect of outliers in genotype calling and data interpretation, as well as suggest simple means to avoid genotyping errors. Furthermore we present this platform as a successful method for two-cluster rare or non-autosomal variant calling. The success of high-throughput technologies to accurately call rare variants will become an essential feature for future association studies. Finally, we highlight additional advantages of the Illumina GoldenGate chemistry in generating unusually segregated cluster plots that identify potential NGS generated sequencing error resulting from minimal coverage. CONCLUSIONS: We demonstrate the importance of visually inspecting genotype cluster plots generated by the Illumina software and issue warnings regarding commonly accepted quality control parameters. In addition to suggesting applications to minimise data exclusion, we propose that the Illumina cluster plots may be helpful in identifying potential in-put sequence errors, particularly important for studies to validate NGS generated variation. BioMed Central 2010-02-22 /pmc/articles/PMC2848685/ /pubmed/20175893 http://dx.doi.org/10.1186/1756-0500-3-39 Text en Copyright ©2010 Hayes et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Short Report
Tindall, Elizabeth A
Petersen, Desiree C
Nikolaysen, Stina
Miller, Webb
Schuster, Stephan C
Hayes, Vanessa M
Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation
title Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation
title_full Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation
title_fullStr Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation
title_full_unstemmed Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation
title_short Interpretation of custom designed Illumina genotype cluster plots for targeted association studies and next-generation sequence validation
title_sort interpretation of custom designed illumina genotype cluster plots for targeted association studies and next-generation sequence validation
topic Short Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2848685/
https://www.ncbi.nlm.nih.gov/pubmed/20175893
http://dx.doi.org/10.1186/1756-0500-3-39
work_keys_str_mv AT tindallelizabetha interpretationofcustomdesignedilluminagenotypeclusterplotsfortargetedassociationstudiesandnextgenerationsequencevalidation
AT petersendesireec interpretationofcustomdesignedilluminagenotypeclusterplotsfortargetedassociationstudiesandnextgenerationsequencevalidation
AT nikolaysenstina interpretationofcustomdesignedilluminagenotypeclusterplotsfortargetedassociationstudiesandnextgenerationsequencevalidation
AT millerwebb interpretationofcustomdesignedilluminagenotypeclusterplotsfortargetedassociationstudiesandnextgenerationsequencevalidation
AT schusterstephanc interpretationofcustomdesignedilluminagenotypeclusterplotsfortargetedassociationstudiesandnextgenerationsequencevalidation
AT hayesvanessam interpretationofcustomdesignedilluminagenotypeclusterplotsfortargetedassociationstudiesandnextgenerationsequencevalidation