Cargando…

Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves

INTRODUCTION: High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available...

Descripción completa

Detalles Bibliográficos
Autores principales: Kanderian, Sami, Jiang, Lingxia, Knight, Ivor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4659556/
https://www.ncbi.nlm.nih.gov/pubmed/26605797
http://dx.doi.org/10.1371/journal.pone.0143295
_version_ 1782402639425699840
author Kanderian, Sami
Jiang, Lingxia
Knight, Ivor
author_facet Kanderian, Sami
Jiang, Lingxia
Knight, Ivor
author_sort Kanderian, Sami
collection PubMed
description INTRODUCTION: High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst. MATERIALS AND METHODS: We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9*2, CYP2C9*3 samples in triplicate, and 49 MTHFR c.665C>T samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately. RESULTS: Automated genotyping called 100% of VKORC1, CYP2C9*3 and MTHFR c.665C>T samples correctly. 97.5% of CYP2C9*2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability. CONCLUSIONS: We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument.
format Online
Article
Text
id pubmed-4659556
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-46595562015-12-02 Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves Kanderian, Sami Jiang, Lingxia Knight, Ivor PLoS One Research Article INTRODUCTION: High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst. MATERIALS AND METHODS: We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9*2, CYP2C9*3 samples in triplicate, and 49 MTHFR c.665C>T samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately. RESULTS: Automated genotyping called 100% of VKORC1, CYP2C9*3 and MTHFR c.665C>T samples correctly. 97.5% of CYP2C9*2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability. CONCLUSIONS: We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument. Public Library of Science 2015-11-25 /pmc/articles/PMC4659556/ /pubmed/26605797 http://dx.doi.org/10.1371/journal.pone.0143295 Text en © 2015 Kanderian et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Kanderian, Sami
Jiang, Lingxia
Knight, Ivor
Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
title Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
title_full Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
title_fullStr Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
title_full_unstemmed Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
title_short Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
title_sort automated classification and cluster visualization of genotypes derived from high resolution melt curves
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4659556/
https://www.ncbi.nlm.nih.gov/pubmed/26605797
http://dx.doi.org/10.1371/journal.pone.0143295
work_keys_str_mv AT kanderiansami automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves
AT jianglingxia automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves
AT knightivor automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves