Cargando…
Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves
INTRODUCTION: High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4659556/ https://www.ncbi.nlm.nih.gov/pubmed/26605797 http://dx.doi.org/10.1371/journal.pone.0143295 |
_version_ | 1782402639425699840 |
---|---|
author | Kanderian, Sami Jiang, Lingxia Knight, Ivor |
author_facet | Kanderian, Sami Jiang, Lingxia Knight, Ivor |
author_sort | Kanderian, Sami |
collection | PubMed |
description | INTRODUCTION: High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst. MATERIALS AND METHODS: We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9*2, CYP2C9*3 samples in triplicate, and 49 MTHFR c.665C>T samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately. RESULTS: Automated genotyping called 100% of VKORC1, CYP2C9*3 and MTHFR c.665C>T samples correctly. 97.5% of CYP2C9*2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability. CONCLUSIONS: We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument. |
format | Online Article Text |
id | pubmed-4659556 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-46595562015-12-02 Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves Kanderian, Sami Jiang, Lingxia Knight, Ivor PLoS One Research Article INTRODUCTION: High Resolution Melting (HRM) following PCR has been used to identify DNA genotypes. Fluorescent dyes bounded to double strand DNA lose their fluorescence with increasing temperature, yielding different signatures for different genotypes. Recent software tools have been made available to aid in the distinction of different genotypes, but they are not fully automated, used only for research purposes, or require some level of interaction or confirmation from an analyst. MATERIALS AND METHODS: We describe a fully automated machine learning software algorithm that classifies unknown genotypes. Dynamic melt curves are transformed to multidimensional clusters of points whereby a training set is used to establish the distribution of genotype clusters. Subsequently, probabilistic and statistical methods were used to classify the genotypes of unknown DNA samples on 4 different assays (40 VKORC1, CYP2C9*2, CYP2C9*3 samples in triplicate, and 49 MTHFR c.665C>T samples in triplicate) run on the Roche LC480. Melt curves of each of the triplicates were genotyped separately. RESULTS: Automated genotyping called 100% of VKORC1, CYP2C9*3 and MTHFR c.665C>T samples correctly. 97.5% of CYP2C9*2 melt curves were genotyped correctly with the remaining 2.5% given a no call due to the inability to decipher 3 melt curves in close proximity as either homozygous mutant or wild-type with greater than 99.5% posterior probability. CONCLUSIONS: We demonstrate the ability to fully automate DNA genotyping from HRM curves systematically and accurately without requiring any user interpretation or interaction with the data. Visualization of genotype clusters and quantification of the expected misclassification rate is also available to provide feedback to assay scientists and engineers as changes are made to the assay or instrument. Public Library of Science 2015-11-25 /pmc/articles/PMC4659556/ /pubmed/26605797 http://dx.doi.org/10.1371/journal.pone.0143295 Text en © 2015 Kanderian et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Kanderian, Sami Jiang, Lingxia Knight, Ivor Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves |
title | Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves |
title_full | Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves |
title_fullStr | Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves |
title_full_unstemmed | Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves |
title_short | Automated Classification and Cluster Visualization of Genotypes Derived from High Resolution Melt Curves |
title_sort | automated classification and cluster visualization of genotypes derived from high resolution melt curves |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4659556/ https://www.ncbi.nlm.nih.gov/pubmed/26605797 http://dx.doi.org/10.1371/journal.pone.0143295 |
work_keys_str_mv | AT kanderiansami automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves AT jianglingxia automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves AT knightivor automatedclassificationandclustervisualizationofgenotypesderivedfromhighresolutionmeltcurves |