Cargando…

Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples

Somatic mutations in cancer driver genes can help diagnosis, prognosis and treatment decisions. Formalin-fixed paraffin-embedded (FFPE) specimen is the main source of DNA for somatic mutation detection. To overcome constraints of DNA isolated from FFPE, we compared pyrosequencing and ddPCR analysis...

Descripción completa

Detalles Bibliográficos
Autores principales: Colozza-Gama, Gabriel A., Callegari, Fabiano, Bešič, Nikola, Paniza, Ana Carolina de J., Cerutti, Janete M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8209227/
https://www.ncbi.nlm.nih.gov/pubmed/34135377
http://dx.doi.org/10.1038/s41598-021-92014-4
_version_ 1783709082585137152
author Colozza-Gama, Gabriel A.
Callegari, Fabiano
Bešič, Nikola
Paniza, Ana Carolina de J.
Cerutti, Janete M.
author_facet Colozza-Gama, Gabriel A.
Callegari, Fabiano
Bešič, Nikola
Paniza, Ana Carolina de J.
Cerutti, Janete M.
author_sort Colozza-Gama, Gabriel A.
collection PubMed
description Somatic mutations in cancer driver genes can help diagnosis, prognosis and treatment decisions. Formalin-fixed paraffin-embedded (FFPE) specimen is the main source of DNA for somatic mutation detection. To overcome constraints of DNA isolated from FFPE, we compared pyrosequencing and ddPCR analysis for absolute quantification of BRAF V600E mutation in the DNA extracted from FFPE specimens and compared the results to the qualitative detection information obtained by Sanger Sequencing. Sanger sequencing was able to detect BRAF V600E mutation only when it was present in more than 15% total alleles. Although the sensitivity of ddPCR is higher than that observed for Sanger, it was less consistent than pyrosequencing, likely due to droplet classification bias of FFPE-derived DNA. To address the droplet allocation bias in ddPCR analysis, we have compared different algorithms for automated droplet classification and next correlated these findings with those obtained from pyrosequencing. By examining the addition of non-classifiable droplets (rain) in ddPCR, it was possible to obtain better qualitative classification of droplets and better quantitative classification compared to no rain droplets, when considering pyrosequencing results. Notable, only the Machine learning k-NN algorithm was able to automatically classify the samples, surpassing manual classification based on no-template controls, which shows promise in clinical practice.
format Online
Article
Text
id pubmed-8209227
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-82092272021-06-17 Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples Colozza-Gama, Gabriel A. Callegari, Fabiano Bešič, Nikola Paniza, Ana Carolina de J. Cerutti, Janete M. Sci Rep Article Somatic mutations in cancer driver genes can help diagnosis, prognosis and treatment decisions. Formalin-fixed paraffin-embedded (FFPE) specimen is the main source of DNA for somatic mutation detection. To overcome constraints of DNA isolated from FFPE, we compared pyrosequencing and ddPCR analysis for absolute quantification of BRAF V600E mutation in the DNA extracted from FFPE specimens and compared the results to the qualitative detection information obtained by Sanger Sequencing. Sanger sequencing was able to detect BRAF V600E mutation only when it was present in more than 15% total alleles. Although the sensitivity of ddPCR is higher than that observed for Sanger, it was less consistent than pyrosequencing, likely due to droplet classification bias of FFPE-derived DNA. To address the droplet allocation bias in ddPCR analysis, we have compared different algorithms for automated droplet classification and next correlated these findings with those obtained from pyrosequencing. By examining the addition of non-classifiable droplets (rain) in ddPCR, it was possible to obtain better qualitative classification of droplets and better quantitative classification compared to no rain droplets, when considering pyrosequencing results. Notable, only the Machine learning k-NN algorithm was able to automatically classify the samples, surpassing manual classification based on no-template controls, which shows promise in clinical practice. Nature Publishing Group UK 2021-06-16 /pmc/articles/PMC8209227/ /pubmed/34135377 http://dx.doi.org/10.1038/s41598-021-92014-4 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Colozza-Gama, Gabriel A.
Callegari, Fabiano
Bešič, Nikola
Paniza, Ana Carolina de J.
Cerutti, Janete M.
Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples
title Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples
title_full Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples
title_fullStr Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples
title_full_unstemmed Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples
title_short Machine learning algorithm improved automated droplet classification of ddPCR for detection of BRAF V600E in paraffin-embedded samples
title_sort machine learning algorithm improved automated droplet classification of ddpcr for detection of braf v600e in paraffin-embedded samples
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8209227/
https://www.ncbi.nlm.nih.gov/pubmed/34135377
http://dx.doi.org/10.1038/s41598-021-92014-4
work_keys_str_mv AT colozzagamagabriela machinelearningalgorithmimprovedautomateddropletclassificationofddpcrfordetectionofbrafv600einparaffinembeddedsamples
AT callegarifabiano machinelearningalgorithmimprovedautomateddropletclassificationofddpcrfordetectionofbrafv600einparaffinembeddedsamples
AT besicnikola machinelearningalgorithmimprovedautomateddropletclassificationofddpcrfordetectionofbrafv600einparaffinembeddedsamples
AT panizaanacarolinadej machinelearningalgorithmimprovedautomateddropletclassificationofddpcrfordetectionofbrafv600einparaffinembeddedsamples
AT ceruttijanetem machinelearningalgorithmimprovedautomateddropletclassificationofddpcrfordetectionofbrafv600einparaffinembeddedsamples