Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data

Real-time reverse transcription (RT) PCR is the gold standard for detecting Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), owing to its sensitivity and specificity, thereby meeting the demand for the rising number of cases. The scarcity of trained molecular biologists for analyzing PC...

Bibliographic Details
Main Authors: Villarreal-González, Reynaldo, Acosta-Hoyos, Antonio J., Garzon-Ochoa, Jaime A., Galán-Freyle, Nataly J., Amar-Sepúlveda, Paola, Pacheco-Londoño, Leonardo C.
Format: Online Article Text
Language: English
Published: MDPI 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7793083/
https://www.ncbi.nlm.nih.gov/pubmed/33374492
http://dx.doi.org/10.3390/molecules26010020
_version_ 1783633911993073664
author Villarreal-González, Reynaldo
Acosta-Hoyos, Antonio J.
Garzon-Ochoa, Jaime A.
Galán-Freyle, Nataly J.
Amar-Sepúlveda, Paola
Pacheco-Londoño, Leonardo C.
author_facet Villarreal-González, Reynaldo
Acosta-Hoyos, Antonio J.
Garzon-Ochoa, Jaime A.
Galán-Freyle, Nataly J.
Amar-Sepúlveda, Paola
Pacheco-Londoño, Leonardo C.
author_sort Villarreal-González, Reynaldo
collection PubMed
description Real-time reverse transcription (RT) PCR is the gold standard for detecting Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), owing to its sensitivity and specificity, thereby meeting the demand for the rising number of cases. The scarcity of trained molecular biologists for analyzing PCR results makes data verification a challenge. Artificial intelligence (AI) was designed to ease verification, by detecting atypical profiles in PCR curves caused by contamination or artifacts. Four classes of simulated real-time RT-PCR curves were generated, namely, positive, early, no, and abnormal amplifications. Machine learning (ML) models were generated and tested using small amounts of data from each class. The best model was used for classifying the big data obtained by the Virology Laboratory of Simon Bolivar University from real-time RT-PCR curves for SARS-CoV-2, and the model was retrained and implemented in software that correlated patient data with test and AI diagnoses. The best strategy for AI included a binary classification model, which was generated from simulated data, where data analyzed by the first model were classified as either positive or negative and abnormal. To differentiate between negative and abnormal, the data were reevaluated using the second model. In the first model, the data required preanalysis through a combination of preprocessing steps. The early amplification class was eliminated from the models because the number of cases in the big data was negligible. ML models can be created from simulated data using minimum available information. During analysis, changes or variations can be incorporated by generating simulated data, avoiding the incorporation of large amounts of experimental data encompassing all possible changes. For diagnosing SARS-CoV-2, this type of AI is critical for optimizing PCR tests because it enables rapid diagnosis and reduces false positives. Our method can also be used for other types of molecular analyses.
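The two-stage strategy in the description (a first model separating positive curves from the rest, a second separating negative from abnormal) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' models: the curve shapes, the noise level, the two features (`max_step`, `amplitude`), and the midpoint-threshold classifier are all hypothetical stand-ins chosen to keep the example self-contained, and the early amplification class is omitted, as in the final models described above.

```python
import math
import random
from statistics import fmean

CYCLES = 40  # assumed number of PCR cycles


def simulate_curve(kind, rng):
    """Simulate one fluorescence curve; the shapes are illustrative assumptions."""
    def noise():
        return rng.gauss(0.0, 0.005)
    if kind == "positive":   # sigmoid amplification with a random midpoint cycle
        mid = rng.uniform(18.0, 32.0)
        return [1.0 / (1.0 + math.exp(-0.6 * (c - mid))) + noise()
                for c in range(CYCLES)]
    if kind == "negative":   # flat baseline, no amplification
        return [noise() for _ in range(CYCLES)]
    if kind == "abnormal":   # linear drift without a plateau (one possible artifact)
        slope = rng.uniform(0.01, 0.02)
        return [slope * c + noise() for c in range(CYCLES)]
    raise ValueError(f"unknown class: {kind}")


def max_step(curve):
    """Largest single-cycle increase; large for sigmoid amplification."""
    return max(b - a for a, b in zip(curve, curve[1:]))


def amplitude(curve):
    """Mean of the last five points minus mean of the first five."""
    return fmean(curve[-5:]) - fmean(curve[:5])


def midpoint(low_values, high_values):
    """Threshold halfway between two class means (a 1-D nearest-centroid rule)."""
    return (fmean(low_values) + fmean(high_values)) / 2.0


def train(rng, n_per_class=200):
    """Fit both stage thresholds from simulated curves only."""
    data = {k: [simulate_curve(k, rng) for _ in range(n_per_class)]
            for k in ("positive", "negative", "abnormal")}
    rest = data["negative"] + data["abnormal"]
    t1 = midpoint([max_step(c) for c in rest],
                  [max_step(c) for c in data["positive"]])
    t2 = midpoint([amplitude(c) for c in data["negative"]],
                  [amplitude(c) for c in data["abnormal"]])
    return t1, t2


def classify(curve, t1, t2):
    """Stage 1: positive vs. the rest. Stage 2: negative vs. abnormal."""
    if max_step(curve) > t1:
        return "positive"
    return "abnormal" if amplitude(curve) > t2 else "negative"


t1, t2 = train(random.Random(0))
eval_rng = random.Random(1)
for kind in ("positive", "negative", "abnormal"):
    hits = sum(classify(simulate_curve(kind, eval_rng), t1, t2) == kind
               for _ in range(100))
    print(kind, hits, "/ 100")
```

Splitting the decision into two binary stages lets each stage use the feature that best separates its two groups; here that is the steepest single-cycle rise for stage 1 and the overall signal gain for stage 2.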
format Online
Article
Text
id pubmed-7793083
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-7793083 2021-01-09 Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data Villarreal-González, Reynaldo Acosta-Hoyos, Antonio J. Garzon-Ochoa, Jaime A. Galán-Freyle, Nataly J. Amar-Sepúlveda, Paola Pacheco-Londoño, Leonardo C. Molecules Article Real-time reverse transcription (RT) PCR is the gold standard for detecting Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), owing to its sensitivity and specificity, thereby meeting the demand for the rising number of cases. The scarcity of trained molecular biologists for analyzing PCR results makes data verification a challenge. Artificial intelligence (AI) was designed to ease verification, by detecting atypical profiles in PCR curves caused by contamination or artifacts. Four classes of simulated real-time RT-PCR curves were generated, namely, positive, early, no, and abnormal amplifications. Machine learning (ML) models were generated and tested using small amounts of data from each class. The best model was used for classifying the big data obtained by the Virology Laboratory of Simon Bolivar University from real-time RT-PCR curves for SARS-CoV-2, and the model was retrained and implemented in software that correlated patient data with test and AI diagnoses. The best strategy for AI included a binary classification model, which was generated from simulated data, where data analyzed by the first model were classified as either positive or negative and abnormal. To differentiate between negative and abnormal, the data were reevaluated using the second model. In the first model, the data required preanalysis through a combination of preprocessing steps. The early amplification class was eliminated from the models because the number of cases in the big data was negligible. ML models can be created from simulated data using minimum available information. During analysis, changes or variations can be incorporated by generating simulated data, avoiding the incorporation of large amounts of experimental data encompassing all possible changes. For diagnosing SARS-CoV-2, this type of AI is critical for optimizing PCR tests because it enables rapid diagnosis and reduces false positives. Our method can also be used for other types of molecular analyses. MDPI 2020-12-23 /pmc/articles/PMC7793083/ /pubmed/33374492 http://dx.doi.org/10.3390/molecules26010020 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Villarreal-González, Reynaldo
Acosta-Hoyos, Antonio J.
Garzon-Ochoa, Jaime A.
Galán-Freyle, Nataly J.
Amar-Sepúlveda, Paola
Pacheco-Londoño, Leonardo C.
Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data
title Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data
title_full Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data
title_fullStr Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data
title_full_unstemmed Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data
title_short Anomaly Identification during Polymerase Chain Reaction for Detecting SARS-CoV-2 Using Artificial Intelligence Trained from Simulated Data
title_sort anomaly identification during polymerase chain reaction for detecting sars-cov-2 using artificial intelligence trained from simulated data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7793083/
https://www.ncbi.nlm.nih.gov/pubmed/33374492
http://dx.doi.org/10.3390/molecules26010020
work_keys_str_mv AT villarrealgonzalezreynaldo anomalyidentificationduringpolymerasechainreactionfordetectingsarscov2usingartificialintelligencetrainedfromsimulateddata
AT acostahoyosantonioj anomalyidentificationduringpolymerasechainreactionfordetectingsarscov2usingartificialintelligencetrainedfromsimulateddata
AT garzonochoajaimea anomalyidentificationduringpolymerasechainreactionfordetectingsarscov2usingartificialintelligencetrainedfromsimulateddata
AT galanfreylenatalyj anomalyidentificationduringpolymerasechainreactionfordetectingsarscov2usingartificialintelligencetrainedfromsimulateddata
AT amarsepulvedapaola anomalyidentificationduringpolymerasechainreactionfordetectingsarscov2usingartificialintelligencetrainedfromsimulateddata
AT pachecolondonoleonardoc anomalyidentificationduringpolymerasechainreactionfordetectingsarscov2usingartificialintelligencetrainedfromsimulateddata