Cargando…
PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
The optimization of resources for research in developing countries forces us to consider strategies in the wet lab that allow the reuse of molecular biology reagents to reduce costs. In this study, we used linear regression as a method for predictive modeling of coverage depth given the number of Mi...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744141/ https://www.ncbi.nlm.nih.gov/pubmed/36518292 http://dx.doi.org/10.7717/peerj.14425 |
_version_ | 1784848854636560384 |
---|---|
author | Valencia-Valencia, David E. Lopez-Alvarez, Diana Rivera-Franco, Nelson Castillo, Andres Piña, Johan S. Pardo, Carlos A. Parra, Beatriz |
author_facet | Valencia-Valencia, David E. Lopez-Alvarez, Diana Rivera-Franco, Nelson Castillo, Andres Piña, Johan S. Pardo, Carlos A. Parra, Beatriz |
author_sort | Valencia-Valencia, David E. |
collection | PubMed |
description | The optimization of resources for research in developing countries forces us to consider strategies in the wet lab that allow the reuse of molecular biology reagents to reduce costs. In this study, we used linear regression as a method for predictive modeling of coverage depth given the number of MinION reads sequenced to define the optimum number of reads necessary to obtain >200X coverage depth with a good lineage-clade assignment of SARS-CoV-2 genomes. The research aimed to create and implement a model based on machine learning algorithms to predict different variables (e.g., coverage depth) given the number of MinION reads produced by Nanopore sequencing to maximize the yield of high-quality SARS-CoV-2 genomes, determine the best sequencing runtime, and to be able to reuse the flow cell with the remaining nanopores available for sequencing in a new run. The best accuracy was −0.98 according to the R squared performance metric of the models. A demo version is available at https://genomicdashboard.herokuapp.com/. |
format | Online Article Text |
id | pubmed-9744141 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97441412022-12-13 PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 Valencia-Valencia, David E. Lopez-Alvarez, Diana Rivera-Franco, Nelson Castillo, Andres Piña, Johan S. Pardo, Carlos A. Parra, Beatriz PeerJ Bioinformatics The optimization of resources for research in developing countries forces us to consider strategies in the wet lab that allow the reuse of molecular biology reagents to reduce costs. In this study, we used linear regression as a method for predictive modeling of coverage depth given the number of MinION reads sequenced to define the optimum number of reads necessary to obtain >200X coverage depth with a good lineage-clade assignment of SARS-CoV-2 genomes. The research aimed to create and implement a model based on machine learning algorithms to predict different variables (e.g., coverage depth) given the number of MinION reads produced by Nanopore sequencing to maximize the yield of high-quality SARS-CoV-2 genomes, determine the best sequencing runtime, and to be able to reuse the flow cell with the remaining nanopores available for sequencing in a new run. The best accuracy was −0.98 according to the R squared performance metric of the models. A demo version is available at https://genomicdashboard.herokuapp.com/. PeerJ Inc. 2022-11-30 /pmc/articles/PMC9744141/ /pubmed/36518292 http://dx.doi.org/10.7717/peerj.14425 Text en ©2022 Valencia-Valencia et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Bioinformatics Valencia-Valencia, David E. Lopez-Alvarez, Diana Rivera-Franco, Nelson Castillo, Andres Piña, Johan S. Pardo, Carlos A. Parra, Beatriz PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 |
title | PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 |
title_full | PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 |
title_fullStr | PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 |
title_full_unstemmed | PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 |
title_short | PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 |
title_sort | prediction: a predictive model to establish the performance of oxford sequencing reads of sars-cov-2 |
topic | Bioinformatics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744141/ https://www.ncbi.nlm.nih.gov/pubmed/36518292 http://dx.doi.org/10.7717/peerj.14425 |
work_keys_str_mv | AT valenciavalenciadavide predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 AT lopezalvarezdiana predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 AT riverafranconelson predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 AT castilloandres predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 AT pinajohans predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 AT pardocarlosa predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 AT parrabeatriz predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2 |