Cargando…

PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2

The optimization of resources for research in developing countries forces us to consider strategies in the wet lab that allow the reuse of molecular biology reagents to reduce costs. In this study, we used linear regression as a method for predictive modeling of coverage depth given the number of Mi...

Descripción completa

Detalles Bibliográficos
Autores principales: Valencia-Valencia, David E., Lopez-Alvarez, Diana, Rivera-Franco, Nelson, Castillo, Andres, Piña, Johan S., Pardo, Carlos A., Parra, Beatriz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744141/
https://www.ncbi.nlm.nih.gov/pubmed/36518292
http://dx.doi.org/10.7717/peerj.14425
_version_ 1784848854636560384
author Valencia-Valencia, David E.
Lopez-Alvarez, Diana
Rivera-Franco, Nelson
Castillo, Andres
Piña, Johan S.
Pardo, Carlos A.
Parra, Beatriz
author_facet Valencia-Valencia, David E.
Lopez-Alvarez, Diana
Rivera-Franco, Nelson
Castillo, Andres
Piña, Johan S.
Pardo, Carlos A.
Parra, Beatriz
author_sort Valencia-Valencia, David E.
collection PubMed
description The optimization of resources for research in developing countries forces us to consider strategies in the wet lab that allow the reuse of molecular biology reagents to reduce costs. In this study, we used linear regression as a method for predictive modeling of coverage depth given the number of MinION reads sequenced to define the optimum number of reads necessary to obtain >200X coverage depth with a good lineage-clade assignment of SARS-CoV-2 genomes. The research aimed to create and implement a model based on machine learning algorithms to predict different variables (e.g., coverage depth) given the number of MinION reads produced by Nanopore sequencing to maximize the yield of high-quality SARS-CoV-2 genomes, determine the best sequencing runtime, and to be able to reuse the flow cell with the remaining nanopores available for sequencing in a new run. The best accuracy was −0.98 according to the R squared performance metric of the models. A demo version is available at https://genomicdashboard.herokuapp.com/.
format Online
Article
Text
id pubmed-9744141
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-97441412022-12-13 PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2 Valencia-Valencia, David E. Lopez-Alvarez, Diana Rivera-Franco, Nelson Castillo, Andres Piña, Johan S. Pardo, Carlos A. Parra, Beatriz PeerJ Bioinformatics The optimization of resources for research in developing countries forces us to consider strategies in the wet lab that allow the reuse of molecular biology reagents to reduce costs. In this study, we used linear regression as a method for predictive modeling of coverage depth given the number of MinION reads sequenced to define the optimum number of reads necessary to obtain >200X coverage depth with a good lineage-clade assignment of SARS-CoV-2 genomes. The research aimed to create and implement a model based on machine learning algorithms to predict different variables (e.g., coverage depth) given the number of MinION reads produced by Nanopore sequencing to maximize the yield of high-quality SARS-CoV-2 genomes, determine the best sequencing runtime, and to be able to reuse the flow cell with the remaining nanopores available for sequencing in a new run. The best accuracy was −0.98 according to the R squared performance metric of the models. A demo version is available at https://genomicdashboard.herokuapp.com/. PeerJ Inc. 2022-11-30 /pmc/articles/PMC9744141/ /pubmed/36518292 http://dx.doi.org/10.7717/peerj.14425 Text en ©2022 Valencia-Valencia et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Valencia-Valencia, David E.
Lopez-Alvarez, Diana
Rivera-Franco, Nelson
Castillo, Andres
Piña, Johan S.
Pardo, Carlos A.
Parra, Beatriz
PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
title PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
title_full PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
title_fullStr PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
title_full_unstemmed PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
title_short PredictION: a predictive model to establish the performance of Oxford sequencing reads of SARS-CoV-2
title_sort prediction: a predictive model to establish the performance of oxford sequencing reads of sars-cov-2
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9744141/
https://www.ncbi.nlm.nih.gov/pubmed/36518292
http://dx.doi.org/10.7717/peerj.14425
work_keys_str_mv AT valenciavalenciadavide predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2
AT lopezalvarezdiana predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2
AT riverafranconelson predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2
AT castilloandres predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2
AT pinajohans predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2
AT pardocarlosa predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2
AT parrabeatriz predictionapredictivemodeltoestablishtheperformanceofoxfordsequencingreadsofsarscov2