Cargando…

Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lag...

Descripción completa

Detalles Bibliográficos
Autores principales:	Brennan, Caitriona, Salido, Rodolfo A., Belda-Ferre, Pedro, Bryant, MacKenzie, Cowart, Charles, Tiu, Maria D., González, Antonio, McDonald, Daniel, Tribelhorn, Caitlin, Zarrinpar, Amir, Knight, Rob
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Society for Microbiology 2023
Materias:	Observation
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469589/ https://www.ncbi.nlm.nih.gov/pubmed/37350611 http://dx.doi.org/10.1128/msystems.00006-23

_version_	1785099474708725760
author	Brennan, Caitriona Salido, Rodolfo A. Belda-Ferre, Pedro Bryant, MacKenzie Cowart, Charles Tiu, Maria D. González, Antonio McDonald, Daniel Tribelhorn, Caitlin Zarrinpar, Amir Knight, Rob
author_facet	Brennan, Caitriona Salido, Rodolfo A. Belda-Ferre, Pedro Bryant, MacKenzie Cowart, Charles Tiu, Maria D. González, Antonio McDonald, Daniel Tribelhorn, Caitlin Zarrinpar, Amir Knight, Rob
author_sort	Brennan, Caitriona
collection	PubMed
description	Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCE: High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology.
format	Online Article Text
id	pubmed-10469589
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	American Society for Microbiology
record_format	MEDLINE/PubMed
spelling	pubmed-104695892023-09-01 Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution Brennan, Caitriona Salido, Rodolfo A. Belda-Ferre, Pedro Bryant, MacKenzie Cowart, Charles Tiu, Maria D. González, Antonio McDonald, Daniel Tribelhorn, Caitlin Zarrinpar, Amir Knight, Rob mSystems Observation Next-generation sequencing technologies have enabled many advances across diverse areas of biology, with many benefiting from increased sample size. Although the cost of running next-generation sequencing instruments has dropped substantially over time, the cost of sample preparation methods has lagged behind. To counter this, researchers have adapted library miniaturization protocols and large sample pools to maximize the number of samples that can be prepared by a certain amount of reagents and sequenced in a single run. However, due to high variability of sample quality, over and underrepresentation of samples in a sequencing run has become a major issue in high-throughput sequencing. This leads to misinterpretation of results due to increased noise, and additional time and cost rerunning underrepresented samples. To overcome this problem, we present a normalization method that uses shallow iSeq sequencing to accurately inform pooling volumes based on read distribution. This method is superior to the widely used fluorometry methods, which cannot specifically target adapter-ligated molecules that contribute to sequencing output. Our normalization method not only quantifies adapter-ligated molecules but also allows normalization of feature space; for example, we can normalize to reads of interest such as non-ribosomal reads. As a result, this normalization method improves the efficiency of high-throughput next-generation sequencing by reducing noise and producing higher average reads per sample with more even sequencing depth. IMPORTANCE: High-throughput next generation sequencing (NGS) has significantly contributed to the field of genomics; however, further improvements can maximize the potential of this important tool. Uneven sequencing of samples in a multiplexed run is a common issue that leads to unexpected extra costs or low-quality data. To mitigate this problem, we introduce a normalization method based on read counts rather than library concentration. This method allows for an even distribution of features of interest across samples, improving the statistical power of data sets and preventing the financial loss associated with resequencing libraries. This method optimizes NGS, which already has huge importance across many areas of biology. American Society for Microbiology 2023-06-23 /pmc/articles/PMC10469589/ /pubmed/37350611 http://dx.doi.org/10.1128/msystems.00006-23 Text en Copyright © 2023 Brennan et al. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Observation Brennan, Caitriona Salido, Rodolfo A. Belda-Ferre, Pedro Bryant, MacKenzie Cowart, Charles Tiu, Maria D. González, Antonio McDonald, Daniel Tribelhorn, Caitlin Zarrinpar, Amir Knight, Rob Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
title	Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
title_full	Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
title_fullStr	Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
title_full_unstemmed	Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
title_short	Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
title_sort	maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution
topic	Observation
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10469589/ https://www.ncbi.nlm.nih.gov/pubmed/37350611 http://dx.doi.org/10.1128/msystems.00006-23
work_keys_str_mv	AT brennancaitriona maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT salidorodolfoa maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT beldaferrepedro maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT bryantmackenzie maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT cowartcharles maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT tiumariad maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT gonzalezantonio maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT mcdonalddaniel maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT tribelhorncaitlin maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT zarrinparamir maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution AT knightrob maximizingthepotentialofhighthroughputnextgenerationsequencingthroughprecisenormalizationbasedonreadcountdistribution

Maximizing the potential of high-throughput next-generation sequencing through precise normalization based on read count distribution

Ejemplares similares