Cargando…

noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise

High-throughput sequencing enables an unprecedented resolution in transcript quantification, at the cost of magnifying the impact of technical noise. The consistent reduction of random background noise to capture functionally meaningful biological signals is still challenging. Intrinsic sequencing v...

Descripción completa

Detalles Bibliográficos
Autores principales: Moutsopoulos, Ilias, Maischak, Lukas, Lauzikaite, Elze, Vasquez Urbina, Sergio A, Williams, Eleanor C, Drost, Hajk-Georg, Mohorianu, Irina I
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8373073/
https://www.ncbi.nlm.nih.gov/pubmed/34076236
http://dx.doi.org/10.1093/nar/gkab433
_version_ 1783739880381087744
author Moutsopoulos, Ilias
Maischak, Lukas
Lauzikaite, Elze
Vasquez Urbina, Sergio A
Williams, Eleanor C
Drost, Hajk-Georg
Mohorianu, Irina I
author_facet Moutsopoulos, Ilias
Maischak, Lukas
Lauzikaite, Elze
Vasquez Urbina, Sergio A
Williams, Eleanor C
Drost, Hajk-Georg
Mohorianu, Irina I
author_sort Moutsopoulos, Ilias
collection PubMed
description High-throughput sequencing enables an unprecedented resolution in transcript quantification, at the cost of magnifying the impact of technical noise. The consistent reduction of random background noise to capture functionally meaningful biological signals is still challenging. Intrinsic sequencing variability introducing low-level expression variations can obscure patterns in downstream analyses. We introduce noisyR, a comprehensive noise filter to assess the variation in signal distribution and achieve an optimal information-consistency across replicates and samples; this selection also facilitates meaningful pattern recognition outside the background-noise range. noisyR is applicable to count matrices and sequencing data; it outputs sample-specific signal/noise thresholds and filtered expression matrices. We exemplify the effects of minimizing technical noise on several datasets, across various sequencing assays: coding, non-coding RNAs and interactions, at bulk and single-cell level. An immediate consequence of filtering out noise is the convergence of predictions (differential-expression calls, enrichment analyses and inference of gene regulatory networks) across different approaches.
format Online
Article
Text
id pubmed-8373073
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83730732021-08-19 noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise Moutsopoulos, Ilias Maischak, Lukas Lauzikaite, Elze Vasquez Urbina, Sergio A Williams, Eleanor C Drost, Hajk-Georg Mohorianu, Irina I Nucleic Acids Res Methods Online High-throughput sequencing enables an unprecedented resolution in transcript quantification, at the cost of magnifying the impact of technical noise. The consistent reduction of random background noise to capture functionally meaningful biological signals is still challenging. Intrinsic sequencing variability introducing low-level expression variations can obscure patterns in downstream analyses. We introduce noisyR, a comprehensive noise filter to assess the variation in signal distribution and achieve an optimal information-consistency across replicates and samples; this selection also facilitates meaningful pattern recognition outside the background-noise range. noisyR is applicable to count matrices and sequencing data; it outputs sample-specific signal/noise thresholds and filtered expression matrices. We exemplify the effects of minimizing technical noise on several datasets, across various sequencing assays: coding, non-coding RNAs and interactions, at bulk and single-cell level. An immediate consequence of filtering out noise is the convergence of predictions (differential-expression calls, enrichment analyses and inference of gene regulatory networks) across different approaches. Oxford University Press 2021-06-02 /pmc/articles/PMC8373073/ /pubmed/34076236 http://dx.doi.org/10.1093/nar/gkab433 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Moutsopoulos, Ilias
Maischak, Lukas
Lauzikaite, Elze
Vasquez Urbina, Sergio A
Williams, Eleanor C
Drost, Hajk-Georg
Mohorianu, Irina I
noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise
title noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise
title_full noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise
title_fullStr noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise
title_full_unstemmed noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise
title_short noisyR: enhancing biological signal in sequencing datasets by characterizing random technical noise
title_sort noisyr: enhancing biological signal in sequencing datasets by characterizing random technical noise
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8373073/
https://www.ncbi.nlm.nih.gov/pubmed/34076236
http://dx.doi.org/10.1093/nar/gkab433
work_keys_str_mv AT moutsopoulosilias noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise
AT maischaklukas noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise
AT lauzikaiteelze noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise
AT vasquezurbinasergioa noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise
AT williamseleanorc noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise
AT drosthajkgeorg noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise
AT mohorianuirinai noisyrenhancingbiologicalsignalinsequencingdatasetsbycharacterizingrandomtechnicalnoise