Cargando…

Fast, Ungapped Reads Mapping Using Squid

Advances in Next Generation Sequencing technologies allow us to inspect and unlock the genome to a level of detail that was unimaginable only a few decades ago. Omics-based studies are casting a light on the patterns and determinants of disease conditions in populations, as well as on the influence...

Descripción completa

Detalles Bibliográficos
Autores principales:	Riccardi, Christopher, Innocenti, Gabriel, Fondi, Marco, Bacci, Giovanni
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103773/ https://www.ncbi.nlm.nih.gov/pubmed/35564837 http://dx.doi.org/10.3390/ijerph19095442

_version_	1784707632435560448
author	Riccardi, Christopher Innocenti, Gabriel Fondi, Marco Bacci, Giovanni
author_facet	Riccardi, Christopher Innocenti, Gabriel Fondi, Marco Bacci, Giovanni
author_sort	Riccardi, Christopher
collection	PubMed
description	Advances in Next Generation Sequencing technologies allow us to inspect and unlock the genome to a level of detail that was unimaginable only a few decades ago. Omics-based studies are casting a light on the patterns and determinants of disease conditions in populations, as well as on the influence of microbial communities on human health, just to name a few. Through increasing volumes of sequencing information, for example, it is possible to compare genomic features and analyze the modulation of the transcriptome under different environmental stimuli. Although protocols for NGS preparation are intended to leave little to no space for contamination of any kind, a noticeable fraction of sequencing reads still may not uniquely represent what was intended to be sequenced in the first place. If a natural consequence of a sequencing sample is to assess the presence of features of interest by mapping the obtained reads to a genome of reference, sometimes it is useful to determine the fraction of those that do not map, or that map discordantly, and store this information to a new file for subsequent analyses. Here we propose a new mapper, which we called Squid, that among other accessory functionalities finds and returns sequencing reads that match or do not match to a reference sequence database in any orientation. We encourage the use of Squid prior to any quantification pipeline to assess, for instance, the presence of contaminants, especially in RNA-Seq experiments.
format	Online Article Text
id	pubmed-9103773
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-91037732022-05-14 Fast, Ungapped Reads Mapping Using Squid Riccardi, Christopher Innocenti, Gabriel Fondi, Marco Bacci, Giovanni Int J Environ Res Public Health Article Advances in Next Generation Sequencing technologies allow us to inspect and unlock the genome to a level of detail that was unimaginable only a few decades ago. Omics-based studies are casting a light on the patterns and determinants of disease conditions in populations, as well as on the influence of microbial communities on human health, just to name a few. Through increasing volumes of sequencing information, for example, it is possible to compare genomic features and analyze the modulation of the transcriptome under different environmental stimuli. Although protocols for NGS preparation are intended to leave little to no space for contamination of any kind, a noticeable fraction of sequencing reads still may not uniquely represent what was intended to be sequenced in the first place. If a natural consequence of a sequencing sample is to assess the presence of features of interest by mapping the obtained reads to a genome of reference, sometimes it is useful to determine the fraction of those that do not map, or that map discordantly, and store this information to a new file for subsequent analyses. Here we propose a new mapper, which we called Squid, that among other accessory functionalities finds and returns sequencing reads that match or do not match to a reference sequence database in any orientation. We encourage the use of Squid prior to any quantification pipeline to assess, for instance, the presence of contaminants, especially in RNA-Seq experiments. MDPI 2022-04-29 /pmc/articles/PMC9103773/ /pubmed/35564837 http://dx.doi.org/10.3390/ijerph19095442 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Riccardi, Christopher Innocenti, Gabriel Fondi, Marco Bacci, Giovanni Fast, Ungapped Reads Mapping Using Squid
title	Fast, Ungapped Reads Mapping Using Squid
title_full	Fast, Ungapped Reads Mapping Using Squid
title_fullStr	Fast, Ungapped Reads Mapping Using Squid
title_full_unstemmed	Fast, Ungapped Reads Mapping Using Squid
title_short	Fast, Ungapped Reads Mapping Using Squid
title_sort	fast, ungapped reads mapping using squid
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9103773/ https://www.ncbi.nlm.nih.gov/pubmed/35564837 http://dx.doi.org/10.3390/ijerph19095442
work_keys_str_mv	AT riccardichristopher fastungappedreadsmappingusingsquid AT innocentigabriel fastungappedreadsmappingusingsquid AT fondimarco fastungappedreadsmappingusingsquid AT baccigiovanni fastungappedreadsmappingusingsquid

Fast, Ungapped Reads Mapping Using Squid

Ejemplares similares