Cargando…

mmannot: How to improve small–RNA annotation?

High-throughput sequencing makes it possible to provide the genome-wide distribution of small non coding RNAs in a single experiment, and contributed greatly to the identification and understanding of these RNAs in the last decade. Small non coding RNAs gather a wide collection of classes, such as m...

Descripción completa

Detalles Bibliográficos
Autores principales: Zytnicki, Matthias, Gaspin, Christine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7255610/
https://www.ncbi.nlm.nih.gov/pubmed/32463818
http://dx.doi.org/10.1371/journal.pone.0231738
_version_ 1783539762407145472
author Zytnicki, Matthias
Gaspin, Christine
author_facet Zytnicki, Matthias
Gaspin, Christine
author_sort Zytnicki, Matthias
collection PubMed
description High-throughput sequencing makes it possible to provide the genome-wide distribution of small non coding RNAs in a single experiment, and contributed greatly to the identification and understanding of these RNAs in the last decade. Small non coding RNAs gather a wide collection of classes, such as microRNAs, tRNA-derived fragments, small nucleolar RNAs and small nuclear RNAs, to name a few. As usual in RNA-seq studies, the sequencing step is followed by a feature quantification step: when a genome is available, the reads are aligned to the genome, their genomic positions are compared to the already available annotations, and the corresponding features are quantified. However, problem arises when many reads map at several positions and while different strategies exist to circumvent this problem, all of them are biased. In this article, we present a new strategy that compares all the reads that map at several positions, and their annotations when available. In many cases, all the hits co-localize with the same feature annotation (a duplicated miRNA or a duplicated gene, for instance). When different annotations exist for a given read, we propose to merge existing features and provide the counts for the merged features. This new strategy has been implemented in a tool, mmannot, freely available at https://github.com/mzytnicki/mmannot.
format Online
Article
Text
id pubmed-7255610
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-72556102020-06-08 mmannot: How to improve small–RNA annotation? Zytnicki, Matthias Gaspin, Christine PLoS One Research Article High-throughput sequencing makes it possible to provide the genome-wide distribution of small non coding RNAs in a single experiment, and contributed greatly to the identification and understanding of these RNAs in the last decade. Small non coding RNAs gather a wide collection of classes, such as microRNAs, tRNA-derived fragments, small nucleolar RNAs and small nuclear RNAs, to name a few. As usual in RNA-seq studies, the sequencing step is followed by a feature quantification step: when a genome is available, the reads are aligned to the genome, their genomic positions are compared to the already available annotations, and the corresponding features are quantified. However, problem arises when many reads map at several positions and while different strategies exist to circumvent this problem, all of them are biased. In this article, we present a new strategy that compares all the reads that map at several positions, and their annotations when available. In many cases, all the hits co-localize with the same feature annotation (a duplicated miRNA or a duplicated gene, for instance). When different annotations exist for a given read, we propose to merge existing features and provide the counts for the merged features. This new strategy has been implemented in a tool, mmannot, freely available at https://github.com/mzytnicki/mmannot. Public Library of Science 2020-05-28 /pmc/articles/PMC7255610/ /pubmed/32463818 http://dx.doi.org/10.1371/journal.pone.0231738 Text en © 2020 Zytnicki, Gaspin http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Zytnicki, Matthias
Gaspin, Christine
mmannot: How to improve small–RNA annotation?
title mmannot: How to improve small–RNA annotation?
title_full mmannot: How to improve small–RNA annotation?
title_fullStr mmannot: How to improve small–RNA annotation?
title_full_unstemmed mmannot: How to improve small–RNA annotation?
title_short mmannot: How to improve small–RNA annotation?
title_sort mmannot: how to improve small–rna annotation?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7255610/
https://www.ncbi.nlm.nih.gov/pubmed/32463818
http://dx.doi.org/10.1371/journal.pone.0231738
work_keys_str_mv AT zytnickimatthias mmannothowtoimprovesmallrnaannotation
AT gaspinchristine mmannothowtoimprovesmallrnaannotation