Cargando…

Improved Placement of Multi-mapping Small RNAs

High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq librari...

Descripción completa

Detalles Bibliográficos
Autores principales: Johnson, Nathan R., Yeoh, Jonathan M., Coruh, Ceyda, Axtell, Michael J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938663/
https://www.ncbi.nlm.nih.gov/pubmed/27175019
http://dx.doi.org/10.1534/g3.116.030452
_version_ 1782441901960462336
author Johnson, Nathan R.
Yeoh, Jonathan M.
Coruh, Ceyda
Axtell, Michael J.
author_facet Johnson, Nathan R.
Yeoh, Jonathan M.
Coruh, Ceyda
Axtell, Michael J.
author_sort Johnson, Nathan R.
collection PubMed
description High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq libraries often have a high proportion of reads that align to multiple genomic locations, which makes determining their true origins difficult. Commonly used sRNA-seq alignment methods result in either very low precision (choosing an alignment at random), or sensitivity (ignoring multi-mapping reads). Here, we describe and test an sRNA-seq alignment strategy that uses local genomic context to guide decisions on proper placements of multi-mapped sRNA-seq reads. Tests using simulated sRNA-seq data demonstrated that this local-weighting method outperforms other alignment strategies using three different plant genomes. Experimental analyses with real sRNA-seq data also indicate superior performance of local-weighting methods for both plant miRNAs and heterochromatic siRNAs. The local-weighting methods we have developed are implemented as part of the sRNA-seq analysis program ShortStack, which is freely available under a general public license. Improved genome alignments of sRNA-seq data should increase the quality of downstream analyses and genome annotation efforts.
format Online
Article
Text
id pubmed-4938663
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-49386632016-07-19 Improved Placement of Multi-mapping Small RNAs Johnson, Nathan R. Yeoh, Jonathan M. Coruh, Ceyda Axtell, Michael J. G3 (Bethesda) Investigations High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq libraries often have a high proportion of reads that align to multiple genomic locations, which makes determining their true origins difficult. Commonly used sRNA-seq alignment methods result in either very low precision (choosing an alignment at random), or sensitivity (ignoring multi-mapping reads). Here, we describe and test an sRNA-seq alignment strategy that uses local genomic context to guide decisions on proper placements of multi-mapped sRNA-seq reads. Tests using simulated sRNA-seq data demonstrated that this local-weighting method outperforms other alignment strategies using three different plant genomes. Experimental analyses with real sRNA-seq data also indicate superior performance of local-weighting methods for both plant miRNAs and heterochromatic siRNAs. The local-weighting methods we have developed are implemented as part of the sRNA-seq analysis program ShortStack, which is freely available under a general public license. Improved genome alignments of sRNA-seq data should increase the quality of downstream analyses and genome annotation efforts. Genetics Society of America 2016-05-11 /pmc/articles/PMC4938663/ /pubmed/27175019 http://dx.doi.org/10.1534/g3.116.030452 Text en Copyright © 2016 Johnson et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Johnson, Nathan R.
Yeoh, Jonathan M.
Coruh, Ceyda
Axtell, Michael J.
Improved Placement of Multi-mapping Small RNAs
title Improved Placement of Multi-mapping Small RNAs
title_full Improved Placement of Multi-mapping Small RNAs
title_fullStr Improved Placement of Multi-mapping Small RNAs
title_full_unstemmed Improved Placement of Multi-mapping Small RNAs
title_short Improved Placement of Multi-mapping Small RNAs
title_sort improved placement of multi-mapping small rnas
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938663/
https://www.ncbi.nlm.nih.gov/pubmed/27175019
http://dx.doi.org/10.1534/g3.116.030452
work_keys_str_mv AT johnsonnathanr improvedplacementofmultimappingsmallrnas
AT yeohjonathanm improvedplacementofmultimappingsmallrnas
AT coruhceyda improvedplacementofmultimappingsmallrnas
AT axtellmichaelj improvedplacementofmultimappingsmallrnas