Cargando…
Improved Placement of Multi-mapping Small RNAs
High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq librari...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938663/ https://www.ncbi.nlm.nih.gov/pubmed/27175019 http://dx.doi.org/10.1534/g3.116.030452 |
_version_ | 1782441901960462336 |
---|---|
author | Johnson, Nathan R. Yeoh, Jonathan M. Coruh, Ceyda Axtell, Michael J. |
author_facet | Johnson, Nathan R. Yeoh, Jonathan M. Coruh, Ceyda Axtell, Michael J. |
author_sort | Johnson, Nathan R. |
collection | PubMed |
description | High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq libraries often have a high proportion of reads that align to multiple genomic locations, which makes determining their true origins difficult. Commonly used sRNA-seq alignment methods result in either very low precision (choosing an alignment at random), or sensitivity (ignoring multi-mapping reads). Here, we describe and test an sRNA-seq alignment strategy that uses local genomic context to guide decisions on proper placements of multi-mapped sRNA-seq reads. Tests using simulated sRNA-seq data demonstrated that this local-weighting method outperforms other alignment strategies using three different plant genomes. Experimental analyses with real sRNA-seq data also indicate superior performance of local-weighting methods for both plant miRNAs and heterochromatic siRNAs. The local-weighting methods we have developed are implemented as part of the sRNA-seq analysis program ShortStack, which is freely available under a general public license. Improved genome alignments of sRNA-seq data should increase the quality of downstream analyses and genome annotation efforts. |
format | Online Article Text |
id | pubmed-4938663 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-49386632016-07-19 Improved Placement of Multi-mapping Small RNAs Johnson, Nathan R. Yeoh, Jonathan M. Coruh, Ceyda Axtell, Michael J. G3 (Bethesda) Investigations High-throughput sequencing of small RNAs (sRNA-seq) is a popular method used to discover and annotate microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs), and Piwi-associated RNAs (piRNAs). One of the key steps in sRNA-seq data analysis is alignment to a reference genome. sRNA-seq libraries often have a high proportion of reads that align to multiple genomic locations, which makes determining their true origins difficult. Commonly used sRNA-seq alignment methods result in either very low precision (choosing an alignment at random), or sensitivity (ignoring multi-mapping reads). Here, we describe and test an sRNA-seq alignment strategy that uses local genomic context to guide decisions on proper placements of multi-mapped sRNA-seq reads. Tests using simulated sRNA-seq data demonstrated that this local-weighting method outperforms other alignment strategies using three different plant genomes. Experimental analyses with real sRNA-seq data also indicate superior performance of local-weighting methods for both plant miRNAs and heterochromatic siRNAs. The local-weighting methods we have developed are implemented as part of the sRNA-seq analysis program ShortStack, which is freely available under a general public license. Improved genome alignments of sRNA-seq data should increase the quality of downstream analyses and genome annotation efforts. Genetics Society of America 2016-05-11 /pmc/articles/PMC4938663/ /pubmed/27175019 http://dx.doi.org/10.1534/g3.116.030452 Text en Copyright © 2016 Johnson et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations Johnson, Nathan R. Yeoh, Jonathan M. Coruh, Ceyda Axtell, Michael J. Improved Placement of Multi-mapping Small RNAs |
title | Improved Placement of Multi-mapping Small RNAs |
title_full | Improved Placement of Multi-mapping Small RNAs |
title_fullStr | Improved Placement of Multi-mapping Small RNAs |
title_full_unstemmed | Improved Placement of Multi-mapping Small RNAs |
title_short | Improved Placement of Multi-mapping Small RNAs |
title_sort | improved placement of multi-mapping small rnas |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4938663/ https://www.ncbi.nlm.nih.gov/pubmed/27175019 http://dx.doi.org/10.1534/g3.116.030452 |
work_keys_str_mv | AT johnsonnathanr improvedplacementofmultimappingsmallrnas AT yeohjonathanm improvedplacementofmultimappingsmallrnas AT coruhceyda improvedplacementofmultimappingsmallrnas AT axtellmichaelj improvedplacementofmultimappingsmallrnas |