Cargando…
SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between e...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919401/ https://www.ncbi.nlm.nih.gov/pubmed/20706591 http://dx.doi.org/10.1371/journal.pone.0012047 |
_version_ | 1782185176563974144 |
---|---|
author | Ning, Kang Fermin, Damian |
author_facet | Ning, Kang Fermin, Damian |
author_sort | Ning, Kang |
collection | PubMed |
description | Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between exons that are relatively close on genome sequences, reads generated by RNA-Seq are not limited to alternative splicing between close exons but occur in virtually all splicing events. In this work, a novel method, SAW, was proposed for the identification of all splicing events based on short reads from RNA-Seq. It was observed that short reads not in known gene models are actually absent words from known gene sequences. An efficient method to filter and cluster these short reads by fingerprint fragments of splicing events without aligning short reads to genome sequences was developed. Additionally, the possible splicing sites were also determined without alignment against genome sequences. A consensus sequence was then generated for each short read cluster, which was then aligned to the genome sequences. Results demonstrated that this method could identify more than 90% of the known splicing events with a very low false discovery rate, as well as accurately identify, a number of novel splicing events between distant exons. |
format | Text |
id | pubmed-2919401 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-29194012010-08-12 SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints Ning, Kang Fermin, Damian PLoS One Research Article Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between exons that are relatively close on genome sequences, reads generated by RNA-Seq are not limited to alternative splicing between close exons but occur in virtually all splicing events. In this work, a novel method, SAW, was proposed for the identification of all splicing events based on short reads from RNA-Seq. It was observed that short reads not in known gene models are actually absent words from known gene sequences. An efficient method to filter and cluster these short reads by fingerprint fragments of splicing events without aligning short reads to genome sequences was developed. Additionally, the possible splicing sites were also determined without alignment against genome sequences. A consensus sequence was then generated for each short read cluster, which was then aligned to the genome sequences. Results demonstrated that this method could identify more than 90% of the known splicing events with a very low false discovery rate, as well as accurately identify, a number of novel splicing events between distant exons. Public Library of Science 2010-08-10 /pmc/articles/PMC2919401/ /pubmed/20706591 http://dx.doi.org/10.1371/journal.pone.0012047 Text en Ning, Fermin. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Ning, Kang Fermin, Damian SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints |
title | SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints |
title_full | SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints |
title_fullStr | SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints |
title_full_unstemmed | SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints |
title_short | SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints |
title_sort | saw: a method to identify splicing events from rna-seq data based on splicing fingerprints |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919401/ https://www.ncbi.nlm.nih.gov/pubmed/20706591 http://dx.doi.org/10.1371/journal.pone.0012047 |
work_keys_str_mv | AT ningkang sawamethodtoidentifysplicingeventsfromrnaseqdatabasedonsplicingfingerprints AT fermindamian sawamethodtoidentifysplicingeventsfromrnaseqdatabasedonsplicingfingerprints |