Cargando…

SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints

Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between e...

Descripción completa

Detalles Bibliográficos
Autores principales: Ning, Kang, Fermin, Damian
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919401/
https://www.ncbi.nlm.nih.gov/pubmed/20706591
http://dx.doi.org/10.1371/journal.pone.0012047
_version_ 1782185176563974144
author Ning, Kang
Fermin, Damian
author_facet Ning, Kang
Fermin, Damian
author_sort Ning, Kang
collection PubMed
description Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between exons that are relatively close on genome sequences, reads generated by RNA-Seq are not limited to alternative splicing between close exons but occur in virtually all splicing events. In this work, a novel method, SAW, was proposed for the identification of all splicing events based on short reads from RNA-Seq. It was observed that short reads not in known gene models are actually absent words from known gene sequences. An efficient method to filter and cluster these short reads by fingerprint fragments of splicing events without aligning short reads to genome sequences was developed. Additionally, the possible splicing sites were also determined without alignment against genome sequences. A consensus sequence was then generated for each short read cluster, which was then aligned to the genome sequences. Results demonstrated that this method could identify more than 90% of the known splicing events with a very low false discovery rate, as well as accurately identify, a number of novel splicing events between distant exons.
format Text
id pubmed-2919401
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-29194012010-08-12 SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints Ning, Kang Fermin, Damian PLoS One Research Article Splicing event identification is one of the most important issues in the comprehensive analysis of transcription profile. Recent development of next-generation sequencing technology has generated an extensive profile of alternative splicing. However, while many of these splicing events are between exons that are relatively close on genome sequences, reads generated by RNA-Seq are not limited to alternative splicing between close exons but occur in virtually all splicing events. In this work, a novel method, SAW, was proposed for the identification of all splicing events based on short reads from RNA-Seq. It was observed that short reads not in known gene models are actually absent words from known gene sequences. An efficient method to filter and cluster these short reads by fingerprint fragments of splicing events without aligning short reads to genome sequences was developed. Additionally, the possible splicing sites were also determined without alignment against genome sequences. A consensus sequence was then generated for each short read cluster, which was then aligned to the genome sequences. Results demonstrated that this method could identify more than 90% of the known splicing events with a very low false discovery rate, as well as accurately identify, a number of novel splicing events between distant exons. Public Library of Science 2010-08-10 /pmc/articles/PMC2919401/ /pubmed/20706591 http://dx.doi.org/10.1371/journal.pone.0012047 Text en Ning, Fermin. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ning, Kang
Fermin, Damian
SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
title SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
title_full SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
title_fullStr SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
title_full_unstemmed SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
title_short SAW: A Method to Identify Splicing Events from RNA-Seq Data Based on Splicing Fingerprints
title_sort saw: a method to identify splicing events from rna-seq data based on splicing fingerprints
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2919401/
https://www.ncbi.nlm.nih.gov/pubmed/20706591
http://dx.doi.org/10.1371/journal.pone.0012047
work_keys_str_mv AT ningkang sawamethodtoidentifysplicingeventsfromrnaseqdatabasedonsplicingfingerprints
AT fermindamian sawamethodtoidentifysplicingeventsfromrnaseqdatabasedonsplicingfingerprints