Cargando…

Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification

The automated transcript discovery and quantification of high-throughput RNA sequencing (RNA-seq) data are important tasks of next-generation sequencing (NGS) research. However, these tasks are challenging due to the uncertainties that arise in the inference of complete splicing isoform variants fro...

Descripción completa

Detalles Bibliográficos
Autores principales: Deng, Yue, Bao, Feng, Yang, Yang, Ji, Xiangyang, Du, Mulong, Zhang, Zhengdong, Wang, Meilin, Dai, Qionghai
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587798/
https://www.ncbi.nlm.nih.gov/pubmed/28911101
http://dx.doi.org/10.1093/nar/gkx585
_version_ 1783262062857682944
author Deng, Yue
Bao, Feng
Yang, Yang
Ji, Xiangyang
Du, Mulong
Zhang, Zhengdong
Wang, Meilin
Dai, Qionghai
author_facet Deng, Yue
Bao, Feng
Yang, Yang
Ji, Xiangyang
Du, Mulong
Zhang, Zhengdong
Wang, Meilin
Dai, Qionghai
author_sort Deng, Yue
collection PubMed
description The automated transcript discovery and quantification of high-throughput RNA sequencing (RNA-seq) data are important tasks of next-generation sequencing (NGS) research. However, these tasks are challenging due to the uncertainties that arise in the inference of complete splicing isoform variants from partially observed short reads. Here, we address this problem by explicitly reducing the inherent uncertainties in a biological system caused by missing information. In our approach, the RNA-seq procedure for transforming transcripts into short reads is considered an information transmission process. Consequently, the data uncertainties are substantially reduced by exploiting the information transduction capacity of information theory. The experimental results obtained from the analyses of simulated datasets and RNA-seq datasets from cell lines and tissues demonstrate the advantages of our method over state-of-the-art competitors. Our algorithm is an open-source implementation of MaxInfo.
format Online
Article
Text
id pubmed-5587798
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-55877982017-09-11 Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification Deng, Yue Bao, Feng Yang, Yang Ji, Xiangyang Du, Mulong Zhang, Zhengdong Wang, Meilin Dai, Qionghai Nucleic Acids Res Methods Online The automated transcript discovery and quantification of high-throughput RNA sequencing (RNA-seq) data are important tasks of next-generation sequencing (NGS) research. However, these tasks are challenging due to the uncertainties that arise in the inference of complete splicing isoform variants from partially observed short reads. Here, we address this problem by explicitly reducing the inherent uncertainties in a biological system caused by missing information. In our approach, the RNA-seq procedure for transforming transcripts into short reads is considered an information transmission process. Consequently, the data uncertainties are substantially reduced by exploiting the information transduction capacity of information theory. The experimental results obtained from the analyses of simulated datasets and RNA-seq datasets from cell lines and tissues demonstrate the advantages of our method over state-of-the-art competitors. Our algorithm is an open-source implementation of MaxInfo. Oxford University Press 2017-09-06 2017-07-07 /pmc/articles/PMC5587798/ /pubmed/28911101 http://dx.doi.org/10.1093/nar/gkx585 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Deng, Yue
Bao, Feng
Yang, Yang
Ji, Xiangyang
Du, Mulong
Zhang, Zhengdong
Wang, Meilin
Dai, Qionghai
Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
title Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
title_full Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
title_fullStr Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
title_full_unstemmed Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
title_short Information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
title_sort information transduction capacity reduces the uncertainties in annotation-free isoform discovery and quantification
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5587798/
https://www.ncbi.nlm.nih.gov/pubmed/28911101
http://dx.doi.org/10.1093/nar/gkx585
work_keys_str_mv AT dengyue informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT baofeng informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT yangyang informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT jixiangyang informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT dumulong informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT zhangzhengdong informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT wangmeilin informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification
AT daiqionghai informationtransductioncapacityreducestheuncertaintiesinannotationfreeisoformdiscoveryandquantification