Cargando…

High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method

Alternative transcript processing is an important mechanism for generating functional diversity in genes. However, little is known about the precise functions of individual isoforms. In fact, proteins (translated from transcript isoforms), not genes, are the function carriers. By integrating multipl...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wenyuan, Kang, Shuli, Liu, Chun-Chi, Zhang, Shihua, Shi, Yi, Liu, Yan, Zhou, Xianghong Jasmine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3973446/
https://www.ncbi.nlm.nih.gov/pubmed/24369432
http://dx.doi.org/10.1093/nar/gkt1362
_version_ 1782479322999685120
author Li, Wenyuan
Kang, Shuli
Liu, Chun-Chi
Zhang, Shihua
Shi, Yi
Liu, Yan
Zhou, Xianghong Jasmine
author_facet Li, Wenyuan
Kang, Shuli
Liu, Chun-Chi
Zhang, Shihua
Shi, Yi
Liu, Yan
Zhou, Xianghong Jasmine
author_sort Li, Wenyuan
collection PubMed
description Alternative transcript processing is an important mechanism for generating functional diversity in genes. However, little is known about the precise functions of individual isoforms. In fact, proteins (translated from transcript isoforms), not genes, are the function carriers. By integrating multiple human RNA-seq data sets, we carried out the first systematic prediction of isoform functions, enabling high-resolution functional annotation of human transcriptome. Unlike gene function prediction, isoform function prediction faces a unique challenge: the lack of the training data—all known functional annotations are at the gene level. To address this challenge, we modelled the gene–isoform relationships as multiple instance data and developed a novel label propagation method to predict functions. Our method achieved an average area under the receiver operating characteristic curve of 0.67 and assigned functions to 15 572 isoforms. Interestingly, we observed that different functions have different sensitivities to alternative isoform processing, and that the function diversity of isoforms from the same gene is positively correlated with their tissue expression diversity. Finally, we surveyed the literature to validate our predictions for a number of apoptotic genes. Strikingly, for the famous ‘TP53’ gene, we not only accurately identified the apoptosis regulation function of its five isoforms, but also correctly predicted the precise direction of the regulation.
format Online
Article
Text
id pubmed-3973446
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-39734462014-04-04 High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method Li, Wenyuan Kang, Shuli Liu, Chun-Chi Zhang, Shihua Shi, Yi Liu, Yan Zhou, Xianghong Jasmine Nucleic Acids Res Methods Online Alternative transcript processing is an important mechanism for generating functional diversity in genes. However, little is known about the precise functions of individual isoforms. In fact, proteins (translated from transcript isoforms), not genes, are the function carriers. By integrating multiple human RNA-seq data sets, we carried out the first systematic prediction of isoform functions, enabling high-resolution functional annotation of human transcriptome. Unlike gene function prediction, isoform function prediction faces a unique challenge: the lack of the training data—all known functional annotations are at the gene level. To address this challenge, we modelled the gene–isoform relationships as multiple instance data and developed a novel label propagation method to predict functions. Our method achieved an average area under the receiver operating characteristic curve of 0.67 and assigned functions to 15 572 isoforms. Interestingly, we observed that different functions have different sensitivities to alternative isoform processing, and that the function diversity of isoforms from the same gene is positively correlated with their tissue expression diversity. Finally, we surveyed the literature to validate our predictions for a number of apoptotic genes. Strikingly, for the famous ‘TP53’ gene, we not only accurately identified the apoptosis regulation function of its five isoforms, but also correctly predicted the precise direction of the regulation. Oxford University Press 2014-04 2013-12-25 /pmc/articles/PMC3973446/ /pubmed/24369432 http://dx.doi.org/10.1093/nar/gkt1362 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Li, Wenyuan
Kang, Shuli
Liu, Chun-Chi
Zhang, Shihua
Shi, Yi
Liu, Yan
Zhou, Xianghong Jasmine
High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
title High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
title_full High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
title_fullStr High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
title_full_unstemmed High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
title_short High-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
title_sort high-resolution functional annotation of human transcriptome: predicting isoform functions by a novel multiple instance-based label propagation method
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3973446/
https://www.ncbi.nlm.nih.gov/pubmed/24369432
http://dx.doi.org/10.1093/nar/gkt1362
work_keys_str_mv AT liwenyuan highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod
AT kangshuli highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod
AT liuchunchi highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod
AT zhangshihua highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod
AT shiyi highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod
AT liuyan highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod
AT zhouxianghongjasmine highresolutionfunctionalannotationofhumantranscriptomepredictingisoformfunctionsbyanovelmultipleinstancebasedlabelpropagationmethod