Cargando…

Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data

Integrating large-scale functional genomic data has significantly accelerated our understanding of gene functions. However, no algorithm has been developed to differentiate functions for isoforms of the same gene using high-throughput genomic data. This is because standard supervised learning requir...

Descripción completa

Detalles Bibliográficos
Autores principales: Eksi, Ridvan, Li, Hong-Dong, Menon, Rajasree, Wen, Yuchen, Omenn, Gilbert S., Kretzler, Matthias, Guan, Yuanfang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3820534/
https://www.ncbi.nlm.nih.gov/pubmed/24244129
http://dx.doi.org/10.1371/journal.pcbi.1003314
_version_ 1782290153581051904
author Eksi, Ridvan
Li, Hong-Dong
Menon, Rajasree
Wen, Yuchen
Omenn, Gilbert S.
Kretzler, Matthias
Guan, Yuanfang
author_facet Eksi, Ridvan
Li, Hong-Dong
Menon, Rajasree
Wen, Yuchen
Omenn, Gilbert S.
Kretzler, Matthias
Guan, Yuanfang
author_sort Eksi, Ridvan
collection PubMed
description Integrating large-scale functional genomic data has significantly accelerated our understanding of gene functions. However, no algorithm has been developed to differentiate functions for isoforms of the same gene using high-throughput genomic data. This is because standard supervised learning requires ‘ground-truth’ functional annotations, which are lacking at the isoform level. To address this challenge, we developed a generic framework that interrogates public RNA-seq data at the transcript level to differentiate functions for alternatively spliced isoforms. For a specific function, our algorithm identifies the ‘responsible’ isoform(s) of a gene and generates classifying models at the isoform level instead of at the gene level. Through cross-validation, we demonstrated that our algorithm is effective in assigning functions to genes, especially the ones with multiple isoforms, and robust to gene expression levels and removal of homologous gene pairs. We identified genes in the mouse whose isoforms are predicted to have disparate functionalities and experimentally validated the ‘responsible’ isoforms using data from mammary tissue. With protein structure modeling and experimental evidence, we further validated the predicted isoform functional differences for the genes Cdkn2a and Anxa6. Our generic framework is the first to predict and differentiate functions for alternatively spliced isoforms, instead of genes, using genomic data. It is extendable to any base machine learner and other species with alternatively spliced isoforms, and shifts the current gene-centered function prediction to isoform-level predictions.
format Online
Article
Text
id pubmed-3820534
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-38205342013-11-15 Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data Eksi, Ridvan Li, Hong-Dong Menon, Rajasree Wen, Yuchen Omenn, Gilbert S. Kretzler, Matthias Guan, Yuanfang PLoS Comput Biol Research Article Integrating large-scale functional genomic data has significantly accelerated our understanding of gene functions. However, no algorithm has been developed to differentiate functions for isoforms of the same gene using high-throughput genomic data. This is because standard supervised learning requires ‘ground-truth’ functional annotations, which are lacking at the isoform level. To address this challenge, we developed a generic framework that interrogates public RNA-seq data at the transcript level to differentiate functions for alternatively spliced isoforms. For a specific function, our algorithm identifies the ‘responsible’ isoform(s) of a gene and generates classifying models at the isoform level instead of at the gene level. Through cross-validation, we demonstrated that our algorithm is effective in assigning functions to genes, especially the ones with multiple isoforms, and robust to gene expression levels and removal of homologous gene pairs. We identified genes in the mouse whose isoforms are predicted to have disparate functionalities and experimentally validated the ‘responsible’ isoforms using data from mammary tissue. With protein structure modeling and experimental evidence, we further validated the predicted isoform functional differences for the genes Cdkn2a and Anxa6. Our generic framework is the first to predict and differentiate functions for alternatively spliced isoforms, instead of genes, using genomic data. It is extendable to any base machine learner and other species with alternatively spliced isoforms, and shifts the current gene-centered function prediction to isoform-level predictions. Public Library of Science 2013-11-07 /pmc/articles/PMC3820534/ /pubmed/24244129 http://dx.doi.org/10.1371/journal.pcbi.1003314 Text en © 2013 Eksi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Eksi, Ridvan
Li, Hong-Dong
Menon, Rajasree
Wen, Yuchen
Omenn, Gilbert S.
Kretzler, Matthias
Guan, Yuanfang
Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data
title Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data
title_full Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data
title_fullStr Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data
title_full_unstemmed Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data
title_short Systematically Differentiating Functions for Alternatively Spliced Isoforms through Integrating RNA-seq Data
title_sort systematically differentiating functions for alternatively spliced isoforms through integrating rna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3820534/
https://www.ncbi.nlm.nih.gov/pubmed/24244129
http://dx.doi.org/10.1371/journal.pcbi.1003314
work_keys_str_mv AT eksiridvan systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata
AT lihongdong systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata
AT menonrajasree systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata
AT wenyuchen systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata
AT omenngilberts systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata
AT kretzlermatthias systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata
AT guanyuanfang systematicallydifferentiatingfunctionsforalternativelysplicedisoformsthroughintegratingrnaseqdata