Cargando…

An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs

Reconstructing full-length transcript isoforms from sequence fragments (such as ESTs) is a major interest and challenge for bioinformatic analysis of pre-mRNA alternative splicing. This problem has been formulated as finding traversals across the splice graph, which is a directed acyclic graph (DAG)...

Descripción completa

Detalles Bibliográficos
Autores principales: Xing, Yi, Yu, Tianwei, Wu, Ying Nian, Roy, Meenakshi, Kim, Joseph, Lee, Christopher
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1475746/
https://www.ncbi.nlm.nih.gov/pubmed/16757580
http://dx.doi.org/10.1093/nar/gkl396
_version_ 1782128142302838784
author Xing, Yi
Yu, Tianwei
Wu, Ying Nian
Roy, Meenakshi
Kim, Joseph
Lee, Christopher
author_facet Xing, Yi
Yu, Tianwei
Wu, Ying Nian
Roy, Meenakshi
Kim, Joseph
Lee, Christopher
author_sort Xing, Yi
collection PubMed
description Reconstructing full-length transcript isoforms from sequence fragments (such as ESTs) is a major interest and challenge for bioinformatic analysis of pre-mRNA alternative splicing. This problem has been formulated as finding traversals across the splice graph, which is a directed acyclic graph (DAG) representation of gene structure and alternative splicing. In this manuscript we introduce a probabilistic formulation of the isoform reconstruction problem, and provide an expectation-maximization (EM) algorithm for its maximum likelihood solution. Using a series of simulated data and expressed sequences from real human genes, we demonstrate that our EM algorithm can correctly handle various situations of fragmentation and coupling in the input data. Our work establishes a general probabilistic framework for splice graph-based reconstructions of full-length isoforms.
format Text
id pubmed-1475746
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-14757462006-06-12 An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs Xing, Yi Yu, Tianwei Wu, Ying Nian Roy, Meenakshi Kim, Joseph Lee, Christopher Nucleic Acids Res Article Reconstructing full-length transcript isoforms from sequence fragments (such as ESTs) is a major interest and challenge for bioinformatic analysis of pre-mRNA alternative splicing. This problem has been formulated as finding traversals across the splice graph, which is a directed acyclic graph (DAG) representation of gene structure and alternative splicing. In this manuscript we introduce a probabilistic formulation of the isoform reconstruction problem, and provide an expectation-maximization (EM) algorithm for its maximum likelihood solution. Using a series of simulated data and expressed sequences from real human genes, we demonstrate that our EM algorithm can correctly handle various situations of fragmentation and coupling in the input data. Our work establishes a general probabilistic framework for splice graph-based reconstructions of full-length isoforms. Oxford University Press 2006 2006-06-06 /pmc/articles/PMC1475746/ /pubmed/16757580 http://dx.doi.org/10.1093/nar/gkl396 Text en © 2006 The Author(s)
spellingShingle Article
Xing, Yi
Yu, Tianwei
Wu, Ying Nian
Roy, Meenakshi
Kim, Joseph
Lee, Christopher
An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
title An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
title_full An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
title_fullStr An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
title_full_unstemmed An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
title_short An expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
title_sort expectation-maximization algorithm for probabilistic reconstructions of full-length isoforms from splice graphs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1475746/
https://www.ncbi.nlm.nih.gov/pubmed/16757580
http://dx.doi.org/10.1093/nar/gkl396
work_keys_str_mv AT xingyi anexpectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT yutianwei anexpectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT wuyingnian anexpectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT roymeenakshi anexpectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT kimjoseph anexpectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT leechristopher anexpectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT xingyi expectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT yutianwei expectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT wuyingnian expectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT roymeenakshi expectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT kimjoseph expectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs
AT leechristopher expectationmaximizationalgorithmforprobabilisticreconstructionsoffulllengthisoformsfromsplicegraphs