Cargando…

Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields

Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and acros...

Descripción completa

Detalles Bibliográficos
Autores principales: Pruteanu-Malinici, Iulian, Majoros, William H., Ohler, Uwe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694682/
https://www.ncbi.nlm.nih.gov/pubmed/23812993
http://dx.doi.org/10.1093/bioinformatics/btt206
_version_ 1782274888377040896
author Pruteanu-Malinici, Iulian
Majoros, William H.
Ohler, Uwe
author_facet Pruteanu-Malinici, Iulian
Majoros, William H.
Ohler, Uwe
author_sort Pruteanu-Malinici, Iulian
collection PubMed
description Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and across multiple time points, phenotypes are frequently annotated independently, for individual time points only. In particular, for the analysis of developmental gene expression patterns, it is biologically sensible when images across multiple time points are jointly accounted for, such that spatial and temporal dependencies are captured simultaneously. Methods: We describe a discriminative undirected graphical model to label gene-expression time-series image data, with an efficient training and decoding method based on the junction tree algorithm. The approach is based on an effective feature selection technique, consisting of a non-parametric sparse Bayesian factor analysis model. The result is a flexible framework, which can handle large-scale data with noisy incomplete samples, i.e. it can tolerate data missing from individual time points. Results: Using the annotation of gene expression patterns across stages of Drosophila embryonic development as an example, we demonstrate that our method achieves superior accuracy, gained by jointly annotating phenotype sequences, when compared with previous models that annotate each stage in isolation. The experimental results on missing data indicate that our joint learning method successfully annotates genes for which no expression data are available for one or more stages. Contact: uwe.ohler@duke.edu
format Online
Article
Text
id pubmed-3694682
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36946822013-06-27 Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields Pruteanu-Malinici, Iulian Majoros, William H. Ohler, Uwe Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and across multiple time points, phenotypes are frequently annotated independently, for individual time points only. In particular, for the analysis of developmental gene expression patterns, it is biologically sensible when images across multiple time points are jointly accounted for, such that spatial and temporal dependencies are captured simultaneously. Methods: We describe a discriminative undirected graphical model to label gene-expression time-series image data, with an efficient training and decoding method based on the junction tree algorithm. The approach is based on an effective feature selection technique, consisting of a non-parametric sparse Bayesian factor analysis model. The result is a flexible framework, which can handle large-scale data with noisy incomplete samples, i.e. it can tolerate data missing from individual time points. Results: Using the annotation of gene expression patterns across stages of Drosophila embryonic development as an example, we demonstrate that our method achieves superior accuracy, gained by jointly annotating phenotype sequences, when compared with previous models that annotate each stage in isolation. The experimental results on missing data indicate that our joint learning method successfully annotates genes for which no expression data are available for one or more stages. Contact: uwe.ohler@duke.edu Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694682/ /pubmed/23812993 http://dx.doi.org/10.1093/bioinformatics/btt206 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
Pruteanu-Malinici, Iulian
Majoros, William H.
Ohler, Uwe
Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
title Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
title_full Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
title_fullStr Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
title_full_unstemmed Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
title_short Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
title_sort automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
topic Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694682/
https://www.ncbi.nlm.nih.gov/pubmed/23812993
http://dx.doi.org/10.1093/bioinformatics/btt206
work_keys_str_mv AT pruteanumaliniciiulian automatedannotationofgeneexpressionimagesequencesvianonparametricfactoranalysisandconditionalrandomfields
AT majoroswilliamh automatedannotationofgeneexpressionimagesequencesvianonparametricfactoranalysisandconditionalrandomfields
AT ohleruwe automatedannotationofgeneexpressionimagesequencesvianonparametricfactoranalysisandconditionalrandomfields