Cargando…
Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields
Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and acros...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694682/ https://www.ncbi.nlm.nih.gov/pubmed/23812993 http://dx.doi.org/10.1093/bioinformatics/btt206 |
_version_ | 1782274888377040896 |
---|---|
author | Pruteanu-Malinici, Iulian Majoros, William H. Ohler, Uwe |
author_facet | Pruteanu-Malinici, Iulian Majoros, William H. Ohler, Uwe |
author_sort | Pruteanu-Malinici, Iulian |
collection | PubMed |
description | Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and across multiple time points, phenotypes are frequently annotated independently, for individual time points only. In particular, for the analysis of developmental gene expression patterns, it is biologically sensible when images across multiple time points are jointly accounted for, such that spatial and temporal dependencies are captured simultaneously. Methods: We describe a discriminative undirected graphical model to label gene-expression time-series image data, with an efficient training and decoding method based on the junction tree algorithm. The approach is based on an effective feature selection technique, consisting of a non-parametric sparse Bayesian factor analysis model. The result is a flexible framework, which can handle large-scale data with noisy incomplete samples, i.e. it can tolerate data missing from individual time points. Results: Using the annotation of gene expression patterns across stages of Drosophila embryonic development as an example, we demonstrate that our method achieves superior accuracy, gained by jointly annotating phenotype sequences, when compared with previous models that annotate each stage in isolation. The experimental results on missing data indicate that our joint learning method successfully annotates genes for which no expression data are available for one or more stages. Contact: uwe.ohler@duke.edu |
format | Online Article Text |
id | pubmed-3694682 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36946822013-06-27 Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields Pruteanu-Malinici, Iulian Majoros, William H. Ohler, Uwe Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: Computational approaches for the annotation of phenotypes from image data have shown promising results across many applications, and provide rich and valuable information for studying gene function and interactions. While data are often available both at high spatial resolution and across multiple time points, phenotypes are frequently annotated independently, for individual time points only. In particular, for the analysis of developmental gene expression patterns, it is biologically sensible when images across multiple time points are jointly accounted for, such that spatial and temporal dependencies are captured simultaneously. Methods: We describe a discriminative undirected graphical model to label gene-expression time-series image data, with an efficient training and decoding method based on the junction tree algorithm. The approach is based on an effective feature selection technique, consisting of a non-parametric sparse Bayesian factor analysis model. The result is a flexible framework, which can handle large-scale data with noisy incomplete samples, i.e. it can tolerate data missing from individual time points. Results: Using the annotation of gene expression patterns across stages of Drosophila embryonic development as an example, we demonstrate that our method achieves superior accuracy, gained by jointly annotating phenotype sequences, when compared with previous models that annotate each stage in isolation. The experimental results on missing data indicate that our joint learning method successfully annotates genes for which no expression data are available for one or more stages. Contact: uwe.ohler@duke.edu Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694682/ /pubmed/23812993 http://dx.doi.org/10.1093/bioinformatics/btt206 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Pruteanu-Malinici, Iulian Majoros, William H. Ohler, Uwe Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
title | Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
title_full | Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
title_fullStr | Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
title_full_unstemmed | Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
title_short | Automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
title_sort | automated annotation of gene expression image sequences via non-parametric factor analysis and conditional random fields |
topic | Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694682/ https://www.ncbi.nlm.nih.gov/pubmed/23812993 http://dx.doi.org/10.1093/bioinformatics/btt206 |
work_keys_str_mv | AT pruteanumaliniciiulian automatedannotationofgeneexpressionimagesequencesvianonparametricfactoranalysisandconditionalrandomfields AT majoroswilliamh automatedannotationofgeneexpressionimagesequencesvianonparametricfactoranalysisandconditionalrandomfields AT ohleruwe automatedannotationofgeneexpressionimagesequencesvianonparametricfactoranalysisandconditionalrandomfields |