Cargando…

Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data

BACKGROUND: Gene expression measurements during the development of the fly Drosophila melanogaster are routinely used to find functional modules of temporally co-expressed genes. Complimentary large data sets of in situ RNA hybridization images for different stages of the fly embryo elucidate the sp...

Descripción completa

Detalles Bibliográficos
Autores principales: Costa, Ivan G, Krause, Roland, Opitz, Lennart, Schliep, Alexander
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2230504/
https://www.ncbi.nlm.nih.gov/pubmed/18269697
http://dx.doi.org/10.1186/1471-2105-8-S10-S3
_version_ 1782150221820592128
author Costa, Ivan G
Krause, Roland
Opitz, Lennart
Schliep, Alexander
author_facet Costa, Ivan G
Krause, Roland
Opitz, Lennart
Schliep, Alexander
author_sort Costa, Ivan G
collection PubMed
description BACKGROUND: Gene expression measurements during the development of the fly Drosophila melanogaster are routinely used to find functional modules of temporally co-expressed genes. Complimentary large data sets of in situ RNA hybridization images for different stages of the fly embryo elucidate the spatial expression patterns. RESULTS: Using a semi-supervised approach, constrained clustering with mixture models, we can find clusters of genes exhibiting spatio-temporal similarities in expression, or syn-expression. The temporal gene expression measurements are taken as primary data for which pairwise constraints are computed in an automated fashion from raw in situ images without the need for manual annotation. We investigate the influence of these pairwise constraints in the clustering and discuss the biological relevance of our results. CONCLUSION: Spatial information contributes to a detailed, biological meaningful analysis of temporal gene expression data. Semi-supervised learning provides a flexible, robust and efficient framework for integrating data sources of differing quality and abundance.
format Text
id pubmed-2230504
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-22305042008-02-06 Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data Costa, Ivan G Krause, Roland Opitz, Lennart Schliep, Alexander BMC Bioinformatics Proceedings BACKGROUND: Gene expression measurements during the development of the fly Drosophila melanogaster are routinely used to find functional modules of temporally co-expressed genes. Complimentary large data sets of in situ RNA hybridization images for different stages of the fly embryo elucidate the spatial expression patterns. RESULTS: Using a semi-supervised approach, constrained clustering with mixture models, we can find clusters of genes exhibiting spatio-temporal similarities in expression, or syn-expression. The temporal gene expression measurements are taken as primary data for which pairwise constraints are computed in an automated fashion from raw in situ images without the need for manual annotation. We investigate the influence of these pairwise constraints in the clustering and discuss the biological relevance of our results. CONCLUSION: Spatial information contributes to a detailed, biological meaningful analysis of temporal gene expression data. Semi-supervised learning provides a flexible, robust and efficient framework for integrating data sources of differing quality and abundance. BioMed Central 2007-12-21 /pmc/articles/PMC2230504/ /pubmed/18269697 http://dx.doi.org/10.1186/1471-2105-8-S10-S3 Text en Copyright © 2007 Costa et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Costa, Ivan G
Krause, Roland
Opitz, Lennart
Schliep, Alexander
Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
title Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
title_full Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
title_fullStr Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
title_full_unstemmed Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
title_short Semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
title_sort semi-supervised learning for the identification of syn-expressed genes from fused microarray and in situ image data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2230504/
https://www.ncbi.nlm.nih.gov/pubmed/18269697
http://dx.doi.org/10.1186/1471-2105-8-S10-S3
work_keys_str_mv AT costaivang semisupervisedlearningfortheidentificationofsynexpressedgenesfromfusedmicroarrayandinsituimagedata
AT krauseroland semisupervisedlearningfortheidentificationofsynexpressedgenesfromfusedmicroarrayandinsituimagedata
AT opitzlennart semisupervisedlearningfortheidentificationofsynexpressedgenesfromfusedmicroarrayandinsituimagedata
AT schliepalexander semisupervisedlearningfortheidentificationofsynexpressedgenesfromfusedmicroarrayandinsituimagedata