Cargando…

Partitioning of functional gene expression data using principal points

BACKGROUND: DNA microarrays offer motivation and hope for the simultaneous study of variations in multiple genes. Gene expression is a temporal process that allows variations in expression levels with a characterized gene function over a period of time. Temporal gene expression curves can be treated...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Jaehee, Kim, Haseong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5639779/
https://www.ncbi.nlm.nih.gov/pubmed/29025390
http://dx.doi.org/10.1186/s12859-017-1860-0
_version_ 1783270944571129856
author Kim, Jaehee
Kim, Haseong
author_facet Kim, Jaehee
Kim, Haseong
author_sort Kim, Jaehee
collection PubMed
description BACKGROUND: DNA microarrays offer motivation and hope for the simultaneous study of variations in multiple genes. Gene expression is a temporal process that allows variations in expression levels with a characterized gene function over a period of time. Temporal gene expression curves can be treated as functional data since they are considered as independent realizations of a stochastic process. This process requires appropriate models to identify patterns of gene functions. The partitioning of the functional data can find homogeneous subgroups of entities for the massive genes within the inherent biological networks. Therefor it can be a useful technique for the analysis of time-course gene expression data. We propose a new self-consistent partitioning method of functional coefficients for individual expression profiles based on the orthonormal basis system. RESULTS: A principal points based functional partitioning method is proposed for time-course gene expression data. The method explores the relationship between genes using Legendre coefficients as principal points to extract the features of gene functions. Our proposed method provides high connectivity in connectedness after clustering for simulated data and finds a significant subsets of genes with the increased connectivity. Our approach has comparative advantages that fewer coefficients are used from the functional data and self-consistency of principal points for partitioning. As real data applications, we are able to find partitioned genes through the gene expressions found in budding yeast data and Escherichia coli data. CONCLUSIONS: The proposed method benefitted from the use of principal points, dimension reduction, and choice of orthogonal basis system as well as provides appropriately connected genes in the resulting subsets. We illustrate our method by applying with each set of cell-cycle-regulated time-course yeast genes and E. coli genes. The proposed method is able to identify highly connected genes and to explore the complex dynamics of biological systems in functional genomics.
format Online
Article
Text
id pubmed-5639779
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-56397792017-10-18 Partitioning of functional gene expression data using principal points Kim, Jaehee Kim, Haseong BMC Bioinformatics Research Article BACKGROUND: DNA microarrays offer motivation and hope for the simultaneous study of variations in multiple genes. Gene expression is a temporal process that allows variations in expression levels with a characterized gene function over a period of time. Temporal gene expression curves can be treated as functional data since they are considered as independent realizations of a stochastic process. This process requires appropriate models to identify patterns of gene functions. The partitioning of the functional data can find homogeneous subgroups of entities for the massive genes within the inherent biological networks. Therefor it can be a useful technique for the analysis of time-course gene expression data. We propose a new self-consistent partitioning method of functional coefficients for individual expression profiles based on the orthonormal basis system. RESULTS: A principal points based functional partitioning method is proposed for time-course gene expression data. The method explores the relationship between genes using Legendre coefficients as principal points to extract the features of gene functions. Our proposed method provides high connectivity in connectedness after clustering for simulated data and finds a significant subsets of genes with the increased connectivity. Our approach has comparative advantages that fewer coefficients are used from the functional data and self-consistency of principal points for partitioning. As real data applications, we are able to find partitioned genes through the gene expressions found in budding yeast data and Escherichia coli data. CONCLUSIONS: The proposed method benefitted from the use of principal points, dimension reduction, and choice of orthogonal basis system as well as provides appropriately connected genes in the resulting subsets. We illustrate our method by applying with each set of cell-cycle-regulated time-course yeast genes and E. coli genes. The proposed method is able to identify highly connected genes and to explore the complex dynamics of biological systems in functional genomics. BioMed Central 2017-10-12 /pmc/articles/PMC5639779/ /pubmed/29025390 http://dx.doi.org/10.1186/s12859-017-1860-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Kim, Jaehee
Kim, Haseong
Partitioning of functional gene expression data using principal points
title Partitioning of functional gene expression data using principal points
title_full Partitioning of functional gene expression data using principal points
title_fullStr Partitioning of functional gene expression data using principal points
title_full_unstemmed Partitioning of functional gene expression data using principal points
title_short Partitioning of functional gene expression data using principal points
title_sort partitioning of functional gene expression data using principal points
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5639779/
https://www.ncbi.nlm.nih.gov/pubmed/29025390
http://dx.doi.org/10.1186/s12859-017-1860-0
work_keys_str_mv AT kimjaehee partitioningoffunctionalgeneexpressiondatausingprincipalpoints
AT kimhaseong partitioningoffunctionalgeneexpressiondatausingprincipalpoints