Cargando…

Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments

BACKGROUND: Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the or...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Tianqing, Lin, Nan, Shi, Ningzhong, Zhang, Baoxue
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2696449/
https://www.ncbi.nlm.nih.gov/pubmed/19445669
http://dx.doi.org/10.1186/1471-2105-10-146
_version_ 1782168267561893888
author Liu, Tianqing
Lin, Nan
Shi, Ningzhong
Zhang, Baoxue
author_facet Liu, Tianqing
Lin, Nan
Shi, Ningzhong
Zhang, Baoxue
author_sort Liu, Tianqing
collection PubMed
description BACKGROUND: Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time. Peddada et al. [1] proposed a clustering algorithm that can incorporate the temporal ordering using order-restricted statistical inference. This algorithm is, however, very time-consuming and hence inapplicable to most microarray experiments that contain a large number of genes. Its computational burden also imposes difficulty to assess the clustering reliability, which is a very important measure when clustering noisy microarray data. RESULTS: We propose a computationally efficient information criterion-based clustering algorithm, called ORICC, that also takes account of the ordering in time-course microarray experiments by embedding the order-restricted inference into a model selection framework. Genes are assigned to the profile which they best match determined by a newly proposed information criterion for order-restricted inference. In addition, we also developed a bootstrap procedure to assess ORICC's clustering reliability for every gene. Simulation studies show that the ORICC method is robust, always gives better clustering accuracy than Peddada's method and saves hundreds of times computational time. Under some scenarios, its accuracy is also better than some other existing clustering methods for short time-course microarray data, such as STEM [2] and Wang et al. [3]. It is also computationally much faster than Wang et al. [3]. CONCLUSION: Our ORICC algorithm, which takes advantage of the temporal ordering in time-course microarray experiments, provides good clustering accuracy and is meanwhile much faster than Peddada's method. Moreover, the clustering reliability for each gene can also be assessed, which is unavailable in Peddada's method. In a real data example, the ORICC algorithm identifies new and interesting genes that previous analyses failed to reveal.
format Text
id pubmed-2696449
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26964492009-06-16 Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments Liu, Tianqing Lin, Nan Shi, Ningzhong Zhang, Baoxue BMC Bioinformatics Methodology Article BACKGROUND: Time-course microarray experiments produce vector gene expression profiles across a series of time points. Clustering genes based on these profiles is important in discovering functional related and co-regulated genes. Early developed clustering algorithms do not take advantage of the ordering in a time-course study, explicit use of which should allow more sensitive detection of genes that display a consistent pattern over time. Peddada et al. [1] proposed a clustering algorithm that can incorporate the temporal ordering using order-restricted statistical inference. This algorithm is, however, very time-consuming and hence inapplicable to most microarray experiments that contain a large number of genes. Its computational burden also imposes difficulty to assess the clustering reliability, which is a very important measure when clustering noisy microarray data. RESULTS: We propose a computationally efficient information criterion-based clustering algorithm, called ORICC, that also takes account of the ordering in time-course microarray experiments by embedding the order-restricted inference into a model selection framework. Genes are assigned to the profile which they best match determined by a newly proposed information criterion for order-restricted inference. In addition, we also developed a bootstrap procedure to assess ORICC's clustering reliability for every gene. Simulation studies show that the ORICC method is robust, always gives better clustering accuracy than Peddada's method and saves hundreds of times computational time. Under some scenarios, its accuracy is also better than some other existing clustering methods for short time-course microarray data, such as STEM [2] and Wang et al. [3]. It is also computationally much faster than Wang et al. [3]. CONCLUSION: Our ORICC algorithm, which takes advantage of the temporal ordering in time-course microarray experiments, provides good clustering accuracy and is meanwhile much faster than Peddada's method. Moreover, the clustering reliability for each gene can also be assessed, which is unavailable in Peddada's method. In a real data example, the ORICC algorithm identifies new and interesting genes that previous analyses failed to reveal. BioMed Central 2009-05-15 /pmc/articles/PMC2696449/ /pubmed/19445669 http://dx.doi.org/10.1186/1471-2105-10-146 Text en Copyright © 2009 Liu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Liu, Tianqing
Lin, Nan
Shi, Ningzhong
Zhang, Baoxue
Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
title Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
title_full Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
title_fullStr Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
title_full_unstemmed Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
title_short Information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
title_sort information criterion-based clustering with order-restricted candidate profiles in short time-course microarray experiments
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2696449/
https://www.ncbi.nlm.nih.gov/pubmed/19445669
http://dx.doi.org/10.1186/1471-2105-10-146
work_keys_str_mv AT liutianqing informationcriterionbasedclusteringwithorderrestrictedcandidateprofilesinshorttimecoursemicroarrayexperiments
AT linnan informationcriterionbasedclusteringwithorderrestrictedcandidateprofilesinshorttimecoursemicroarrayexperiments
AT shiningzhong informationcriterionbasedclusteringwithorderrestrictedcandidateprofilesinshorttimecoursemicroarrayexperiments
AT zhangbaoxue informationcriterionbasedclusteringwithorderrestrictedcandidateprofilesinshorttimecoursemicroarrayexperiments