Cargando…

Incorporation of biological knowledge into distance for clustering genes

In this paper we propose a data based algorithm to marry existing biological knowledge (e.g., functional annotations of genes) with experimental data (gene expression profiles) in creating an overall dissimilarity that can be used with any clustering algorithm that uses a general dissimilarity matri...

Descripción completa

Detalles Bibliográficos
Autores principales: Boratyn, Grzegorz M, Datta, Susmita, Datta, Somnath
Formato: Texto
Lenguaje:English
Publicado: Biomedical Informatics Publishing Group 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1896054/
https://www.ncbi.nlm.nih.gov/pubmed/17597929
Descripción
Sumario:In this paper we propose a data based algorithm to marry existing biological knowledge (e.g., functional annotations of genes) with experimental data (gene expression profiles) in creating an overall dissimilarity that can be used with any clustering algorithm that uses a general dissimilarity matrix. We explore this idea with two publicly available gene expression data sets and functional annotations where the results are compared with the clustering results that uses only the experimental data. Although more elaborate evaluations might be called for, the present paper makes a strong case for utilizing existing biological information in the clustering process. AVAILABILITY: Supplement is available at www.somnathdatta.org/Supp/Bioinformation/appendix.pdf