Cargando…

Incorporation of biological knowledge into distance for clustering genes

In this paper we propose a data based algorithm to marry existing biological knowledge (e.g., functional annotations of genes) with experimental data (gene expression profiles) in creating an overall dissimilarity that can be used with any clustering algorithm that uses a general dissimilarity matri...

Descripción completa

Detalles Bibliográficos
Autores principales: Boratyn, Grzegorz M, Datta, Susmita, Datta, Somnath
Formato: Texto
Lenguaje:English
Publicado: Biomedical Informatics Publishing Group 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1896054/
https://www.ncbi.nlm.nih.gov/pubmed/17597929
_version_ 1782133913671434240
author Boratyn, Grzegorz M
Datta, Susmita
Datta, Somnath
author_facet Boratyn, Grzegorz M
Datta, Susmita
Datta, Somnath
author_sort Boratyn, Grzegorz M
collection PubMed
description In this paper we propose a data based algorithm to marry existing biological knowledge (e.g., functional annotations of genes) with experimental data (gene expression profiles) in creating an overall dissimilarity that can be used with any clustering algorithm that uses a general dissimilarity matrix. We explore this idea with two publicly available gene expression data sets and functional annotations where the results are compared with the clustering results that uses only the experimental data. Although more elaborate evaluations might be called for, the present paper makes a strong case for utilizing existing biological information in the clustering process. AVAILABILITY: Supplement is available at www.somnathdatta.org/Supp/Bioinformation/appendix.pdf
format Text
id pubmed-1896054
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Biomedical Informatics Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-18960542007-06-27 Incorporation of biological knowledge into distance for clustering genes Boratyn, Grzegorz M Datta, Susmita Datta, Somnath Bioinformation Prediction Model In this paper we propose a data based algorithm to marry existing biological knowledge (e.g., functional annotations of genes) with experimental data (gene expression profiles) in creating an overall dissimilarity that can be used with any clustering algorithm that uses a general dissimilarity matrix. We explore this idea with two publicly available gene expression data sets and functional annotations where the results are compared with the clustering results that uses only the experimental data. Although more elaborate evaluations might be called for, the present paper makes a strong case for utilizing existing biological information in the clustering process. AVAILABILITY: Supplement is available at www.somnathdatta.org/Supp/Bioinformation/appendix.pdf Biomedical Informatics Publishing Group 2007-04-10 /pmc/articles/PMC1896054/ /pubmed/17597929 Text en © 2006 Biomedical Informatics Publishing Group This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Prediction Model
Boratyn, Grzegorz M
Datta, Susmita
Datta, Somnath
Incorporation of biological knowledge into distance for clustering genes
title Incorporation of biological knowledge into distance for clustering genes
title_full Incorporation of biological knowledge into distance for clustering genes
title_fullStr Incorporation of biological knowledge into distance for clustering genes
title_full_unstemmed Incorporation of biological knowledge into distance for clustering genes
title_short Incorporation of biological knowledge into distance for clustering genes
title_sort incorporation of biological knowledge into distance for clustering genes
topic Prediction Model
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1896054/
https://www.ncbi.nlm.nih.gov/pubmed/17597929
work_keys_str_mv AT boratyngrzegorzm incorporationofbiologicalknowledgeintodistanceforclusteringgenes
AT dattasusmita incorporationofbiologicalknowledgeintodistanceforclusteringgenes
AT dattasomnath incorporationofbiologicalknowledgeintodistanceforclusteringgenes