Cargando…

Theme discovery from gene lists for identification and viewing of multiple functional groups

BACKGROUND: High throughput methods of the genome era produce vast amounts of data in the form of gene lists. These lists are large and difficult to interpret without advanced computational or bioinformatic tools. Most existing methods analyse a gene list as a single entity although it is comprised...

Descripción completa

Detalles Bibliográficos
Autores principales: Pehkonen, Petri, Wong, Garry, Törönen, Petri
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1190153/
https://www.ncbi.nlm.nih.gov/pubmed/15987504
http://dx.doi.org/10.1186/1471-2105-6-162
_version_ 1782124792603738112
author Pehkonen, Petri
Wong, Garry
Törönen, Petri
author_facet Pehkonen, Petri
Wong, Garry
Törönen, Petri
author_sort Pehkonen, Petri
collection PubMed
description BACKGROUND: High throughput methods of the genome era produce vast amounts of data in the form of gene lists. These lists are large and difficult to interpret without advanced computational or bioinformatic tools. Most existing methods analyse a gene list as a single entity although it is comprised of multiple gene groups associated with separate biological functions. Therefore it is imperative to define and visualize gene groups with unique functionality within gene lists. RESULTS: In order to analyse the functional heterogeneity within a gene list, we have developed a method that clusters genes to groups with homogenous functionalities. The method uses Non-negative Matrix Factorization (NMF) to create several clustering results with varying numbers of clusters. The obtained clustering results are combined into a simple graphical presentation showing the functional groups over-represented in the analyzed gene list. We demonstrate its performance on two data sets and show results that improve upon existing methods. The comparison also shows that our method creates a more simplified view that aids in discovery of biological themes within the list and discards less informative classes from the results. CONCLUSION: The presented method and associated software are useful for the identification and interpretation of biological functions associated with gene lists and are especially useful for the analysis of large lists.
format Text
id pubmed-1190153
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-11901532005-08-25 Theme discovery from gene lists for identification and viewing of multiple functional groups Pehkonen, Petri Wong, Garry Törönen, Petri BMC Bioinformatics Methodology Article BACKGROUND: High throughput methods of the genome era produce vast amounts of data in the form of gene lists. These lists are large and difficult to interpret without advanced computational or bioinformatic tools. Most existing methods analyse a gene list as a single entity although it is comprised of multiple gene groups associated with separate biological functions. Therefore it is imperative to define and visualize gene groups with unique functionality within gene lists. RESULTS: In order to analyse the functional heterogeneity within a gene list, we have developed a method that clusters genes to groups with homogenous functionalities. The method uses Non-negative Matrix Factorization (NMF) to create several clustering results with varying numbers of clusters. The obtained clustering results are combined into a simple graphical presentation showing the functional groups over-represented in the analyzed gene list. We demonstrate its performance on two data sets and show results that improve upon existing methods. The comparison also shows that our method creates a more simplified view that aids in discovery of biological themes within the list and discards less informative classes from the results. CONCLUSION: The presented method and associated software are useful for the identification and interpretation of biological functions associated with gene lists and are especially useful for the analysis of large lists. BioMed Central 2005-06-29 /pmc/articles/PMC1190153/ /pubmed/15987504 http://dx.doi.org/10.1186/1471-2105-6-162 Text en Copyright © 2005 Pehkonen et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Pehkonen, Petri
Wong, Garry
Törönen, Petri
Theme discovery from gene lists for identification and viewing of multiple functional groups
title Theme discovery from gene lists for identification and viewing of multiple functional groups
title_full Theme discovery from gene lists for identification and viewing of multiple functional groups
title_fullStr Theme discovery from gene lists for identification and viewing of multiple functional groups
title_full_unstemmed Theme discovery from gene lists for identification and viewing of multiple functional groups
title_short Theme discovery from gene lists for identification and viewing of multiple functional groups
title_sort theme discovery from gene lists for identification and viewing of multiple functional groups
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1190153/
https://www.ncbi.nlm.nih.gov/pubmed/15987504
http://dx.doi.org/10.1186/1471-2105-6-162
work_keys_str_mv AT pehkonenpetri themediscoveryfromgenelistsforidentificationandviewingofmultiplefunctionalgroups
AT wonggarry themediscoveryfromgenelistsforidentificationandviewingofmultiplefunctionalgroups
AT toronenpetri themediscoveryfromgenelistsforidentificationandviewingofmultiplefunctionalgroups