Cargando…

Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature

Identifying functional groups of genes is a challenging problem for biological applications. Text mining approaches can be used to build hierarchical clusters or trees from the information in the biological literature. In particular, the nonnegative matrix factorization (NMF) is examined as one appr...

Descripción completa

Detalles Bibliográficos
Autores principales: Heinrich, Kevin E., Berry, Michael W., Homayouni, Ramin
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2292806/
https://www.ncbi.nlm.nih.gov/pubmed/18431447
http://dx.doi.org/10.1155/2008/276535
_version_ 1782152528557768704
author Heinrich, Kevin E.
Berry, Michael W.
Homayouni, Ramin
author_facet Heinrich, Kevin E.
Berry, Michael W.
Homayouni, Ramin
author_sort Heinrich, Kevin E.
collection PubMed
description Identifying functional groups of genes is a challenging problem for biological applications. Text mining approaches can be used to build hierarchical clusters or trees from the information in the biological literature. In particular, the nonnegative matrix factorization (NMF) is examined as one approach to label hierarchical trees. A generic labeling algorithm as well as an evaluation technique is proposed, and the effects of different NMF parameters with regard to convergence and labeling accuracy are discussed. The primary goals of this study are to provide a qualitative assessment of the NMF and its various parameters and initialization, to provide an automated way to classify biomedical data, and to provide a method for evaluating labeled data assuming a static input tree. As a byproduct, a method for generating gold standard trees is proposed.
format Text
id pubmed-2292806
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-22928062008-04-22 Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature Heinrich, Kevin E. Berry, Michael W. Homayouni, Ramin Comput Intell Neurosci Research Article Identifying functional groups of genes is a challenging problem for biological applications. Text mining approaches can be used to build hierarchical clusters or trees from the information in the biological literature. In particular, the nonnegative matrix factorization (NMF) is examined as one approach to label hierarchical trees. A generic labeling algorithm as well as an evaluation technique is proposed, and the effects of different NMF parameters with regard to convergence and labeling accuracy are discussed. The primary goals of this study are to provide a qualitative assessment of the NMF and its various parameters and initialization, to provide an automated way to classify biomedical data, and to provide a method for evaluating labeled data assuming a static input tree. As a byproduct, a method for generating gold standard trees is proposed. Hindawi Publishing Corporation 2008 2008-04-09 /pmc/articles/PMC2292806/ /pubmed/18431447 http://dx.doi.org/10.1155/2008/276535 Text en Copyright © 2008 Kevin E. Heinrich et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Heinrich, Kevin E.
Berry, Michael W.
Homayouni, Ramin
Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature
title Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature
title_full Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature
title_fullStr Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature
title_full_unstemmed Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature
title_short Gene Tree Labeling Using Nonnegative Matrix Factorization on Biomedical Literature
title_sort gene tree labeling using nonnegative matrix factorization on biomedical literature
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2292806/
https://www.ncbi.nlm.nih.gov/pubmed/18431447
http://dx.doi.org/10.1155/2008/276535
work_keys_str_mv AT heinrichkevine genetreelabelingusingnonnegativematrixfactorizationonbiomedicalliterature
AT berrymichaelw genetreelabelingusingnonnegativematrixfactorizationonbiomedicalliterature
AT homayouniramin genetreelabelingusingnonnegativematrixfactorizationonbiomedicalliterature