Cargando…

Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation

Prioritising candidate genes for further experimental characterisation is an essential, yet challenging task in biomedical research. One way of achieving this goal is to identify specific biological themes that are enriched within the gene set of interest to obtain insights into the biological pheno...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yi-An, Tripathi, Lokesh P., Dessailly, Benoit H., Nyström-Persson, Johan, Ahmad, Shandar, Mizuguchi, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4053319/
https://www.ncbi.nlm.nih.gov/pubmed/24918583
http://dx.doi.org/10.1371/journal.pone.0099030
_version_ 1782320356655104000
author Chen, Yi-An
Tripathi, Lokesh P.
Dessailly, Benoit H.
Nyström-Persson, Johan
Ahmad, Shandar
Mizuguchi, Kenji
author_facet Chen, Yi-An
Tripathi, Lokesh P.
Dessailly, Benoit H.
Nyström-Persson, Johan
Ahmad, Shandar
Mizuguchi, Kenji
author_sort Chen, Yi-An
collection PubMed
description Prioritising candidate genes for further experimental characterisation is an essential, yet challenging task in biomedical research. One way of achieving this goal is to identify specific biological themes that are enriched within the gene set of interest to obtain insights into the biological phenomena under study. Biological pathway data have been particularly useful in identifying functional associations of genes and/or gene sets. However, biological pathway information as compiled in varied repositories often differs in scope and content, preventing a more effective and comprehensive characterisation of gene sets. Here we describe a new approach to constructing biologically coherent gene sets from pathway data in major public repositories and employing them for functional analysis of large gene sets. We first revealed significant overlaps in gene content between different pathways and then defined a clustering method based on the shared gene content and the similarity of gene overlap patterns. We established the biological relevance of the constructed pathway clusters using independent quantitative measures and we finally demonstrated the effectiveness of the constructed pathway clusters in comparative functional enrichment analysis of gene sets associated with diverse human diseases gathered from the literature. The pathway clusters and gene mappings have been integrated into the TargetMine data warehouse and are likely to provide a concise, manageable and biologically relevant means of functional analysis of gene sets and to facilitate candidate gene prioritisation.
format Online
Article
Text
id pubmed-4053319
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-40533192014-06-18 Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation Chen, Yi-An Tripathi, Lokesh P. Dessailly, Benoit H. Nyström-Persson, Johan Ahmad, Shandar Mizuguchi, Kenji PLoS One Research Article Prioritising candidate genes for further experimental characterisation is an essential, yet challenging task in biomedical research. One way of achieving this goal is to identify specific biological themes that are enriched within the gene set of interest to obtain insights into the biological phenomena under study. Biological pathway data have been particularly useful in identifying functional associations of genes and/or gene sets. However, biological pathway information as compiled in varied repositories often differs in scope and content, preventing a more effective and comprehensive characterisation of gene sets. Here we describe a new approach to constructing biologically coherent gene sets from pathway data in major public repositories and employing them for functional analysis of large gene sets. We first revealed significant overlaps in gene content between different pathways and then defined a clustering method based on the shared gene content and the similarity of gene overlap patterns. We established the biological relevance of the constructed pathway clusters using independent quantitative measures and we finally demonstrated the effectiveness of the constructed pathway clusters in comparative functional enrichment analysis of gene sets associated with diverse human diseases gathered from the literature. The pathway clusters and gene mappings have been integrated into the TargetMine data warehouse and are likely to provide a concise, manageable and biologically relevant means of functional analysis of gene sets and to facilitate candidate gene prioritisation. Public Library of Science 2014-06-11 /pmc/articles/PMC4053319/ /pubmed/24918583 http://dx.doi.org/10.1371/journal.pone.0099030 Text en © 2014 Chen et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chen, Yi-An
Tripathi, Lokesh P.
Dessailly, Benoit H.
Nyström-Persson, Johan
Ahmad, Shandar
Mizuguchi, Kenji
Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation
title Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation
title_full Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation
title_fullStr Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation
title_full_unstemmed Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation
title_short Integrated Pathway Clusters with Coherent Biological Themes for Target Prioritisation
title_sort integrated pathway clusters with coherent biological themes for target prioritisation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4053319/
https://www.ncbi.nlm.nih.gov/pubmed/24918583
http://dx.doi.org/10.1371/journal.pone.0099030
work_keys_str_mv AT chenyian integratedpathwayclusterswithcoherentbiologicalthemesfortargetprioritisation
AT tripathilokeshp integratedpathwayclusterswithcoherentbiologicalthemesfortargetprioritisation
AT dessaillybenoith integratedpathwayclusterswithcoherentbiologicalthemesfortargetprioritisation
AT nystromperssonjohan integratedpathwayclusterswithcoherentbiologicalthemesfortargetprioritisation
AT ahmadshandar integratedpathwayclusterswithcoherentbiologicalthemesfortargetprioritisation
AT mizuguchikenji integratedpathwayclusterswithcoherentbiologicalthemesfortargetprioritisation