Cargando…
EnRICH: Extraction and Ranking using Integration and Criteria Heuristics
BACKGROUND: High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments....
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564850/ https://www.ncbi.nlm.nih.gov/pubmed/23320748 http://dx.doi.org/10.1186/1752-0509-7-4 |
_version_ | 1782258370452914176 |
---|---|
author | Zhang, Xia Greenlee, M Heather West Serb, Jeanne M |
author_facet | Zhang, Xia Greenlee, M Heather West Serb, Jeanne M |
author_sort | Zhang, Xia |
collection | PubMed |
description | BACKGROUND: High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. RESULTS: We developed the java application, EnRICH (Extraction and Ranking using Integration and Criteria Heuristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. CONCLUSIONS: We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets. |
format | Online Article Text |
id | pubmed-3564850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35648502013-02-08 EnRICH: Extraction and Ranking using Integration and Criteria Heuristics Zhang, Xia Greenlee, M Heather West Serb, Jeanne M BMC Syst Biol Software BACKGROUND: High throughput screening technologies enable biologists to generate candidate genes at a rate that, due to time and cost constraints, cannot be studied by experimental approaches in the laboratory. Thus, it has become increasingly important to prioritize candidate genes for experiments. To accomplish this, researchers need to apply selection requirements based on their knowledge, which necessitates qualitative integration of heterogeneous data sources and filtration using multiple criteria. A similar approach can also be applied to putative candidate gene relationships. While automation can assist in this routine and imperative procedure, flexibility of data sources and criteria must not be sacrificed. A tool that can optimize the trade-off between automation and flexibility to simultaneously filter and qualitatively integrate data is needed to prioritize candidate genes and generate composite networks from heterogeneous data sources. RESULTS: We developed the java application, EnRICH (Extraction and Ranking using Integration and Criteria Heuristics), in order to alleviate this need. Here we present a case study in which we used EnRICH to integrate and filter multiple candidate gene lists in order to identify potential retinal disease genes. As a result of this procedure, a candidate pool of several hundred genes was narrowed down to five candidate genes, of which four are confirmed retinal disease genes and one is associated with a retinal disease state. CONCLUSIONS: We developed a platform-independent tool that is able to qualitatively integrate multiple heterogeneous datasets and use different selection criteria to filter each of them, provided the datasets are tables that have distinct identifiers (required) and attributes (optional). With the flexibility to specify data sources and filtering criteria, EnRICH automatically prioritizes candidate genes or gene relationships for biologists based on their specific requirements. Here, we also demonstrate that this tool can be effectively and easily used to apply highly specific user-defined criteria and can efficiently identify high quality candidate genes from relatively sparse datasets. BioMed Central 2013-01-15 /pmc/articles/PMC3564850/ /pubmed/23320748 http://dx.doi.org/10.1186/1752-0509-7-4 Text en Copyright ©2013 Zhang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Zhang, Xia Greenlee, M Heather West Serb, Jeanne M EnRICH: Extraction and Ranking using Integration and Criteria Heuristics |
title | EnRICH: Extraction and Ranking using Integration and Criteria Heuristics |
title_full | EnRICH: Extraction and Ranking using Integration and Criteria Heuristics |
title_fullStr | EnRICH: Extraction and Ranking using Integration and Criteria Heuristics |
title_full_unstemmed | EnRICH: Extraction and Ranking using Integration and Criteria Heuristics |
title_short | EnRICH: Extraction and Ranking using Integration and Criteria Heuristics |
title_sort | enrich: extraction and ranking using integration and criteria heuristics |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3564850/ https://www.ncbi.nlm.nih.gov/pubmed/23320748 http://dx.doi.org/10.1186/1752-0509-7-4 |
work_keys_str_mv | AT zhangxia enrichextractionandrankingusingintegrationandcriteriaheuristics AT greenleemheatherwest enrichextractionandrankingusingintegrationandcriteriaheuristics AT serbjeannem enrichextractionandrankingusingintegrationandcriteriaheuristics |