Cargando…
A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification
Background. The availability of large complex data sets generated by high throughput technologies has enabled the recent proliferation of disease biomarker studies. However, a recurring problem in deriving biological information from large data sets is how to best incorporate expert knowledge into t...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3809975/ https://www.ncbi.nlm.nih.gov/pubmed/24223463 http://dx.doi.org/10.1155/2013/613529 |
_version_ | 1782288736456802304 |
---|---|
author | Wang, Jing Webb-Robertson, Bobbie-Jo M. Matzke, Melissa M. Varnum, Susan M. Brown, Joseph N. Riensche, Roderick M. Adkins, Joshua N. Jacobs, Jon M. Hoidal, John R. Scholand, Mary Beth Pounds, Joel G. Blackburn, Michael R. Rodland, Karin D. McDermott, Jason E. |
author_facet | Wang, Jing Webb-Robertson, Bobbie-Jo M. Matzke, Melissa M. Varnum, Susan M. Brown, Joseph N. Riensche, Roderick M. Adkins, Joshua N. Jacobs, Jon M. Hoidal, John R. Scholand, Mary Beth Pounds, Joel G. Blackburn, Michael R. Rodland, Karin D. McDermott, Jason E. |
author_sort | Wang, Jing |
collection | PubMed |
description | Background. The availability of large complex data sets generated by high throughput technologies has enabled the recent proliferation of disease biomarker studies. However, a recurring problem in deriving biological information from large data sets is how to best incorporate expert knowledge into the biomarker selection process. Objective. To develop a generalizable framework that can incorporate expert knowledge into data-driven processes in a semiautomated way while providing a metric for optimization in a biomarker selection scheme. Methods. The framework was implemented as a pipeline consisting of five components for the identification of signatures from integrated clustering (ISIC). Expert knowledge was integrated into the biomarker identification process using the combination of two distinct approaches; a distance-based clustering approach and an expert knowledge-driven functional selection. Results. The utility of the developed framework ISIC was demonstrated on proteomics data from a study of chronic obstructive pulmonary disease (COPD). Biomarker candidates were identified in a mouse model using ISIC and validated in a study of a human cohort. Conclusions. Expert knowledge can be introduced into a biomarker discovery process in different ways to enhance the robustness of selected marker candidates. Developing strategies for extracting orthogonal and robust features from large data sets increases the chances of success in biomarker identification. |
format | Online Article Text |
id | pubmed-3809975 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-38099752013-11-10 A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification Wang, Jing Webb-Robertson, Bobbie-Jo M. Matzke, Melissa M. Varnum, Susan M. Brown, Joseph N. Riensche, Roderick M. Adkins, Joshua N. Jacobs, Jon M. Hoidal, John R. Scholand, Mary Beth Pounds, Joel G. Blackburn, Michael R. Rodland, Karin D. McDermott, Jason E. Dis Markers Research Article Background. The availability of large complex data sets generated by high throughput technologies has enabled the recent proliferation of disease biomarker studies. However, a recurring problem in deriving biological information from large data sets is how to best incorporate expert knowledge into the biomarker selection process. Objective. To develop a generalizable framework that can incorporate expert knowledge into data-driven processes in a semiautomated way while providing a metric for optimization in a biomarker selection scheme. Methods. The framework was implemented as a pipeline consisting of five components for the identification of signatures from integrated clustering (ISIC). Expert knowledge was integrated into the biomarker identification process using the combination of two distinct approaches; a distance-based clustering approach and an expert knowledge-driven functional selection. Results. The utility of the developed framework ISIC was demonstrated on proteomics data from a study of chronic obstructive pulmonary disease (COPD). Biomarker candidates were identified in a mouse model using ISIC and validated in a study of a human cohort. Conclusions. Expert knowledge can be introduced into a biomarker discovery process in different ways to enhance the robustness of selected marker candidates. Developing strategies for extracting orthogonal and robust features from large data sets increases the chances of success in biomarker identification. Hindawi Publishing Corporation 2013 2013-10-10 /pmc/articles/PMC3809975/ /pubmed/24223463 http://dx.doi.org/10.1155/2013/613529 Text en Copyright © 2013 Jing Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Wang, Jing Webb-Robertson, Bobbie-Jo M. Matzke, Melissa M. Varnum, Susan M. Brown, Joseph N. Riensche, Roderick M. Adkins, Joshua N. Jacobs, Jon M. Hoidal, John R. Scholand, Mary Beth Pounds, Joel G. Blackburn, Michael R. Rodland, Karin D. McDermott, Jason E. A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification |
title | A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification |
title_full | A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification |
title_fullStr | A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification |
title_full_unstemmed | A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification |
title_short | A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification |
title_sort | semiautomated framework for integrating expert knowledge into disease marker identification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3809975/ https://www.ncbi.nlm.nih.gov/pubmed/24223463 http://dx.doi.org/10.1155/2013/613529 |
work_keys_str_mv | AT wangjing asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT webbrobertsonbobbiejom asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT matzkemelissam asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT varnumsusanm asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT brownjosephn asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT rienscheroderickm asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT adkinsjoshuan asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT jacobsjonm asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT hoidaljohnr asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT scholandmarybeth asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT poundsjoelg asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT blackburnmichaelr asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT rodlandkarind asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT mcdermottjasone asemiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT wangjing semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT webbrobertsonbobbiejom semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT matzkemelissam semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT varnumsusanm semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT brownjosephn semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT rienscheroderickm semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT adkinsjoshuan semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT jacobsjonm semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT hoidaljohnr semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT scholandmarybeth semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT poundsjoelg semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT blackburnmichaelr semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT rodlandkarind semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification AT mcdermottjasone semiautomatedframeworkforintegratingexpertknowledgeintodiseasemarkeridentification |