A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification

Bibliographic Details
Main authors: Wang, Jing; Webb-Robertson, Bobbie-Jo M.; Matzke, Melissa M.; Varnum, Susan M.; Brown, Joseph N.; Riensche, Roderick M.; Adkins, Joshua N.; Jacobs, Jon M.; Hoidal, John R.; Scholand, Mary Beth; Pounds, Joel G.; Blackburn, Michael R.; Rodland, Karin D.; McDermott, Jason E.
Format: Online Article Text
Language: English
Published: Hindawi Publishing Corporation 2013
Subjects: Research Article
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3809975/
https://www.ncbi.nlm.nih.gov/pubmed/24223463
http://dx.doi.org/10.1155/2013/613529
_version_ 1782288736456802304
author Wang, Jing
Webb-Robertson, Bobbie-Jo M.
Matzke, Melissa M.
Varnum, Susan M.
Brown, Joseph N.
Riensche, Roderick M.
Adkins, Joshua N.
Jacobs, Jon M.
Hoidal, John R.
Scholand, Mary Beth
Pounds, Joel G.
Blackburn, Michael R.
Rodland, Karin D.
McDermott, Jason E.
author_sort Wang, Jing
collection PubMed
description Background. The availability of large, complex data sets generated by high-throughput technologies has enabled the recent proliferation of disease biomarker studies. However, a recurring problem in deriving biological information from large data sets is how best to incorporate expert knowledge into the biomarker selection process. Objective. To develop a generalizable framework that can incorporate expert knowledge into data-driven processes in a semiautomated way while providing a metric for optimization in a biomarker selection scheme. Methods. The framework was implemented as a pipeline consisting of five components for the identification of signatures from integrated clustering (ISIC). Expert knowledge was integrated into the biomarker identification process by combining two distinct approaches: distance-based clustering and expert knowledge-driven functional selection. Results. The utility of the ISIC framework was demonstrated on proteomics data from a study of chronic obstructive pulmonary disease (COPD). Biomarker candidates were identified in a mouse model using ISIC and validated in a study of a human cohort. Conclusions. Expert knowledge can be introduced into a biomarker discovery process in different ways to enhance the robustness of selected marker candidates. Developing strategies for extracting orthogonal and robust features from large data sets increases the chances of success in biomarker identification.
format Online
Article
Text
id pubmed-3809975
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-3809975 2013-11-10 Dis Markers Research Article
Hindawi Publishing Corporation 2013 2013-10-10 /pmc/articles/PMC3809975/ /pubmed/24223463 http://dx.doi.org/10.1155/2013/613529 Text en Copyright © 2013 Jing Wang et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title A Semiautomated Framework for Integrating Expert Knowledge into Disease Marker Identification
topic Research Article