Cargando…
Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
BACKGROUND: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy s...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3116487/ https://www.ncbi.nlm.nih.gov/pubmed/21554713 http://dx.doi.org/10.1186/1471-2105-12-140 |
_version_ | 1782206252163530752 |
---|---|
author | Bauer, Chris Kleinjung, Frank Smith, Celia J Towers, Mark W Tiss, Ali Chadt, Alexandra Dreja, Tanja Beule, Dieter Al-Hasani, Hadi Reinert, Knut Schuchhardt, Johannes Cramer, Rainer |
author_facet | Bauer, Chris Kleinjung, Frank Smith, Celia J Towers, Mark W Tiss, Ali Chadt, Alexandra Dreja, Tanja Beule, Dieter Al-Hasani, Hadi Reinert, Knut Schuchhardt, Johannes Cramer, Rainer |
author_sort | Bauer, Chris |
collection | PubMed |
description | BACKGROUND: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics. RESULTS: We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods. CONCLUSIONS: The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author. |
format | Online Article Text |
id | pubmed-3116487 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31164872011-06-17 Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset Bauer, Chris Kleinjung, Frank Smith, Celia J Towers, Mark W Tiss, Ali Chadt, Alexandra Dreja, Tanja Beule, Dieter Al-Hasani, Hadi Reinert, Knut Schuchhardt, Johannes Cramer, Rainer BMC Bioinformatics Research Article BACKGROUND: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics. RESULTS: We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods. CONCLUSIONS: The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author. BioMed Central 2011-05-09 /pmc/articles/PMC3116487/ /pubmed/21554713 http://dx.doi.org/10.1186/1471-2105-12-140 Text en Copyright ©2011 Bauer et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Bauer, Chris Kleinjung, Frank Smith, Celia J Towers, Mark W Tiss, Ali Chadt, Alexandra Dreja, Tanja Beule, Dieter Al-Hasani, Hadi Reinert, Knut Schuchhardt, Johannes Cramer, Rainer Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset |
title | Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset |
title_full | Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset |
title_fullStr | Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset |
title_full_unstemmed | Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset |
title_short | Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset |
title_sort | biomarker discovery and redundancy reduction towards classification using a multi-factorial maldi-tof ms t2dm mouse model dataset |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3116487/ https://www.ncbi.nlm.nih.gov/pubmed/21554713 http://dx.doi.org/10.1186/1471-2105-12-140 |
work_keys_str_mv | AT bauerchris biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT kleinjungfrank biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT smithceliaj biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT towersmarkw biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT tissali biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT chadtalexandra biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT drejatanja biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT beuledieter biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT alhasanihadi biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT reinertknut biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT schuchhardtjohannes biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset AT cramerrainer biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset |