Cargando…

Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset

BACKGROUND: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy s...

Descripción completa

Detalles Bibliográficos
Autores principales: Bauer, Chris, Kleinjung, Frank, Smith, Celia J, Towers, Mark W, Tiss, Ali, Chadt, Alexandra, Dreja, Tanja, Beule, Dieter, Al-Hasani, Hadi, Reinert, Knut, Schuchhardt, Johannes, Cramer, Rainer
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3116487/
https://www.ncbi.nlm.nih.gov/pubmed/21554713
http://dx.doi.org/10.1186/1471-2105-12-140
_version_ 1782206252163530752
author Bauer, Chris
Kleinjung, Frank
Smith, Celia J
Towers, Mark W
Tiss, Ali
Chadt, Alexandra
Dreja, Tanja
Beule, Dieter
Al-Hasani, Hadi
Reinert, Knut
Schuchhardt, Johannes
Cramer, Rainer
author_facet Bauer, Chris
Kleinjung, Frank
Smith, Celia J
Towers, Mark W
Tiss, Ali
Chadt, Alexandra
Dreja, Tanja
Beule, Dieter
Al-Hasani, Hadi
Reinert, Knut
Schuchhardt, Johannes
Cramer, Rainer
author_sort Bauer, Chris
collection PubMed
description BACKGROUND: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics. RESULTS: We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods. CONCLUSIONS: The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author.
format Online
Article
Text
id pubmed-3116487
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31164872011-06-17 Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset Bauer, Chris Kleinjung, Frank Smith, Celia J Towers, Mark W Tiss, Ali Chadt, Alexandra Dreja, Tanja Beule, Dieter Al-Hasani, Hadi Reinert, Knut Schuchhardt, Johannes Cramer, Rainer BMC Bioinformatics Research Article BACKGROUND: Diabetes like many diseases and biological processes is not mono-causal. On the one hand multi-factorial studies with complex experimental design are required for its comprehensive analysis. On the other hand, the data from these studies often include a substantial amount of redundancy such as proteins that are typically represented by a multitude of peptides. Coping simultaneously with both complexities (experimental and technological) makes data analysis a challenge for Bioinformatics. RESULTS: We present a comprehensive work-flow tailored for analyzing complex data including data from multi-factorial studies. The developed approach aims at revealing effects caused by a distinct combination of experimental factors, in our case genotype and diet. Applying the developed work-flow to the analysis of an established polygenic mouse model for diet-induced type 2 diabetes, we found peptides with significant fold changes exclusively for the combination of a particular strain and diet. Exploitation of redundancy enables the visualization of peptide correlation and provides a natural way of feature selection for classification and prediction. Classification based on the features selected using our approach performs similar to classifications based on more complex feature selection methods. CONCLUSIONS: The combination of ANOVA and redundancy exploitation allows for identification of biomarker candidates in multi-dimensional MALDI-TOF MS profiling studies with complex experimental design. With respect to feature selection our method provides a fast and intuitive alternative to global optimization strategies with comparable performance. The method is implemented in R and the scripts are available by contacting the corresponding author. BioMed Central 2011-05-09 /pmc/articles/PMC3116487/ /pubmed/21554713 http://dx.doi.org/10.1186/1471-2105-12-140 Text en Copyright ©2011 Bauer et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Bauer, Chris
Kleinjung, Frank
Smith, Celia J
Towers, Mark W
Tiss, Ali
Chadt, Alexandra
Dreja, Tanja
Beule, Dieter
Al-Hasani, Hadi
Reinert, Knut
Schuchhardt, Johannes
Cramer, Rainer
Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_full Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_fullStr Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_full_unstemmed Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_short Biomarker Discovery and Redundancy Reduction towards Classification using a Multi-factorial MALDI-TOF MS T2DM Mouse Model Dataset
title_sort biomarker discovery and redundancy reduction towards classification using a multi-factorial maldi-tof ms t2dm mouse model dataset
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3116487/
https://www.ncbi.nlm.nih.gov/pubmed/21554713
http://dx.doi.org/10.1186/1471-2105-12-140
work_keys_str_mv AT bauerchris biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT kleinjungfrank biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT smithceliaj biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT towersmarkw biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT tissali biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT chadtalexandra biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT drejatanja biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT beuledieter biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT alhasanihadi biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT reinertknut biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT schuchhardtjohannes biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset
AT cramerrainer biomarkerdiscoveryandredundancyreductiontowardsclassificationusingamultifactorialmalditofmst2dmmousemodeldataset