Cargando…
Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets
Background: Type 1 diabetes (T1D) is a devastating disease with serious health complications. Early T1D biomarkers that could enable timely detection and prevention before the onset of clinical symptoms are paramount but currently unavailable. Despite their promise, omics approaches have so far fail...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9599756/ https://www.ncbi.nlm.nih.gov/pubmed/36291653 http://dx.doi.org/10.3390/biom12101444 |
_version_ | 1784816671589924864 |
---|---|
author | Alcazar, Oscar Ogihara, Mitsunori Ren, Gang Buchwald, Peter Abdulreda, Midhat H. |
author_facet | Alcazar, Oscar Ogihara, Mitsunori Ren, Gang Buchwald, Peter Abdulreda, Midhat H. |
author_sort | Alcazar, Oscar |
collection | PubMed |
description | Background: Type 1 diabetes (T1D) is a devastating disease with serious health complications. Early T1D biomarkers that could enable timely detection and prevention before the onset of clinical symptoms are paramount but currently unavailable. Despite their promise, omics approaches have so far failed to deliver such biomarkers, likely due to the fragmented nature of information obtained through the single omics approach. We recently demonstrated the utility of parallel multi-omics for the identification of T1D biomarker signatures. Our studies also identified challenges. Methods: Here, we evaluated a novel computational approach of data imputation and amplification as one way to overcome challenges associated with the relatively small number of subjects in these studies. Results: Using proprietary algorithms, we amplified our quadra-omics (proteomics, metabolomics, lipidomics, and transcriptomics) dataset from nine subjects a thousand-fold and analyzed the data using Ingenuity Pathway Analysis (IPA) software to assess the change in its analytical capabilities and biomarker prediction power in the amplified datasets compared to the original. These studies showed the ability to identify an increased number of T1D-relevant pathways and biomarkers in such computationally amplified datasets, especially, at imputation ratios close to the “golden ratio” of 38.2%:61.8%. Specifically, the Canonical Pathway and Diseases and Functions modules identified higher numbers of inflammatory pathways and functions relevant to autoimmune T1D, including novel ones not identified in the original data. The Biomarker Prediction module also predicted in the amplified data several unique biomarker candidates with direct links to T1D pathogenesis. Conclusions: These preliminary findings indicate that such large-scale data imputation and amplification approaches are useful in facilitating the discovery of candidate integrated biomarker signatures of T1D or other diseases by increasing the predictive range of existing data mining tools, especially when the size of the input data is inherently limited. |
format | Online Article Text |
id | pubmed-9599756 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95997562022-10-27 Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets Alcazar, Oscar Ogihara, Mitsunori Ren, Gang Buchwald, Peter Abdulreda, Midhat H. Biomolecules Article Background: Type 1 diabetes (T1D) is a devastating disease with serious health complications. Early T1D biomarkers that could enable timely detection and prevention before the onset of clinical symptoms are paramount but currently unavailable. Despite their promise, omics approaches have so far failed to deliver such biomarkers, likely due to the fragmented nature of information obtained through the single omics approach. We recently demonstrated the utility of parallel multi-omics for the identification of T1D biomarker signatures. Our studies also identified challenges. Methods: Here, we evaluated a novel computational approach of data imputation and amplification as one way to overcome challenges associated with the relatively small number of subjects in these studies. Results: Using proprietary algorithms, we amplified our quadra-omics (proteomics, metabolomics, lipidomics, and transcriptomics) dataset from nine subjects a thousand-fold and analyzed the data using Ingenuity Pathway Analysis (IPA) software to assess the change in its analytical capabilities and biomarker prediction power in the amplified datasets compared to the original. These studies showed the ability to identify an increased number of T1D-relevant pathways and biomarkers in such computationally amplified datasets, especially, at imputation ratios close to the “golden ratio” of 38.2%:61.8%. Specifically, the Canonical Pathway and Diseases and Functions modules identified higher numbers of inflammatory pathways and functions relevant to autoimmune T1D, including novel ones not identified in the original data. The Biomarker Prediction module also predicted in the amplified data several unique biomarker candidates with direct links to T1D pathogenesis. Conclusions: These preliminary findings indicate that such large-scale data imputation and amplification approaches are useful in facilitating the discovery of candidate integrated biomarker signatures of T1D or other diseases by increasing the predictive range of existing data mining tools, especially when the size of the input data is inherently limited. MDPI 2022-10-09 /pmc/articles/PMC9599756/ /pubmed/36291653 http://dx.doi.org/10.3390/biom12101444 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Alcazar, Oscar Ogihara, Mitsunori Ren, Gang Buchwald, Peter Abdulreda, Midhat H. Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets |
title | Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets |
title_full | Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets |
title_fullStr | Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets |
title_full_unstemmed | Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets |
title_short | Exploring Computational Data Amplification and Imputation for the Discovery of Type 1 Diabetes (T1D) Biomarkers from Limited Human Datasets |
title_sort | exploring computational data amplification and imputation for the discovery of type 1 diabetes (t1d) biomarkers from limited human datasets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9599756/ https://www.ncbi.nlm.nih.gov/pubmed/36291653 http://dx.doi.org/10.3390/biom12101444 |
work_keys_str_mv | AT alcazaroscar exploringcomputationaldataamplificationandimputationforthediscoveryoftype1diabetest1dbiomarkersfromlimitedhumandatasets AT ogiharamitsunori exploringcomputationaldataamplificationandimputationforthediscoveryoftype1diabetest1dbiomarkersfromlimitedhumandatasets AT rengang exploringcomputationaldataamplificationandimputationforthediscoveryoftype1diabetest1dbiomarkersfromlimitedhumandatasets AT buchwaldpeter exploringcomputationaldataamplificationandimputationforthediscoveryoftype1diabetest1dbiomarkersfromlimitedhumandatasets AT abdulredamidhath exploringcomputationaldataamplificationandimputationforthediscoveryoftype1diabetest1dbiomarkersfromlimitedhumandatasets |