Cargando…

Biomarker discovery in inflammatory bowel diseases using network-based feature selection

Reliable identification of Inflammatory biomarkers from metagenomics data is a promising direction for developing non-invasive, cost-effective, and rapid clinical tests for early diagnosis of IBD. We present an integrative approach to Network-Based Biomarker Discovery (NBBD) which integrates network...

Descripción completa

Detalles Bibliográficos
Autores principales: Abbas, Mostafa, Matta, John, Le, Thanh, Bensmail, Halima, Obafemi-Ajayi, Tayo, Honavar, Vasant, EL-Manzalawy, Yasser
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874333/
https://www.ncbi.nlm.nih.gov/pubmed/31756219
http://dx.doi.org/10.1371/journal.pone.0225382
_version_ 1783472819783335936
author Abbas, Mostafa
Matta, John
Le, Thanh
Bensmail, Halima
Obafemi-Ajayi, Tayo
Honavar, Vasant
EL-Manzalawy, Yasser
author_facet Abbas, Mostafa
Matta, John
Le, Thanh
Bensmail, Halima
Obafemi-Ajayi, Tayo
Honavar, Vasant
EL-Manzalawy, Yasser
author_sort Abbas, Mostafa
collection PubMed
description Reliable identification of Inflammatory biomarkers from metagenomics data is a promising direction for developing non-invasive, cost-effective, and rapid clinical tests for early diagnosis of IBD. We present an integrative approach to Network-Based Biomarker Discovery (NBBD) which integrates network analyses methods for prioritizing potential biomarkers and machine learning techniques for assessing the discriminative power of the prioritized biomarkers. Using a large dataset of new-onset pediatric IBD metagenomics biopsy samples, we compare the performance of Random Forest (RF) classifiers trained on features selected using a representative set of traditional feature selection methods against NBBD framework, configured using five different tools for inferring networks from metagenomics data, and nine different methods for prioritizing biomarkers as well as a hybrid approach combining best traditional and NBBD based feature selection. We also examine how the performance of the predictive models for IBD diagnosis varies as a function of the size of the data used for biomarker identification. Our results show that (i) NBBD is competitive with some of the state-of-the-art feature selection methods including Random Forest Feature Importance (RFFI) scores; and (ii) NBBD is especially effective in reliably identifying IBD biomarkers when the number of data samples available for biomarker discovery is small.
format Online
Article
Text
id pubmed-6874333
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-68743332019-12-06 Biomarker discovery in inflammatory bowel diseases using network-based feature selection Abbas, Mostafa Matta, John Le, Thanh Bensmail, Halima Obafemi-Ajayi, Tayo Honavar, Vasant EL-Manzalawy, Yasser PLoS One Research Article Reliable identification of Inflammatory biomarkers from metagenomics data is a promising direction for developing non-invasive, cost-effective, and rapid clinical tests for early diagnosis of IBD. We present an integrative approach to Network-Based Biomarker Discovery (NBBD) which integrates network analyses methods for prioritizing potential biomarkers and machine learning techniques for assessing the discriminative power of the prioritized biomarkers. Using a large dataset of new-onset pediatric IBD metagenomics biopsy samples, we compare the performance of Random Forest (RF) classifiers trained on features selected using a representative set of traditional feature selection methods against NBBD framework, configured using five different tools for inferring networks from metagenomics data, and nine different methods for prioritizing biomarkers as well as a hybrid approach combining best traditional and NBBD based feature selection. We also examine how the performance of the predictive models for IBD diagnosis varies as a function of the size of the data used for biomarker identification. Our results show that (i) NBBD is competitive with some of the state-of-the-art feature selection methods including Random Forest Feature Importance (RFFI) scores; and (ii) NBBD is especially effective in reliably identifying IBD biomarkers when the number of data samples available for biomarker discovery is small. Public Library of Science 2019-11-22 /pmc/articles/PMC6874333/ /pubmed/31756219 http://dx.doi.org/10.1371/journal.pone.0225382 Text en © 2019 Abbas et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Abbas, Mostafa
Matta, John
Le, Thanh
Bensmail, Halima
Obafemi-Ajayi, Tayo
Honavar, Vasant
EL-Manzalawy, Yasser
Biomarker discovery in inflammatory bowel diseases using network-based feature selection
title Biomarker discovery in inflammatory bowel diseases using network-based feature selection
title_full Biomarker discovery in inflammatory bowel diseases using network-based feature selection
title_fullStr Biomarker discovery in inflammatory bowel diseases using network-based feature selection
title_full_unstemmed Biomarker discovery in inflammatory bowel diseases using network-based feature selection
title_short Biomarker discovery in inflammatory bowel diseases using network-based feature selection
title_sort biomarker discovery in inflammatory bowel diseases using network-based feature selection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6874333/
https://www.ncbi.nlm.nih.gov/pubmed/31756219
http://dx.doi.org/10.1371/journal.pone.0225382
work_keys_str_mv AT abbasmostafa biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection
AT mattajohn biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection
AT lethanh biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection
AT bensmailhalima biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection
AT obafemiajayitayo biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection
AT honavarvasant biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection
AT elmanzalawyyasser biomarkerdiscoveryininflammatoryboweldiseasesusingnetworkbasedfeatureselection