Cargando…

Identification of risk factors in epidemiologic study based on ROC curve and network

This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high...

Descripción completa

Detalles Bibliográficos
Autores principales: Jin, Jiao, Zhou, Shixin, Xu, Qiujin, An, Jinbing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5402390/
https://www.ncbi.nlm.nih.gov/pubmed/28436477
http://dx.doi.org/10.1038/srep46655
_version_ 1783231224223891456
author Jin, Jiao
Zhou, Shixin
Xu, Qiujin
An, Jinbing
author_facet Jin, Jiao
Zhou, Shixin
Xu, Qiujin
An, Jinbing
author_sort Jin, Jiao
collection PubMed
description This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high or low disease incidence, we estimate Receptor Operating Characteristic (ROC) curve of each risk factor. Then, through the difference between ROC curve of each factor and diagonal, we evaluate and screen for the important risk factors. In addition, based on the difference of ROC curves corresponding to any pair of factors, we define a new type of correlation matrix to measure their correlations with disease, and then use this matrix as adjacency matrix to construct a network as a visualization tool for exploring the structure among factors, which can be used to direct further studies. Finally, these methods are applied to analysis on water pollutants and gastrointestinal tumor, and analysis on gene expression data in tumor and normal colon tissue samples.
format Online
Article
Text
id pubmed-5402390
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-54023902017-04-26 Identification of risk factors in epidemiologic study based on ROC curve and network Jin, Jiao Zhou, Shixin Xu, Qiujin An, Jinbing Sci Rep Article This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high or low disease incidence, we estimate Receptor Operating Characteristic (ROC) curve of each risk factor. Then, through the difference between ROC curve of each factor and diagonal, we evaluate and screen for the important risk factors. In addition, based on the difference of ROC curves corresponding to any pair of factors, we define a new type of correlation matrix to measure their correlations with disease, and then use this matrix as adjacency matrix to construct a network as a visualization tool for exploring the structure among factors, which can be used to direct further studies. Finally, these methods are applied to analysis on water pollutants and gastrointestinal tumor, and analysis on gene expression data in tumor and normal colon tissue samples. Nature Publishing Group 2017-04-24 /pmc/articles/PMC5402390/ /pubmed/28436477 http://dx.doi.org/10.1038/srep46655 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Jin, Jiao
Zhou, Shixin
Xu, Qiujin
An, Jinbing
Identification of risk factors in epidemiologic study based on ROC curve and network
title Identification of risk factors in epidemiologic study based on ROC curve and network
title_full Identification of risk factors in epidemiologic study based on ROC curve and network
title_fullStr Identification of risk factors in epidemiologic study based on ROC curve and network
title_full_unstemmed Identification of risk factors in epidemiologic study based on ROC curve and network
title_short Identification of risk factors in epidemiologic study based on ROC curve and network
title_sort identification of risk factors in epidemiologic study based on roc curve and network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5402390/
https://www.ncbi.nlm.nih.gov/pubmed/28436477
http://dx.doi.org/10.1038/srep46655
work_keys_str_mv AT jinjiao identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork
AT zhoushixin identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork
AT xuqiujin identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork
AT anjinbing identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork