Cargando…
Identification of risk factors in epidemiologic study based on ROC curve and network
This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5402390/ https://www.ncbi.nlm.nih.gov/pubmed/28436477 http://dx.doi.org/10.1038/srep46655 |
_version_ | 1783231224223891456 |
---|---|
author | Jin, Jiao Zhou, Shixin Xu, Qiujin An, Jinbing |
author_facet | Jin, Jiao Zhou, Shixin Xu, Qiujin An, Jinbing |
author_sort | Jin, Jiao |
collection | PubMed |
description | This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high or low disease incidence, we estimate Receptor Operating Characteristic (ROC) curve of each risk factor. Then, through the difference between ROC curve of each factor and diagonal, we evaluate and screen for the important risk factors. In addition, based on the difference of ROC curves corresponding to any pair of factors, we define a new type of correlation matrix to measure their correlations with disease, and then use this matrix as adjacency matrix to construct a network as a visualization tool for exploring the structure among factors, which can be used to direct further studies. Finally, these methods are applied to analysis on water pollutants and gastrointestinal tumor, and analysis on gene expression data in tumor and normal colon tissue samples. |
format | Online Article Text |
id | pubmed-5402390 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-54023902017-04-26 Identification of risk factors in epidemiologic study based on ROC curve and network Jin, Jiao Zhou, Shixin Xu, Qiujin An, Jinbing Sci Rep Article This article proposes a new non-parametric approach for identification of risk factors and their correlations in epidemiologic study, in which investigation data may have high variations because of individual differences or correlated risk factors. First, based on classification information of high or low disease incidence, we estimate Receptor Operating Characteristic (ROC) curve of each risk factor. Then, through the difference between ROC curve of each factor and diagonal, we evaluate and screen for the important risk factors. In addition, based on the difference of ROC curves corresponding to any pair of factors, we define a new type of correlation matrix to measure their correlations with disease, and then use this matrix as adjacency matrix to construct a network as a visualization tool for exploring the structure among factors, which can be used to direct further studies. Finally, these methods are applied to analysis on water pollutants and gastrointestinal tumor, and analysis on gene expression data in tumor and normal colon tissue samples. Nature Publishing Group 2017-04-24 /pmc/articles/PMC5402390/ /pubmed/28436477 http://dx.doi.org/10.1038/srep46655 Text en Copyright © 2017, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Jin, Jiao Zhou, Shixin Xu, Qiujin An, Jinbing Identification of risk factors in epidemiologic study based on ROC curve and network |
title | Identification of risk factors in epidemiologic study based on ROC curve and network |
title_full | Identification of risk factors in epidemiologic study based on ROC curve and network |
title_fullStr | Identification of risk factors in epidemiologic study based on ROC curve and network |
title_full_unstemmed | Identification of risk factors in epidemiologic study based on ROC curve and network |
title_short | Identification of risk factors in epidemiologic study based on ROC curve and network |
title_sort | identification of risk factors in epidemiologic study based on roc curve and network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5402390/ https://www.ncbi.nlm.nih.gov/pubmed/28436477 http://dx.doi.org/10.1038/srep46655 |
work_keys_str_mv | AT jinjiao identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork AT zhoushixin identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork AT xuqiujin identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork AT anjinbing identificationofriskfactorsinepidemiologicstudybasedonroccurveandnetwork |