Cargando…
Machine learning to assist risk-of-bias assessments in systematic reviews
Background: Risk-of-bias assessments are now a standard component of systematic reviews. At present, reviewers need to manually identify relevant parts of research articles for a set of methodological elements that affect the risk of bias, in order to make a risk-of-bias judgement for each of these...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4795562/ https://www.ncbi.nlm.nih.gov/pubmed/26659355 http://dx.doi.org/10.1093/ije/dyv306 |
_version_ | 1782421622119989248 |
---|---|
author | Millard, Louise AC Flach, Peter A Higgins, Julian PT |
author_facet | Millard, Louise AC Flach, Peter A Higgins, Julian PT |
author_sort | Millard, Louise AC |
collection | PubMed |
description | Background: Risk-of-bias assessments are now a standard component of systematic reviews. At present, reviewers need to manually identify relevant parts of research articles for a set of methodological elements that affect the risk of bias, in order to make a risk-of-bias judgement for each of these elements. We investigate the use of text mining methods to automate risk-of-bias assessments in systematic reviews. We aim to identify relevant sentences within the text of included articles, to rank articles by risk of bias and to reduce the number of risk-of-bias assessments that the reviewers need to perform by hand. Methods: We use supervised machine learning to train two types of models, for each of the three risk-of-bias properties of sequence generation, allocation concealment and blinding. The first model predicts whether a sentence in a research article contains relevant information. The second model predicts a risk-of-bias value for each research article. We use logistic regression, where each independent variable is the frequency of a word in a sentence or article, respectively. Results: We found that sentences can be successfully ranked by relevance with area under the receiver operating characteristic (ROC) curve (AUC) > 0.98. Articles can be ranked by risk of bias with AUC > 0.72. We estimate that more than 33% of articles can be assessed by just one reviewer, where two reviewers are normally required. Conclusions: We show that text mining can be used to assist risk-of-bias assessments. |
format | Online Article Text |
id | pubmed-4795562 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-47955622016-03-21 Machine learning to assist risk-of-bias assessments in systematic reviews Millard, Louise AC Flach, Peter A Higgins, Julian PT Int J Epidemiol Knowledge Generation Background: Risk-of-bias assessments are now a standard component of systematic reviews. At present, reviewers need to manually identify relevant parts of research articles for a set of methodological elements that affect the risk of bias, in order to make a risk-of-bias judgement for each of these elements. We investigate the use of text mining methods to automate risk-of-bias assessments in systematic reviews. We aim to identify relevant sentences within the text of included articles, to rank articles by risk of bias and to reduce the number of risk-of-bias assessments that the reviewers need to perform by hand. Methods: We use supervised machine learning to train two types of models, for each of the three risk-of-bias properties of sequence generation, allocation concealment and blinding. The first model predicts whether a sentence in a research article contains relevant information. The second model predicts a risk-of-bias value for each research article. We use logistic regression, where each independent variable is the frequency of a word in a sentence or article, respectively. Results: We found that sentences can be successfully ranked by relevance with area under the receiver operating characteristic (ROC) curve (AUC) > 0.98. Articles can be ranked by risk of bias with AUC > 0.72. We estimate that more than 33% of articles can be assessed by just one reviewer, where two reviewers are normally required. Conclusions: We show that text mining can be used to assist risk-of-bias assessments. Oxford University Press 2016-02 2015-12-08 /pmc/articles/PMC4795562/ /pubmed/26659355 http://dx.doi.org/10.1093/ije/dyv306 Text en © The Author 2015. Published by Oxford University Press on behalf of the International Epidemiological Association http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/ ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Knowledge Generation Millard, Louise AC Flach, Peter A Higgins, Julian PT Machine learning to assist risk-of-bias assessments in systematic reviews |
title | Machine learning to assist risk-of-bias assessments in systematic reviews |
title_full | Machine learning to assist risk-of-bias assessments in systematic reviews |
title_fullStr | Machine learning to assist risk-of-bias assessments in systematic reviews |
title_full_unstemmed | Machine learning to assist risk-of-bias assessments in systematic reviews |
title_short | Machine learning to assist risk-of-bias assessments in systematic reviews |
title_sort | machine learning to assist risk-of-bias assessments in systematic reviews |
topic | Knowledge Generation |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4795562/ https://www.ncbi.nlm.nih.gov/pubmed/26659355 http://dx.doi.org/10.1093/ije/dyv306 |
work_keys_str_mv | AT millardlouiseac machinelearningtoassistriskofbiasassessmentsinsystematicreviews AT flachpetera machinelearningtoassistriskofbiasassessmentsinsystematicreviews AT higginsjulianpt machinelearningtoassistriskofbiasassessmentsinsystematicreviews |