Cargando…

An approach for transgender population information extraction and summarization from clinical trial text

BACKGROUND: Gender information frequently exists in the eligibility criteria of clinical trial text as essential information for participant population recruitment. Particularly, current eligibility criteria text contains the incompleteness and ambiguity issues in expressing transgender population,...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Boyu, Jin, Hao, Yang, Zhiwen, Qu, Yingying, Weng, Heng, Hao, Tianyong
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2019
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454593/ https://www.ncbi.nlm.nih.gov/pubmed/30961595 http://dx.doi.org/10.1186/s12911-019-0768-1

_version_	1783409566713643008
author	Chen, Boyu Jin, Hao Yang, Zhiwen Qu, Yingying Weng, Heng Hao, Tianyong
author_facet	Chen, Boyu Jin, Hao Yang, Zhiwen Qu, Yingying Weng, Heng Hao, Tianyong
author_sort	Chen, Boyu
collection	PubMed
description	BACKGROUND: Gender information frequently exists in the eligibility criteria of clinical trial text as essential information for participant population recruitment. Particularly, current eligibility criteria text contains the incompleteness and ambiguity issues in expressing transgender population, leading to difficulties or even failure of transgender population recruitment in clinical trial studies. METHODS: A new gender model is proposed for providing comprehensive transgender requirement specification. In addition, an automated approach is developed to extract and summarize gender requirements from unstructured text in accordance with the gender model. This approach consists of: 1) the feature extraction module, and 2) the feature summarization module. The first module identifies and extracts gender features using heuristic rules and automatically-generated patterns. The second module summarizes gender requirements by relation inference. RESULTS: Based on 100,134 clinical trials from ClinicalTrials.gov, our approach was compared with 20 commonly applied machine learning methods. It achieved a macro-averaged precision of 0.885, a macro-averaged recall of 0.871 and a macro-averaged F(1)-measure of 0.878. The results illustrated that our approach outperformed all baseline methods in terms of both commonly used metrics and macro-averaged metrics. CONCLUSIONS: This study presented a new gender model aiming for specifying the transgender requirement more precisely. We also proposed an approach for gender information extraction and summarization from unstructured clinical text to enhance transgender-related clinical trial population recruitment. The experiment results demonstrated that the approach was effective in transgender criteria extraction and summarization.
format	Online Article Text
id	pubmed-6454593
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-64545932019-04-19 An approach for transgender population information extraction and summarization from clinical trial text Chen, Boyu Jin, Hao Yang, Zhiwen Qu, Yingying Weng, Heng Hao, Tianyong BMC Med Inform Decis Mak Research BACKGROUND: Gender information frequently exists in the eligibility criteria of clinical trial text as essential information for participant population recruitment. Particularly, current eligibility criteria text contains the incompleteness and ambiguity issues in expressing transgender population, leading to difficulties or even failure of transgender population recruitment in clinical trial studies. METHODS: A new gender model is proposed for providing comprehensive transgender requirement specification. In addition, an automated approach is developed to extract and summarize gender requirements from unstructured text in accordance with the gender model. This approach consists of: 1) the feature extraction module, and 2) the feature summarization module. The first module identifies and extracts gender features using heuristic rules and automatically-generated patterns. The second module summarizes gender requirements by relation inference. RESULTS: Based on 100,134 clinical trials from ClinicalTrials.gov, our approach was compared with 20 commonly applied machine learning methods. It achieved a macro-averaged precision of 0.885, a macro-averaged recall of 0.871 and a macro-averaged F(1)-measure of 0.878. The results illustrated that our approach outperformed all baseline methods in terms of both commonly used metrics and macro-averaged metrics. CONCLUSIONS: This study presented a new gender model aiming for specifying the transgender requirement more precisely. We also proposed an approach for gender information extraction and summarization from unstructured clinical text to enhance transgender-related clinical trial population recruitment. The experiment results demonstrated that the approach was effective in transgender criteria extraction and summarization. BioMed Central 2019-04-09 /pmc/articles/PMC6454593/ /pubmed/30961595 http://dx.doi.org/10.1186/s12911-019-0768-1 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Chen, Boyu Jin, Hao Yang, Zhiwen Qu, Yingying Weng, Heng Hao, Tianyong An approach for transgender population information extraction and summarization from clinical trial text
title	An approach for transgender population information extraction and summarization from clinical trial text
title_full	An approach for transgender population information extraction and summarization from clinical trial text
title_fullStr	An approach for transgender population information extraction and summarization from clinical trial text
title_full_unstemmed	An approach for transgender population information extraction and summarization from clinical trial text
title_short	An approach for transgender population information extraction and summarization from clinical trial text
title_sort	approach for transgender population information extraction and summarization from clinical trial text
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6454593/ https://www.ncbi.nlm.nih.gov/pubmed/30961595 http://dx.doi.org/10.1186/s12911-019-0768-1
work_keys_str_mv	AT chenboyu anapproachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT jinhao anapproachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT yangzhiwen anapproachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT quyingying anapproachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT wengheng anapproachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT haotianyong anapproachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT chenboyu approachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT jinhao approachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT yangzhiwen approachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT quyingying approachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT wengheng approachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext AT haotianyong approachfortransgenderpopulationinformationextractionandsummarizationfromclinicaltrialtext

An approach for transgender population information extraction and summarization from clinical trial text

Ejemplares similares