Cargando…

The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty

BACKGROUND: The ClinicalTrials.gov registry provides information regarding characteristics of past, current, and planned clinical studies to patients, clinicians, and researchers; in addition, registry data are available for bulk download. However, issues related to data structure, nomenclature, and...

Descripción completa

Detalles Bibliográficos
Autores principales: Tasneem, Asba, Aberle, Laura, Ananth, Hari, Chakraborty, Swati, Chiswell, Karen, McCourt, Brian J., Pietrobon, Ricardo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3306288/
https://www.ncbi.nlm.nih.gov/pubmed/22438982
http://dx.doi.org/10.1371/journal.pone.0033677
_version_ 1782227202995126272
author Tasneem, Asba
Aberle, Laura
Ananth, Hari
Chakraborty, Swati
Chiswell, Karen
McCourt, Brian J.
Pietrobon, Ricardo
author_facet Tasneem, Asba
Aberle, Laura
Ananth, Hari
Chakraborty, Swati
Chiswell, Karen
McCourt, Brian J.
Pietrobon, Ricardo
author_sort Tasneem, Asba
collection PubMed
description BACKGROUND: The ClinicalTrials.gov registry provides information regarding characteristics of past, current, and planned clinical studies to patients, clinicians, and researchers; in addition, registry data are available for bulk download. However, issues related to data structure, nomenclature, and changes in data collection over time present challenges to the aggregate analysis and interpretation of these data in general and to the analysis of trials according to clinical specialty in particular. Improving usability of these data could enhance the utility of ClinicalTrials.gov as a research resource. METHODS/PRINCIPAL RESULTS: The purpose of our project was twofold. First, we sought to extend the usability of ClinicalTrials.gov for research purposes by developing a database for aggregate analysis of ClinicalTrials.gov (AACT) that contains data from the 96,346 clinical trials registered as of September 27, 2010. Second, we developed and validated a methodology for annotating studies by clinical specialty, using a custom taxonomy employing Medical Subject Heading (MeSH) terms applied by an NLM algorithm, as well as MeSH terms and other disease condition terms provided by study sponsors. Clinical specialists reviewed and annotated MeSH and non-MeSH disease condition terms, and an algorithm was created to classify studies into clinical specialties based on both MeSH and non-MeSH annotations. False positives and false negatives were evaluated by comparing algorithmic classification with manual classification for three specialties. CONCLUSIONS/SIGNIFICANCE: The resulting AACT database features study design attributes parsed into discrete fields, integrated metadata, and an integrated MeSH thesaurus, and is available for download as Oracle extracts (.dmp file and text format). This publicly-accessible dataset will facilitate analysis of studies and permit detailed characterization and analysis of the U.S. clinical trials enterprise as a whole. In addition, the methodology we present for creating specialty datasets may facilitate other efforts to analyze studies by specialty groups.
format Online
Article
Text
id pubmed-3306288
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33062882012-03-21 The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty Tasneem, Asba Aberle, Laura Ananth, Hari Chakraborty, Swati Chiswell, Karen McCourt, Brian J. Pietrobon, Ricardo PLoS One Research Article BACKGROUND: The ClinicalTrials.gov registry provides information regarding characteristics of past, current, and planned clinical studies to patients, clinicians, and researchers; in addition, registry data are available for bulk download. However, issues related to data structure, nomenclature, and changes in data collection over time present challenges to the aggregate analysis and interpretation of these data in general and to the analysis of trials according to clinical specialty in particular. Improving usability of these data could enhance the utility of ClinicalTrials.gov as a research resource. METHODS/PRINCIPAL RESULTS: The purpose of our project was twofold. First, we sought to extend the usability of ClinicalTrials.gov for research purposes by developing a database for aggregate analysis of ClinicalTrials.gov (AACT) that contains data from the 96,346 clinical trials registered as of September 27, 2010. Second, we developed and validated a methodology for annotating studies by clinical specialty, using a custom taxonomy employing Medical Subject Heading (MeSH) terms applied by an NLM algorithm, as well as MeSH terms and other disease condition terms provided by study sponsors. Clinical specialists reviewed and annotated MeSH and non-MeSH disease condition terms, and an algorithm was created to classify studies into clinical specialties based on both MeSH and non-MeSH annotations. False positives and false negatives were evaluated by comparing algorithmic classification with manual classification for three specialties. CONCLUSIONS/SIGNIFICANCE: The resulting AACT database features study design attributes parsed into discrete fields, integrated metadata, and an integrated MeSH thesaurus, and is available for download as Oracle extracts (.dmp file and text format). This publicly-accessible dataset will facilitate analysis of studies and permit detailed characterization and analysis of the U.S. clinical trials enterprise as a whole. In addition, the methodology we present for creating specialty datasets may facilitate other efforts to analyze studies by specialty groups. Public Library of Science 2012-03-16 /pmc/articles/PMC3306288/ /pubmed/22438982 http://dx.doi.org/10.1371/journal.pone.0033677 Text en Tasneem et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Tasneem, Asba
Aberle, Laura
Ananth, Hari
Chakraborty, Swati
Chiswell, Karen
McCourt, Brian J.
Pietrobon, Ricardo
The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty
title The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty
title_full The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty
title_fullStr The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty
title_full_unstemmed The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty
title_short The Database for Aggregate Analysis of ClinicalTrials.gov (AACT) and Subsequent Regrouping by Clinical Specialty
title_sort database for aggregate analysis of clinicaltrials.gov (aact) and subsequent regrouping by clinical specialty
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3306288/
https://www.ncbi.nlm.nih.gov/pubmed/22438982
http://dx.doi.org/10.1371/journal.pone.0033677
work_keys_str_mv AT tasneemasba thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT aberlelaura thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT ananthhari thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT chakrabortyswati thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT chiswellkaren thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT mccourtbrianj thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT pietrobonricardo thedatabaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT tasneemasba databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT aberlelaura databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT ananthhari databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT chakrabortyswati databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT chiswellkaren databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT mccourtbrianj databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty
AT pietrobonricardo databaseforaggregateanalysisofclinicaltrialsgovaactandsubsequentregroupingbyclinicalspecialty