Cargando…

Artificial Intelligence Research: The Utility and Design of a Relational Database System

Although many researchers talk about a “patient database,” they typically are not referring to a database at all, but instead to a spreadsheet of curated facts about a cohort of patients. This article describes relational database systems and how they differ from spreadsheets. At their core, spreads...

Descripción completa

Detalles Bibliográficos
Autor principal: Dilling, Thomas J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7718502/
https://www.ncbi.nlm.nih.gov/pubmed/33305089
http://dx.doi.org/10.1016/j.adro.2020.06.027
_version_ 1783619506435784704
author Dilling, Thomas J.
author_facet Dilling, Thomas J.
author_sort Dilling, Thomas J.
collection PubMed
description Although many researchers talk about a “patient database,” they typically are not referring to a database at all, but instead to a spreadsheet of curated facts about a cohort of patients. This article describes relational database systems and how they differ from spreadsheets. At their core, spreadsheets are only capable of describing one-to-one (1:1) relationships. However, this article demonstrates that clinical medical data encapsulate numerous one-to-many relationships. Consequently, spreadsheets are very inefficient relative to relational database systems, which gracefully manage such data. Databases provide other advantages, in that the data fields are “typed” (that is, they contain specific kinds of data). This prevents users from entering spurious data during data import. Because each record contains a “key,” it becomes impossible to add duplicate information (ie, add the same patient twice). Databases store data in very efficient ways, minimizing space and memory requirements on the host system. Likewise, databases can be queried or manipulated using a highly complex language called SQL. Consequently, it becomes trivial to cull large amounts of data from a vast number of data fields on very precise subsets of patients. Databases can be quite large (terabytes or more in size), yet still are highly efficient to query. Consequently, with the explosion of data available in electronic health records and other data sources, databases become increasingly important to contain or order these data. Ultimately, this will enable the clinical researcher to perform artificial intelligence analyses across vast amounts of clinical data in a way heretofore impossible. This article provides initial guidance in terms of creating a relational database system.
format Online
Article
Text
id pubmed-7718502
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-77185022020-12-09 Artificial Intelligence Research: The Utility and Design of a Relational Database System Dilling, Thomas J. Adv Radiat Oncol Scientific Article Although many researchers talk about a “patient database,” they typically are not referring to a database at all, but instead to a spreadsheet of curated facts about a cohort of patients. This article describes relational database systems and how they differ from spreadsheets. At their core, spreadsheets are only capable of describing one-to-one (1:1) relationships. However, this article demonstrates that clinical medical data encapsulate numerous one-to-many relationships. Consequently, spreadsheets are very inefficient relative to relational database systems, which gracefully manage such data. Databases provide other advantages, in that the data fields are “typed” (that is, they contain specific kinds of data). This prevents users from entering spurious data during data import. Because each record contains a “key,” it becomes impossible to add duplicate information (ie, add the same patient twice). Databases store data in very efficient ways, minimizing space and memory requirements on the host system. Likewise, databases can be queried or manipulated using a highly complex language called SQL. Consequently, it becomes trivial to cull large amounts of data from a vast number of data fields on very precise subsets of patients. Databases can be quite large (terabytes or more in size), yet still are highly efficient to query. Consequently, with the explosion of data available in electronic health records and other data sources, databases become increasingly important to contain or order these data. Ultimately, this will enable the clinical researcher to perform artificial intelligence analyses across vast amounts of clinical data in a way heretofore impossible. This article provides initial guidance in terms of creating a relational database system. Elsevier 2020-07-13 /pmc/articles/PMC7718502/ /pubmed/33305089 http://dx.doi.org/10.1016/j.adro.2020.06.027 Text en © 2020 Published by Elsevier Inc. on behalf of American Society for Radiation Oncology. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Scientific Article
Dilling, Thomas J.
Artificial Intelligence Research: The Utility and Design of a Relational Database System
title Artificial Intelligence Research: The Utility and Design of a Relational Database System
title_full Artificial Intelligence Research: The Utility and Design of a Relational Database System
title_fullStr Artificial Intelligence Research: The Utility and Design of a Relational Database System
title_full_unstemmed Artificial Intelligence Research: The Utility and Design of a Relational Database System
title_short Artificial Intelligence Research: The Utility and Design of a Relational Database System
title_sort artificial intelligence research: the utility and design of a relational database system
topic Scientific Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7718502/
https://www.ncbi.nlm.nih.gov/pubmed/33305089
http://dx.doi.org/10.1016/j.adro.2020.06.027
work_keys_str_mv AT dillingthomasj artificialintelligenceresearchtheutilityanddesignofarelationaldatabasesystem