Cargando…
Artificial Intelligence Research: The Utility and Design of a Relational Database System
Although many researchers talk about a “patient database,” they typically are not referring to a database at all, but instead to a spreadsheet of curated facts about a cohort of patients. This article describes relational database systems and how they differ from spreadsheets. At their core, spreads...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7718502/ https://www.ncbi.nlm.nih.gov/pubmed/33305089 http://dx.doi.org/10.1016/j.adro.2020.06.027 |
_version_ | 1783619506435784704 |
---|---|
author | Dilling, Thomas J. |
author_facet | Dilling, Thomas J. |
author_sort | Dilling, Thomas J. |
collection | PubMed |
description | Although many researchers talk about a “patient database,” they typically are not referring to a database at all, but instead to a spreadsheet of curated facts about a cohort of patients. This article describes relational database systems and how they differ from spreadsheets. At their core, spreadsheets are only capable of describing one-to-one (1:1) relationships. However, this article demonstrates that clinical medical data encapsulate numerous one-to-many relationships. Consequently, spreadsheets are very inefficient relative to relational database systems, which gracefully manage such data. Databases provide other advantages, in that the data fields are “typed” (that is, they contain specific kinds of data). This prevents users from entering spurious data during data import. Because each record contains a “key,” it becomes impossible to add duplicate information (ie, add the same patient twice). Databases store data in very efficient ways, minimizing space and memory requirements on the host system. Likewise, databases can be queried or manipulated using a highly complex language called SQL. Consequently, it becomes trivial to cull large amounts of data from a vast number of data fields on very precise subsets of patients. Databases can be quite large (terabytes or more in size), yet still are highly efficient to query. Consequently, with the explosion of data available in electronic health records and other data sources, databases become increasingly important to contain or order these data. Ultimately, this will enable the clinical researcher to perform artificial intelligence analyses across vast amounts of clinical data in a way heretofore impossible. This article provides initial guidance in terms of creating a relational database system. |
format | Online Article Text |
id | pubmed-7718502 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-77185022020-12-09 Artificial Intelligence Research: The Utility and Design of a Relational Database System Dilling, Thomas J. Adv Radiat Oncol Scientific Article Although many researchers talk about a “patient database,” they typically are not referring to a database at all, but instead to a spreadsheet of curated facts about a cohort of patients. This article describes relational database systems and how they differ from spreadsheets. At their core, spreadsheets are only capable of describing one-to-one (1:1) relationships. However, this article demonstrates that clinical medical data encapsulate numerous one-to-many relationships. Consequently, spreadsheets are very inefficient relative to relational database systems, which gracefully manage such data. Databases provide other advantages, in that the data fields are “typed” (that is, they contain specific kinds of data). This prevents users from entering spurious data during data import. Because each record contains a “key,” it becomes impossible to add duplicate information (ie, add the same patient twice). Databases store data in very efficient ways, minimizing space and memory requirements on the host system. Likewise, databases can be queried or manipulated using a highly complex language called SQL. Consequently, it becomes trivial to cull large amounts of data from a vast number of data fields on very precise subsets of patients. Databases can be quite large (terabytes or more in size), yet still are highly efficient to query. Consequently, with the explosion of data available in electronic health records and other data sources, databases become increasingly important to contain or order these data. Ultimately, this will enable the clinical researcher to perform artificial intelligence analyses across vast amounts of clinical data in a way heretofore impossible. This article provides initial guidance in terms of creating a relational database system. Elsevier 2020-07-13 /pmc/articles/PMC7718502/ /pubmed/33305089 http://dx.doi.org/10.1016/j.adro.2020.06.027 Text en © 2020 Published by Elsevier Inc. on behalf of American Society for Radiation Oncology. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Scientific Article Dilling, Thomas J. Artificial Intelligence Research: The Utility and Design of a Relational Database System |
title | Artificial Intelligence Research: The Utility and Design of a Relational Database System |
title_full | Artificial Intelligence Research: The Utility and Design of a Relational Database System |
title_fullStr | Artificial Intelligence Research: The Utility and Design of a Relational Database System |
title_full_unstemmed | Artificial Intelligence Research: The Utility and Design of a Relational Database System |
title_short | Artificial Intelligence Research: The Utility and Design of a Relational Database System |
title_sort | artificial intelligence research: the utility and design of a relational database system |
topic | Scientific Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7718502/ https://www.ncbi.nlm.nih.gov/pubmed/33305089 http://dx.doi.org/10.1016/j.adro.2020.06.027 |
work_keys_str_mv | AT dillingthomasj artificialintelligenceresearchtheutilityanddesignofarelationaldatabasesystem |