Cargando…

A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data

Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web...

Descripción completa

Detalles Bibliográficos
Autores principales: Sanderson, Lacey-Anne, Caron, Carolyn T, Tan, Reynold L, Bett, Kirstin E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8363843/
https://www.ncbi.nlm.nih.gov/pubmed/34389844
http://dx.doi.org/10.1093/database/baab051
_version_ 1783738419598327808
author Sanderson, Lacey-Anne
Caron, Carolyn T
Tan, Reynold L
Bett, Kirstin E
author_facet Sanderson, Lacey-Anne
Caron, Carolyn T
Tan, Reynold L
Bett, Kirstin E
author_sort Sanderson, Lacey-Anne
collection PubMed
description Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web portals, uses the GMOD Chado database schema to achieve flexible, ontology-driven storage in PostgreSQL. Tripal also aids research-focused web portals in providing data according to findable, accessible, interoperable, reusable (FAIR) principles. We describe here a fully relational PostgreSQL solution to handle large-scale genotypic and phenotypic data that is implemented as a collection of freely available, open-source modules. These Tripal extension modules provide a holistic approach for importing, storage, display and analysis within a relational database schema. Furthermore, they embody the Tripal approach to FAIR data by providing multiple search tools and ensuring metadata is fully described and interoperable. Our solution focuses on data integrity, as well as optimizing performance to provide a fully functional system that is currently being used in the production of Tripal portals for crop species. We fully describe the implementation of our solution and discuss why a PostgreSQL-powered web portal provides an efficient environment for researcher-driven genotypic and phenotypic data analysis.
format Online
Article
Text
id pubmed-8363843
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83638432021-08-17 A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data Sanderson, Lacey-Anne Caron, Carolyn T Tan, Reynold L Bett, Kirstin E Database (Oxford) Original Article Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web portals, uses the GMOD Chado database schema to achieve flexible, ontology-driven storage in PostgreSQL. Tripal also aids research-focused web portals in providing data according to findable, accessible, interoperable, reusable (FAIR) principles. We describe here a fully relational PostgreSQL solution to handle large-scale genotypic and phenotypic data that is implemented as a collection of freely available, open-source modules. These Tripal extension modules provide a holistic approach for importing, storage, display and analysis within a relational database schema. Furthermore, they embody the Tripal approach to FAIR data by providing multiple search tools and ensuring metadata is fully described and interoperable. Our solution focuses on data integrity, as well as optimizing performance to provide a fully functional system that is currently being used in the production of Tripal portals for crop species. We fully describe the implementation of our solution and discuss why a PostgreSQL-powered web portal provides an efficient environment for researcher-driven genotypic and phenotypic data analysis. Oxford University Press 2021-08-14 /pmc/articles/PMC8363843/ /pubmed/34389844 http://dx.doi.org/10.1093/database/baab051 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Sanderson, Lacey-Anne
Caron, Carolyn T
Tan, Reynold L
Bett, Kirstin E
A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
title A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
title_full A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
title_fullStr A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
title_full_unstemmed A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
title_short A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
title_sort postgresql tripal solution for large-scale genotypic and phenotypic data
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8363843/
https://www.ncbi.nlm.nih.gov/pubmed/34389844
http://dx.doi.org/10.1093/database/baab051
work_keys_str_mv AT sandersonlaceyanne apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT caroncarolynt apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT tanreynoldl apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT bettkirstine apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT sandersonlaceyanne postgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT caroncarolynt postgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT tanreynoldl postgresqltripalsolutionforlargescalegenotypicandphenotypicdata
AT bettkirstine postgresqltripalsolutionforlargescalegenotypicandphenotypicdata