Cargando…
A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data
Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8363843/ https://www.ncbi.nlm.nih.gov/pubmed/34389844 http://dx.doi.org/10.1093/database/baab051 |
_version_ | 1783738419598327808 |
---|---|
author | Sanderson, Lacey-Anne Caron, Carolyn T Tan, Reynold L Bett, Kirstin E |
author_facet | Sanderson, Lacey-Anne Caron, Carolyn T Tan, Reynold L Bett, Kirstin E |
author_sort | Sanderson, Lacey-Anne |
collection | PubMed |
description | Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web portals, uses the GMOD Chado database schema to achieve flexible, ontology-driven storage in PostgreSQL. Tripal also aids research-focused web portals in providing data according to findable, accessible, interoperable, reusable (FAIR) principles. We describe here a fully relational PostgreSQL solution to handle large-scale genotypic and phenotypic data that is implemented as a collection of freely available, open-source modules. These Tripal extension modules provide a holistic approach for importing, storage, display and analysis within a relational database schema. Furthermore, they embody the Tripal approach to FAIR data by providing multiple search tools and ensuring metadata is fully described and interoperable. Our solution focuses on data integrity, as well as optimizing performance to provide a fully functional system that is currently being used in the production of Tripal portals for crop species. We fully describe the implementation of our solution and discuss why a PostgreSQL-powered web portal provides an efficient environment for researcher-driven genotypic and phenotypic data analysis. |
format | Online Article Text |
id | pubmed-8363843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-83638432021-08-17 A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data Sanderson, Lacey-Anne Caron, Carolyn T Tan, Reynold L Bett, Kirstin E Database (Oxford) Original Article Researchers are seeking cost-effective solutions for management and analysis of large-scale genotypic and phenotypic data. Open-source software is uniquely positioned to fill this need through user-focused, crowd-sourced development. Tripal, an open-source toolkit for developing biological data web portals, uses the GMOD Chado database schema to achieve flexible, ontology-driven storage in PostgreSQL. Tripal also aids research-focused web portals in providing data according to findable, accessible, interoperable, reusable (FAIR) principles. We describe here a fully relational PostgreSQL solution to handle large-scale genotypic and phenotypic data that is implemented as a collection of freely available, open-source modules. These Tripal extension modules provide a holistic approach for importing, storage, display and analysis within a relational database schema. Furthermore, they embody the Tripal approach to FAIR data by providing multiple search tools and ensuring metadata is fully described and interoperable. Our solution focuses on data integrity, as well as optimizing performance to provide a fully functional system that is currently being used in the production of Tripal portals for crop species. We fully describe the implementation of our solution and discuss why a PostgreSQL-powered web portal provides an efficient environment for researcher-driven genotypic and phenotypic data analysis. Oxford University Press 2021-08-14 /pmc/articles/PMC8363843/ /pubmed/34389844 http://dx.doi.org/10.1093/database/baab051 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Sanderson, Lacey-Anne Caron, Carolyn T Tan, Reynold L Bett, Kirstin E A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data |
title | A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data |
title_full | A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data |
title_fullStr | A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data |
title_full_unstemmed | A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data |
title_short | A PostgreSQL Tripal solution for large-scale genotypic and phenotypic data |
title_sort | postgresql tripal solution for large-scale genotypic and phenotypic data |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8363843/ https://www.ncbi.nlm.nih.gov/pubmed/34389844 http://dx.doi.org/10.1093/database/baab051 |
work_keys_str_mv | AT sandersonlaceyanne apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT caroncarolynt apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT tanreynoldl apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT bettkirstine apostgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT sandersonlaceyanne postgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT caroncarolynt postgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT tanreynoldl postgresqltripalsolutionforlargescalegenotypicandphenotypicdata AT bettkirstine postgresqltripalsolutionforlargescalegenotypicandphenotypicdata |