Cargando…

Curating gene sets: challenges and opportunities for integrative analysis

Genomic data interpretation often requires analyses that move from a gene-by-gene focus to a focus on sets of genes that are associated with biological phenomena such as molecular processes, phenotypes, diseases, drug interactions or environmental conditions. Unique challenges exist in the curation...

Descripción completa

Detalles Bibliográficos
Autores principales: Bubier, Jason, Hill, David, Mukherjee, Gaurab, Reynolds, Timothy, Baker, Erich J, Berger, Alexander, Emerson, Jake, Blake, Judith A, Chesler, Elissa J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6424415/
https://www.ncbi.nlm.nih.gov/pubmed/30888410
http://dx.doi.org/10.1093/database/baz036
_version_ 1783404676337631232
author Bubier, Jason
Hill, David
Mukherjee, Gaurab
Reynolds, Timothy
Baker, Erich J
Berger, Alexander
Emerson, Jake
Blake, Judith A
Chesler, Elissa J
author_facet Bubier, Jason
Hill, David
Mukherjee, Gaurab
Reynolds, Timothy
Baker, Erich J
Berger, Alexander
Emerson, Jake
Blake, Judith A
Chesler, Elissa J
author_sort Bubier, Jason
collection PubMed
description Genomic data interpretation often requires analyses that move from a gene-by-gene focus to a focus on sets of genes that are associated with biological phenomena such as molecular processes, phenotypes, diseases, drug interactions or environmental conditions. Unique challenges exist in the curation of gene sets beyond the challenges in curation of individual genes. Here we highlight a literature curation workflow whereby gene sets are curated from peer-reviewed published data into GeneWeaver (GW), a data repository and analysis platform. We describe the system features that allow for a flexible yet precise curation procedure. We illustrate the value of curation by gene sets through analysis of independently curated sets that relate to the integrated stress response, showing that sets curated from independent sources all share significant Jaccard similarity. A suite of reproducible analysis tools is provided in GW as services to carry out interactive functional investigation of user-submitted gene sets within the context of over 150 000 gene sets constructed from publicly available resources and published gene lists. A curation interface supports the ability of users to design and maintain curation workflows of gene sets, including assigning, reviewing and releasing gene sets within a curation project context.
format Online
Article
Text
id pubmed-6424415
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-64244152019-03-22 Curating gene sets: challenges and opportunities for integrative analysis Bubier, Jason Hill, David Mukherjee, Gaurab Reynolds, Timothy Baker, Erich J Berger, Alexander Emerson, Jake Blake, Judith A Chesler, Elissa J Database (Oxford) Original Article Genomic data interpretation often requires analyses that move from a gene-by-gene focus to a focus on sets of genes that are associated with biological phenomena such as molecular processes, phenotypes, diseases, drug interactions or environmental conditions. Unique challenges exist in the curation of gene sets beyond the challenges in curation of individual genes. Here we highlight a literature curation workflow whereby gene sets are curated from peer-reviewed published data into GeneWeaver (GW), a data repository and analysis platform. We describe the system features that allow for a flexible yet precise curation procedure. We illustrate the value of curation by gene sets through analysis of independently curated sets that relate to the integrated stress response, showing that sets curated from independent sources all share significant Jaccard similarity. A suite of reproducible analysis tools is provided in GW as services to carry out interactive functional investigation of user-submitted gene sets within the context of over 150 000 gene sets constructed from publicly available resources and published gene lists. A curation interface supports the ability of users to design and maintain curation workflows of gene sets, including assigning, reviewing and releasing gene sets within a curation project context. Oxford University Press 2019-03-19 /pmc/articles/PMC6424415/ /pubmed/30888410 http://dx.doi.org/10.1093/database/baz036 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Bubier, Jason
Hill, David
Mukherjee, Gaurab
Reynolds, Timothy
Baker, Erich J
Berger, Alexander
Emerson, Jake
Blake, Judith A
Chesler, Elissa J
Curating gene sets: challenges and opportunities for integrative analysis
title Curating gene sets: challenges and opportunities for integrative analysis
title_full Curating gene sets: challenges and opportunities for integrative analysis
title_fullStr Curating gene sets: challenges and opportunities for integrative analysis
title_full_unstemmed Curating gene sets: challenges and opportunities for integrative analysis
title_short Curating gene sets: challenges and opportunities for integrative analysis
title_sort curating gene sets: challenges and opportunities for integrative analysis
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6424415/
https://www.ncbi.nlm.nih.gov/pubmed/30888410
http://dx.doi.org/10.1093/database/baz036
work_keys_str_mv AT bubierjason curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT hilldavid curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT mukherjeegaurab curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT reynoldstimothy curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT bakererichj curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT bergeralexander curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT emersonjake curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT blakejuditha curatinggenesetschallengesandopportunitiesforintegrativeanalysis
AT cheslerelissaj curatinggenesetschallengesandopportunitiesforintegrativeanalysis