Cargando…

Mining data and metadata from the gene expression omnibus

Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zichen, Lachmann, Alexander, Ma’ayan, Avi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Berlin Heidelberg 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6381352/
https://www.ncbi.nlm.nih.gov/pubmed/30594974
http://dx.doi.org/10.1007/s12551-018-0490-8
_version_ 1783396480734724096
author Wang, Zichen
Lachmann, Alexander
Ma’ayan, Avi
author_facet Wang, Zichen
Lachmann, Alexander
Ma’ayan, Avi
author_sort Wang, Zichen
collection PubMed
description Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysis and integration of individual, or groups, of GEO datasets, large-scale reuse of those datasets is impeded by minimal requirements for standardized metadata both at the study and sample levels as well as uniform processing of the data across studies. Here, we review methodologies developed to facilitate the systematic curation and processing of publicly available gene expression datasets from GEO. We identify trends for advanced metadata curation and summarize approaches for reprocessing the data within the entire GEO repository.
format Online
Article
Text
id pubmed-6381352
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer Berlin Heidelberg
record_format MEDLINE/PubMed
spelling pubmed-63813522019-03-08 Mining data and metadata from the gene expression omnibus Wang, Zichen Lachmann, Alexander Ma’ayan, Avi Biophys Rev Review Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysis and integration of individual, or groups, of GEO datasets, large-scale reuse of those datasets is impeded by minimal requirements for standardized metadata both at the study and sample levels as well as uniform processing of the data across studies. Here, we review methodologies developed to facilitate the systematic curation and processing of publicly available gene expression datasets from GEO. We identify trends for advanced metadata curation and summarize approaches for reprocessing the data within the entire GEO repository. Springer Berlin Heidelberg 2018-12-29 /pmc/articles/PMC6381352/ /pubmed/30594974 http://dx.doi.org/10.1007/s12551-018-0490-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Review
Wang, Zichen
Lachmann, Alexander
Ma’ayan, Avi
Mining data and metadata from the gene expression omnibus
title Mining data and metadata from the gene expression omnibus
title_full Mining data and metadata from the gene expression omnibus
title_fullStr Mining data and metadata from the gene expression omnibus
title_full_unstemmed Mining data and metadata from the gene expression omnibus
title_short Mining data and metadata from the gene expression omnibus
title_sort mining data and metadata from the gene expression omnibus
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6381352/
https://www.ncbi.nlm.nih.gov/pubmed/30594974
http://dx.doi.org/10.1007/s12551-018-0490-8
work_keys_str_mv AT wangzichen miningdataandmetadatafromthegeneexpressionomnibus
AT lachmannalexander miningdataandmetadatafromthegeneexpressionomnibus
AT maayanavi miningdataandmetadatafromthegeneexpressionomnibus