Cargando…
Mining data and metadata from the gene expression omnibus
Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Berlin Heidelberg
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6381352/ https://www.ncbi.nlm.nih.gov/pubmed/30594974 http://dx.doi.org/10.1007/s12551-018-0490-8 |
_version_ | 1783396480734724096 |
---|---|
author | Wang, Zichen Lachmann, Alexander Ma’ayan, Avi |
author_facet | Wang, Zichen Lachmann, Alexander Ma’ayan, Avi |
author_sort | Wang, Zichen |
collection | PubMed |
description | Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysis and integration of individual, or groups, of GEO datasets, large-scale reuse of those datasets is impeded by minimal requirements for standardized metadata both at the study and sample levels as well as uniform processing of the data across studies. Here, we review methodologies developed to facilitate the systematic curation and processing of publicly available gene expression datasets from GEO. We identify trends for advanced metadata curation and summarize approaches for reprocessing the data within the entire GEO repository. |
format | Online Article Text |
id | pubmed-6381352 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Springer Berlin Heidelberg |
record_format | MEDLINE/PubMed |
spelling | pubmed-63813522019-03-08 Mining data and metadata from the gene expression omnibus Wang, Zichen Lachmann, Alexander Ma’ayan, Avi Biophys Rev Review Publicly available gene expression datasets deposited in the Gene Expression Omnibus (GEO) are growing at an accelerating rate. Such datasets hold great value for knowledge discovery, particularly when integrated. Although numerous software platforms and tools have been developed to enable reanalysis and integration of individual, or groups, of GEO datasets, large-scale reuse of those datasets is impeded by minimal requirements for standardized metadata both at the study and sample levels as well as uniform processing of the data across studies. Here, we review methodologies developed to facilitate the systematic curation and processing of publicly available gene expression datasets from GEO. We identify trends for advanced metadata curation and summarize approaches for reprocessing the data within the entire GEO repository. Springer Berlin Heidelberg 2018-12-29 /pmc/articles/PMC6381352/ /pubmed/30594974 http://dx.doi.org/10.1007/s12551-018-0490-8 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Review Wang, Zichen Lachmann, Alexander Ma’ayan, Avi Mining data and metadata from the gene expression omnibus |
title | Mining data and metadata from the gene expression omnibus |
title_full | Mining data and metadata from the gene expression omnibus |
title_fullStr | Mining data and metadata from the gene expression omnibus |
title_full_unstemmed | Mining data and metadata from the gene expression omnibus |
title_short | Mining data and metadata from the gene expression omnibus |
title_sort | mining data and metadata from the gene expression omnibus |
topic | Review |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6381352/ https://www.ncbi.nlm.nih.gov/pubmed/30594974 http://dx.doi.org/10.1007/s12551-018-0490-8 |
work_keys_str_mv | AT wangzichen miningdataandmetadatafromthegeneexpressionomnibus AT lachmannalexander miningdataandmetadatafromthegeneexpressionomnibus AT maayanavi miningdataandmetadatafromthegeneexpressionomnibus |