Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd
Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expre...
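Main authors: | Wang, Zichen; Monteiro, Caroline D.; Jagodnik, Kathleen M.; Fernandez, Nicolas F.; Gundersen, Gregory W.; Rouillard, Andrew D.; Jenkins, Sherry L.; Feldmann, Axel S.; Hu, Kevin S.; McDermott, Michael G.; Duan, Qiaonan; Clark, Neil R.; Jones, Matthew R.; Kou, Yan; Goff, Troy; Woodland, Holly; Amaral, Fabio M R.; Szeto, Gregory L.; Fuchs, Oliver; Schüssler-Fiorenza Rose, Sophia M.; Sharma, Shvetank; Schwartz, Uwe; Bausela, Xabier Bengoetxea; Szymkiewicz, Maciej; Maroulis, Vasileios; Salykin, Anton; Barra, Carolina M.; Kruth, Candice D.; Bongio, Nicholas J.; Mathur, Vaibhav; Todoric, Radmila D; Rubin, Udi E.; Malatras, Apostolos; Fulp, Carl T.; Galindo, John A.; Motiejunaite, Ruta; Jüschke, Christoph; Dishuck, Philip C.; Lahl, Katharina; Jafari, Mohieddin; Aibar, Sara; Zaravinos, Apostolos; Steenhuizen, Linda H.; Allison, Lindsey R.; Gamallo, Pablo; de Andres Segura, Fernando; Dae Devlin, Tyler; Pérez-García, Vicente; Ma'ayan, Avi |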
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group, 2016 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052684/ https://www.ncbi.nlm.nih.gov/pubmed/27667448 http://dx.doi.org/10.1038/ncomms12846 |
_version_ | 1782458274959851520 |
---|---|
author | Wang, Zichen Monteiro, Caroline D. Jagodnik, Kathleen M. Fernandez, Nicolas F. Gundersen, Gregory W. Rouillard, Andrew D. Jenkins, Sherry L. Feldmann, Axel S. Hu, Kevin S. McDermott, Michael G. Duan, Qiaonan Clark, Neil R. Jones, Matthew R. Kou, Yan Goff, Troy Woodland, Holly Amaral, Fabio M R. Szeto, Gregory L. Fuchs, Oliver Schüssler-Fiorenza Rose, Sophia M. Sharma, Shvetank Schwartz, Uwe Bausela, Xabier Bengoetxea Szymkiewicz, Maciej Maroulis, Vasileios Salykin, Anton Barra, Carolina M. Kruth, Candice D. Bongio, Nicholas J. Mathur, Vaibhav Todoric, Radmila D Rubin, Udi E. Malatras, Apostolos Fulp, Carl T. Galindo, John A. Motiejunaite, Ruta Jüschke, Christoph Dishuck, Philip C. Lahl, Katharina Jafari, Mohieddin Aibar, Sara Zaravinos, Apostolos Steenhuizen, Linda H. Allison, Lindsey R. Gamallo, Pablo de Andres Segura, Fernando Dae Devlin, Tyler Pérez-García, Vicente Ma'ayan, Avi |
author_sort | Wang, Zichen |
collection | PubMed |
description | Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization. |
format | Online Article Text |
id | pubmed-5052684 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-5052684 2016-10-21 Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd Wang, Zichen Monteiro, Caroline D. Jagodnik, Kathleen M. Fernandez, Nicolas F. Gundersen, Gregory W. Rouillard, Andrew D. Jenkins, Sherry L. Feldmann, Axel S. Hu, Kevin S. McDermott, Michael G. Duan, Qiaonan Clark, Neil R. Jones, Matthew R. Kou, Yan Goff, Troy Woodland, Holly Amaral, Fabio M R. Szeto, Gregory L. Fuchs, Oliver Schüssler-Fiorenza Rose, Sophia M. Sharma, Shvetank Schwartz, Uwe Bausela, Xabier Bengoetxea Szymkiewicz, Maciej Maroulis, Vasileios Salykin, Anton Barra, Carolina M. Kruth, Candice D. Bongio, Nicholas J. Mathur, Vaibhav Todoric, Radmila D Rubin, Udi E. Malatras, Apostolos Fulp, Carl T. Galindo, John A. Motiejunaite, Ruta Jüschke, Christoph Dishuck, Philip C. Lahl, Katharina Jafari, Mohieddin Aibar, Sara Zaravinos, Apostolos Steenhuizen, Linda H. Allison, Lindsey R. Gamallo, Pablo de Andres Segura, Fernando Dae Devlin, Tyler Pérez-García, Vicente Ma'ayan, Avi Nat Commun Article Gene expression data are accumulating exponentially in public repositories. Reanalysis and integration of themed collections from these studies may provide new insights, but requires further human curation. Here we report a crowdsourcing project to annotate and reanalyse a large number of gene expression profiles from Gene Expression Omnibus (GEO). Through a massive open online course on Coursera, over 70 participants from over 25 countries identify and annotate 2,460 single-gene perturbation signatures, 839 disease versus normal signatures, and 906 drug perturbation signatures. All these signatures are unique and are manually validated for quality. Global analysis of these signatures confirms known associations and identifies novel associations between genes, diseases and drugs. The manually curated signatures are used as a training set to develop classifiers for extracting similar signatures from the entire GEO repository. We develop a web portal to serve these signatures for query, download and visualization. Nature Publishing Group 2016-09-26 /pmc/articles/PMC5052684/ /pubmed/27667448 http://dx.doi.org/10.1038/ncomms12846 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
title | Extraction and analysis of signatures from the Gene Expression Omnibus by the crowd |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052684/ https://www.ncbi.nlm.nih.gov/pubmed/27667448 http://dx.doi.org/10.1038/ncomms12846 |