Cargando…

The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data

MOTIVATION: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to add...

Descripción completa

Detalles Bibliográficos
Autores principales: Petryszak, Robert, Fonseca, Nuno A, Füllgrabe, Anja, Huerta, Laura, Keays, Maria, Tang, Y Amy, Brazma, Alvis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870697/
https://www.ncbi.nlm.nih.gov/pubmed/28369191
http://dx.doi.org/10.1093/bioinformatics/btx143
_version_ 1783309536302465024
author Petryszak, Robert
Fonseca, Nuno A
Füllgrabe, Anja
Huerta, Laura
Keays, Maria
Tang, Y Amy
Brazma, Alvis
author_facet Petryszak, Robert
Fonseca, Nuno A
Füllgrabe, Anja
Huerta, Laura
Keays, Maria
Tang, Y Amy
Brazma, Alvis
author_sort Petryszak, Robert
collection PubMed
description MOTIVATION: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to address these challenges and to allow the scientific community easy access to not just raw, but also processed RNA-Seq data. We present a Web service to access the results of a systematically and continually updated standardized alignment as well as gene and exon expression quantification of all public bulk (and in the near future also single-cell) RNA-Seq runs in 264 species in European Nucleotide Archive, using Representational State Transfer. RESULTS: The RNASeq-er API (Application Programming Interface) enables ontology-powered search for and retrieval of CRAM, bigwig and bedGraph files, gene and exon expression quantification matrices (Fragments Per Kilobase Of Exon Per Million Fragments Mapped, Transcripts Per Million, raw counts) as well as sample attributes annotated with ontology terms. To date over 270 00 RNA-Seq runs in nearly 10 000 studies (1PB of raw FASTQ data) in 264 species in ENA have been processed and made available via the API. AVAILABILITY AND IMPLEMENTATION: The RNASeq-er API can be accessed at http://www.ebi.ac.uk/fg/rnaseq/api. The commands used to analyse the data are available in supplementary materials and at https://github.com/nunofonseca/irap/wiki/iRAP-single-library. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-5870697
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58706972018-04-05 The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data Petryszak, Robert Fonseca, Nuno A Füllgrabe, Anja Huerta, Laura Keays, Maria Tang, Y Amy Brazma, Alvis Bioinformatics Applications Notes MOTIVATION: The exponential growth of publicly available RNA-sequencing (RNA-Seq) data poses an increasing challenge to researchers wishing to discover, analyse and store such data, particularly those based in institutions with limited computational resources. EMBL-EBI is in an ideal position to address these challenges and to allow the scientific community easy access to not just raw, but also processed RNA-Seq data. We present a Web service to access the results of a systematically and continually updated standardized alignment as well as gene and exon expression quantification of all public bulk (and in the near future also single-cell) RNA-Seq runs in 264 species in European Nucleotide Archive, using Representational State Transfer. RESULTS: The RNASeq-er API (Application Programming Interface) enables ontology-powered search for and retrieval of CRAM, bigwig and bedGraph files, gene and exon expression quantification matrices (Fragments Per Kilobase Of Exon Per Million Fragments Mapped, Transcripts Per Million, raw counts) as well as sample attributes annotated with ontology terms. To date over 270 00 RNA-Seq runs in nearly 10 000 studies (1PB of raw FASTQ data) in 264 species in ENA have been processed and made available via the API. AVAILABILITY AND IMPLEMENTATION: The RNASeq-er API can be accessed at http://www.ebi.ac.uk/fg/rnaseq/api. The commands used to analyse the data are available in supplementary materials and at https://github.com/nunofonseca/irap/wiki/iRAP-single-library. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2017-07-15 2017-03-22 /pmc/articles/PMC5870697/ /pubmed/28369191 http://dx.doi.org/10.1093/bioinformatics/btx143 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Petryszak, Robert
Fonseca, Nuno A
Füllgrabe, Anja
Huerta, Laura
Keays, Maria
Tang, Y Amy
Brazma, Alvis
The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data
title The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data
title_full The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data
title_fullStr The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data
title_full_unstemmed The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data
title_short The RNASeq-er API—a gateway to systematically updated analysis of public RNA-seq data
title_sort rnaseq-er api—a gateway to systematically updated analysis of public rna-seq data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870697/
https://www.ncbi.nlm.nih.gov/pubmed/28369191
http://dx.doi.org/10.1093/bioinformatics/btx143
work_keys_str_mv AT petryszakrobert thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT fonsecanunoa thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT fullgrabeanja thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT huertalaura thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT keaysmaria thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT tangyamy thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT brazmaalvis thernaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT petryszakrobert rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT fonsecanunoa rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT fullgrabeanja rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT huertalaura rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT keaysmaria rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT tangyamy rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata
AT brazmaalvis rnaseqerapiagatewaytosystematicallyupdatedanalysisofpublicrnaseqdata