Cargando…

Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation

The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl ‘Core’ database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruffier, Magali, Kähäri, Andreas, Komorowska, Monika, Keenan, Stephen, Laird, Matthew, Longden, Ian, Proctor, Glenn, Searle, Steve, Staines, Daniel, Taylor, Kieron, Vullo, Alessandro, Yates, Andrew, Zerbino, Daniel, Flicek, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5467575/
https://www.ncbi.nlm.nih.gov/pubmed/28365736
http://dx.doi.org/10.1093/database/bax020
_version_ 1783243292939386880
author Ruffier, Magali
Kähäri, Andreas
Komorowska, Monika
Keenan, Stephen
Laird, Matthew
Longden, Ian
Proctor, Glenn
Searle, Steve
Staines, Daniel
Taylor, Kieron
Vullo, Alessandro
Yates, Andrew
Zerbino, Daniel
Flicek, Paul
author_facet Ruffier, Magali
Kähäri, Andreas
Komorowska, Monika
Keenan, Stephen
Laird, Matthew
Longden, Ian
Proctor, Glenn
Searle, Steve
Staines, Daniel
Taylor, Kieron
Vullo, Alessandro
Yates, Andrew
Zerbino, Daniel
Flicek, Paul
author_sort Ruffier, Magali
collection PubMed
description The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl ‘Core’ database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list (http://www.ensembl.org/info/about/contact/index.html). Database URL: http://www.ensembl.org
format Online
Article
Text
id pubmed-5467575
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54675752017-06-19 Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation Ruffier, Magali Kähäri, Andreas Komorowska, Monika Keenan, Stephen Laird, Matthew Longden, Ian Proctor, Glenn Searle, Steve Staines, Daniel Taylor, Kieron Vullo, Alessandro Yates, Andrew Zerbino, Daniel Flicek, Paul Database (Oxford) Database Update The Ensembl software resources are a stable infrastructure to store, access and manipulate genome assemblies and their functional annotations. The Ensembl ‘Core’ database and Application Programming Interface (API) was our first major piece of software infrastructure and remains at the centre of all of our genome resources. Since its initial design more than fifteen years ago, the number of publicly available genomic, transcriptomic and proteomic datasets has grown enormously, accelerated by continuous advances in DNA-sequencing technology. Initially intended to provide annotation for the reference human genome, we have extended our framework to support the genomes of all species as well as richer assembly models. Cross-referenced links to other informatics resources facilitate searching our database with a variety of popular identifiers such as UniProt and RefSeq. Our comprehensive and robust framework storing a large diversity of genome annotations in one location serves as a platform for other groups to generate and maintain their own tailored annotation. We welcome reuse and contributions: our databases and APIs are publicly available, all of our source code is released with a permissive Apache v2.0 licence at http://github.com/Ensembl and we have an active developer mailing list (http://www.ensembl.org/info/about/contact/index.html). Database URL: http://www.ensembl.org Oxford University Press 2017-03-18 /pmc/articles/PMC5467575/ /pubmed/28365736 http://dx.doi.org/10.1093/database/bax020 Text en © The Author(s) 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Database Update
Ruffier, Magali
Kähäri, Andreas
Komorowska, Monika
Keenan, Stephen
Laird, Matthew
Longden, Ian
Proctor, Glenn
Searle, Steve
Staines, Daniel
Taylor, Kieron
Vullo, Alessandro
Yates, Andrew
Zerbino, Daniel
Flicek, Paul
Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
title Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
title_full Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
title_fullStr Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
title_full_unstemmed Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
title_short Ensembl core software resources: storage and programmatic access for DNA sequence and genome annotation
title_sort ensembl core software resources: storage and programmatic access for dna sequence and genome annotation
topic Database Update
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5467575/
https://www.ncbi.nlm.nih.gov/pubmed/28365736
http://dx.doi.org/10.1093/database/bax020
work_keys_str_mv AT ruffiermagali ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT kahariandreas ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT komorowskamonika ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT keenanstephen ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT lairdmatthew ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT longdenian ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT proctorglenn ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT searlesteve ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT stainesdaniel ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT taylorkieron ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT vulloalessandro ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT yatesandrew ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT zerbinodaniel ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation
AT flicekpaul ensemblcoresoftwareresourcesstorageandprogrammaticaccessfordnasequenceandgenomeannotation