Cargando…

APPRIS 2017: principal isoforms for multiple gene sets

The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodriguez, Jose Manuel, Rodriguez-Rivas, Juan, Di Domenico, Tomás, Vázquez, Jesús, Valencia, Alfonso, Tress, Michael L
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5753224/
https://www.ncbi.nlm.nih.gov/pubmed/29069475
http://dx.doi.org/10.1093/nar/gkx997
_version_ 1783290234366066688
author Rodriguez, Jose Manuel
Rodriguez-Rivas, Juan
Di Domenico, Tomás
Vázquez, Jesús
Valencia, Alfonso
Tress, Michael L
author_facet Rodriguez, Jose Manuel
Rodriguez-Rivas, Juan
Di Domenico, Tomás
Vázquez, Jesús
Valencia, Alfonso
Tress, Michael L
author_sort Rodriguez, Jose Manuel
collection PubMed
description The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants.
format Online
Article
Text
id pubmed-5753224
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57532242018-01-05 APPRIS 2017: principal isoforms for multiple gene sets Rodriguez, Jose Manuel Rodriguez-Rivas, Juan Di Domenico, Tomás Vázquez, Jesús Valencia, Alfonso Tress, Michael L Nucleic Acids Res Database Issue The APPRIS database (http://appris-tools.org) uses protein structural and functional features and information from cross-species conservation to annotate splice isoforms in protein-coding genes. APPRIS selects a single protein isoform, the ‘principal’ isoform, as the reference for each gene based on these annotations. A single main splice isoform reflects the biological reality for most protein coding genes and APPRIS principal isoforms are the best predictors of these main proteins isoforms. Here, we present the updates to the database, new developments that include the addition of three new species (chimpanzee, Drosophila melangaster and Caenorhabditis elegans), the expansion of APPRIS to cover the RefSeq gene set and the UniProtKB proteome for six species and refinements in the core methods that make up the annotation pipeline. In addition APPRIS now provides a measure of reliability for individual principal isoforms and updates with each release of the GENCODE/Ensembl and RefSeq reference sets. The individual GENCODE/Ensembl, RefSeq and UniProtKB reference gene sets for six organisms have been merged to produce common sets of splice variants. Oxford University Press 2018-01-04 2017-10-23 /pmc/articles/PMC5753224/ /pubmed/29069475 http://dx.doi.org/10.1093/nar/gkx997 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Database Issue
Rodriguez, Jose Manuel
Rodriguez-Rivas, Juan
Di Domenico, Tomás
Vázquez, Jesús
Valencia, Alfonso
Tress, Michael L
APPRIS 2017: principal isoforms for multiple gene sets
title APPRIS 2017: principal isoforms for multiple gene sets
title_full APPRIS 2017: principal isoforms for multiple gene sets
title_fullStr APPRIS 2017: principal isoforms for multiple gene sets
title_full_unstemmed APPRIS 2017: principal isoforms for multiple gene sets
title_short APPRIS 2017: principal isoforms for multiple gene sets
title_sort appris 2017: principal isoforms for multiple gene sets
topic Database Issue
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5753224/
https://www.ncbi.nlm.nih.gov/pubmed/29069475
http://dx.doi.org/10.1093/nar/gkx997
work_keys_str_mv AT rodriguezjosemanuel appris2017principalisoformsformultiplegenesets
AT rodriguezrivasjuan appris2017principalisoformsformultiplegenesets
AT didomenicotomas appris2017principalisoformsformultiplegenesets
AT vazquezjesus appris2017principalisoformsformultiplegenesets
AT valenciaalfonso appris2017principalisoformsformultiplegenesets
AT tressmichaell appris2017principalisoformsformultiplegenesets