Cargando…

Ranked choice voting for representative transcripts with TRaCE

SUMMARY: Genome sequencing projects annotate protein-coding gene models with multiple transcripts, aiming to represent all of the available transcript evidence. However, downstream analyses often operate on only one representative transcript per gene locus, sometimes known as the canonical transcrip...

Descripción completa

Detalles Bibliográficos
Autores principales: Olson, Andrew J, Ware, Doreen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696091/
https://www.ncbi.nlm.nih.gov/pubmed/34297055
http://dx.doi.org/10.1093/bioinformatics/btab542
_version_ 1784619729277681664
author Olson, Andrew J
Ware, Doreen
author_facet Olson, Andrew J
Ware, Doreen
author_sort Olson, Andrew J
collection PubMed
description SUMMARY: Genome sequencing projects annotate protein-coding gene models with multiple transcripts, aiming to represent all of the available transcript evidence. However, downstream analyses often operate on only one representative transcript per gene locus, sometimes known as the canonical transcript. To choose canonical transcripts, Transcript Ranking and Canonical Election (TRaCE) holds an ‘election’ in which a set of RNA-seq samples rank transcripts by annotation edit distance. These sample-specific votes are tallied along with other criteria such as protein length and InterPro domain coverage. The winner is selected as the canonical transcript, but the election proceeds through multiple rounds of voting to order all the transcripts by relevance. Based on the set of expression data provided, TRaCE can identify the most common isoforms from a broad expression atlas or prioritize alternative transcripts expressed in specific contexts. AVAILABILITY AND IMPLEMENTATION: Transcript ranking code can be found on GitHub at {{https://github.com/warelab/TRaCE}}. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8696091
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86960912022-01-04 Ranked choice voting for representative transcripts with TRaCE Olson, Andrew J Ware, Doreen Bioinformatics Applications Notes SUMMARY: Genome sequencing projects annotate protein-coding gene models with multiple transcripts, aiming to represent all of the available transcript evidence. However, downstream analyses often operate on only one representative transcript per gene locus, sometimes known as the canonical transcript. To choose canonical transcripts, Transcript Ranking and Canonical Election (TRaCE) holds an ‘election’ in which a set of RNA-seq samples rank transcripts by annotation edit distance. These sample-specific votes are tallied along with other criteria such as protein length and InterPro domain coverage. The winner is selected as the canonical transcript, but the election proceeds through multiple rounds of voting to order all the transcripts by relevance. Based on the set of expression data provided, TRaCE can identify the most common isoforms from a broad expression atlas or prioritize alternative transcripts expressed in specific contexts. AVAILABILITY AND IMPLEMENTATION: Transcript ranking code can be found on GitHub at {{https://github.com/warelab/TRaCE}}. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-07-23 /pmc/articles/PMC8696091/ /pubmed/34297055 http://dx.doi.org/10.1093/bioinformatics/btab542 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Olson, Andrew J
Ware, Doreen
Ranked choice voting for representative transcripts with TRaCE
title Ranked choice voting for representative transcripts with TRaCE
title_full Ranked choice voting for representative transcripts with TRaCE
title_fullStr Ranked choice voting for representative transcripts with TRaCE
title_full_unstemmed Ranked choice voting for representative transcripts with TRaCE
title_short Ranked choice voting for representative transcripts with TRaCE
title_sort ranked choice voting for representative transcripts with trace
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8696091/
https://www.ncbi.nlm.nih.gov/pubmed/34297055
http://dx.doi.org/10.1093/bioinformatics/btab542
work_keys_str_mv AT olsonandrewj rankedchoicevotingforrepresentativetranscriptswithtrace
AT waredoreen rankedchoicevotingforrepresentativetranscriptswithtrace