Cargando…

Leveraging multiple transcriptome assembly methods for improved gene structure annotation

BACKGROUND: The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. RESULTS: Here, we show that the accuracy of transcript reconstruction can be boosted by combining multipl...

Descripción completa

Detalles Bibliográficos
Autores principales: Venturini, Luca, Caim, Shabhonam, Kaithakottil, Gemy George, Mapleson, Daniel Lee, Swarbreck, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6105091/
https://www.ncbi.nlm.nih.gov/pubmed/30052957
http://dx.doi.org/10.1093/gigascience/giy093
Descripción
Sumario:BACKGROUND: The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. RESULTS: Here, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms. CONCLUSIONS: We have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub.