Cargando…

Leveraging multiple transcriptome assembly methods for improved gene structure annotation

BACKGROUND: The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. RESULTS: Here, we show that the accuracy of transcript reconstruction can be boosted by combining multipl...

Descripción completa

Detalles Bibliográficos
Autores principales: Venturini, Luca, Caim, Shabhonam, Kaithakottil, Gemy George, Mapleson, Daniel Lee, Swarbreck, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6105091/
https://www.ncbi.nlm.nih.gov/pubmed/30052957
http://dx.doi.org/10.1093/gigascience/giy093
_version_ 1783349598260035584
author Venturini, Luca
Caim, Shabhonam
Kaithakottil, Gemy George
Mapleson, Daniel Lee
Swarbreck, David
author_facet Venturini, Luca
Caim, Shabhonam
Kaithakottil, Gemy George
Mapleson, Daniel Lee
Swarbreck, David
author_sort Venturini, Luca
collection PubMed
description BACKGROUND: The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. RESULTS: Here, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms. CONCLUSIONS: We have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub.
format Online
Article
Text
id pubmed-6105091
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61050912018-08-27 Leveraging multiple transcriptome assembly methods for improved gene structure annotation Venturini, Luca Caim, Shabhonam Kaithakottil, Gemy George Mapleson, Daniel Lee Swarbreck, David Gigascience Technical Note BACKGROUND: The performance of RNA sequencing (RNA-seq) aligners and assemblers varies greatly across different organisms and experiments, and often the optimal approach is not known beforehand. RESULTS: Here, we show that the accuracy of transcript reconstruction can be boosted by combining multiple methods, and we present a novel algorithm to integrate multiple RNA-seq assemblies into a coherent transcript annotation. Our algorithm can remove redundancies and select the best transcript models according to user-specified metrics, while solving common artifacts such as erroneous transcript chimerisms. CONCLUSIONS: We have implemented this method in an open-source Python3 and Cython program, Mikado, available on GitHub. Oxford University Press 2018-07-24 /pmc/articles/PMC6105091/ /pubmed/30052957 http://dx.doi.org/10.1093/gigascience/giy093 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Note
Venturini, Luca
Caim, Shabhonam
Kaithakottil, Gemy George
Mapleson, Daniel Lee
Swarbreck, David
Leveraging multiple transcriptome assembly methods for improved gene structure annotation
title Leveraging multiple transcriptome assembly methods for improved gene structure annotation
title_full Leveraging multiple transcriptome assembly methods for improved gene structure annotation
title_fullStr Leveraging multiple transcriptome assembly methods for improved gene structure annotation
title_full_unstemmed Leveraging multiple transcriptome assembly methods for improved gene structure annotation
title_short Leveraging multiple transcriptome assembly methods for improved gene structure annotation
title_sort leveraging multiple transcriptome assembly methods for improved gene structure annotation
topic Technical Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6105091/
https://www.ncbi.nlm.nih.gov/pubmed/30052957
http://dx.doi.org/10.1093/gigascience/giy093
work_keys_str_mv AT venturiniluca leveragingmultipletranscriptomeassemblymethodsforimprovedgenestructureannotation
AT caimshabhonam leveragingmultipletranscriptomeassemblymethodsforimprovedgenestructureannotation
AT kaithakottilgemygeorge leveragingmultipletranscriptomeassemblymethodsforimprovedgenestructureannotation
AT maplesondaniellee leveragingmultipletranscriptomeassemblymethodsforimprovedgenestructureannotation
AT swarbreckdavid leveragingmultipletranscriptomeassemblymethodsforimprovedgenestructureannotation