Revising transcriptome assemblies with phylogenetic information

A common transcriptome assembly error is to mistake different transcripts of the same gene for transcripts from multiple closely related genes. This error is difficult to identify during assembly, but in a phylogenetic analysis such errors can be diagnosed from gene phylogenies where they appear as c...

Bibliographic Details
Main authors: Guang, August, Howison, Mark, Zapata, Felipe, Lawrence, Charles, Dunn, Casey W.
Format: Online Article Text
Language: English
Published: Public Library of Science 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7802918/
https://www.ncbi.nlm.nih.gov/pubmed/33434218
http://dx.doi.org/10.1371/journal.pone.0244202
_version_ 1783635839168806912
author Guang, August
Howison, Mark
Zapata, Felipe
Lawrence, Charles
Dunn, Casey W.
author_facet Guang, August
Howison, Mark
Zapata, Felipe
Lawrence, Charles
Dunn, Casey W.
author_sort Guang, August
collection PubMed
description A common transcriptome assembly error is to mistake different transcripts of the same gene for transcripts from multiple closely related genes. This error is difficult to identify during assembly, but in a phylogenetic analysis such errors can be diagnosed from gene phylogenies where they appear as clades of tips from the same species with improbably short branch lengths. treeinform is a method that uses phylogenetic information across species to refine transcriptome assemblies within species. It identifies transcripts of the same gene that were incorrectly assigned to multiple genes and reassigns them as transcripts of the same gene. The treeinform method is implemented in Agalma, available at https://bitbucket.org/caseywdunn/agalma, and the general approach is relevant in a variety of other contexts.
format Online
Article
Text
id pubmed-7802918
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-78029182021-01-22 Revising transcriptome assemblies with phylogenetic information Guang, August Howison, Mark Zapata, Felipe Lawrence, Charles Dunn, Casey W. PLoS One Research Article A common transcriptome assembly error is to mistake different transcripts of the same gene for transcripts from multiple closely related genes. This error is difficult to identify during assembly, but in a phylogenetic analysis such errors can be diagnosed from gene phylogenies where they appear as clades of tips from the same species with improbably short branch lengths. treeinform is a method that uses phylogenetic information across species to refine transcriptome assemblies within species. It identifies transcripts of the same gene that were incorrectly assigned to multiple genes and reassigns them as transcripts of the same gene. The treeinform method is implemented in Agalma, available at https://bitbucket.org/caseywdunn/agalma, and the general approach is relevant in a variety of other contexts. Public Library of Science 2021-01-12 /pmc/articles/PMC7802918/ /pubmed/33434218 http://dx.doi.org/10.1371/journal.pone.0244202 Text en © 2021 Guang et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Guang, August
Howison, Mark
Zapata, Felipe
Lawrence, Charles
Dunn, Casey W.
Revising transcriptome assemblies with phylogenetic information
title Revising transcriptome assemblies with phylogenetic information
title_full Revising transcriptome assemblies with phylogenetic information
title_fullStr Revising transcriptome assemblies with phylogenetic information
title_full_unstemmed Revising transcriptome assemblies with phylogenetic information
title_short Revising transcriptome assemblies with phylogenetic information
title_sort revising transcriptome assemblies with phylogenetic information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7802918/
https://www.ncbi.nlm.nih.gov/pubmed/33434218
http://dx.doi.org/10.1371/journal.pone.0244202
work_keys_str_mv AT guangaugust revisingtranscriptomeassemblieswithphylogeneticinformation
AT howisonmark revisingtranscriptomeassemblieswithphylogeneticinformation
AT zapatafelipe revisingtranscriptomeassemblieswithphylogeneticinformation
AT lawrencecharles revisingtranscriptomeassemblieswithphylogeneticinformation
AT dunncaseyw revisingtranscriptomeassemblieswithphylogeneticinformation
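
The diagnostic summarized in the description (clades of tips from the same species with improbably short branch lengths) can be illustrated with a small sketch. This is not the Agalma implementation of treeinform: the `Node` class, the `candidate_clades` function, the use of total subtree branch length as the measure, and the threshold value are all assumptions made for illustration only.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical minimal gene-tree node (not an Agalma data structure)."""
    length: float = 0.0             # length of the branch leading to this node
    name: Optional[str] = None      # transcript name, set for tips only
    species: Optional[str] = None   # species label, set for tips only
    children: List["Node"] = field(default_factory=list)


def tips(node: Node) -> List[Node]:
    """Return all tips (leaves) at or below a node."""
    if not node.children:
        return [node]
    return [t for child in node.children for t in tips(child)]


def subtree_length(node: Node) -> float:
    """Sum of all branch lengths strictly below a node."""
    return sum(c.length + subtree_length(c) for c in node.children)


def candidate_clades(root: Node, threshold: float) -> List[List[str]]:
    """Find maximal clades whose tips all come from one species and whose
    total subtree length is below an assumed threshold. Each such clade is
    a candidate set of transcripts to reassign to a single gene."""
    found: List[List[str]] = []

    def visit(node: Node) -> None:
        if not node.children:
            return
        clade_tips = tips(node)
        species = {t.species for t in clade_tips}
        if len(species) == 1 and subtree_length(node) < threshold:
            found.append(sorted(t.name for t in clade_tips))
            return  # clade is maximal, so do not descend further
        for child in node.children:
            visit(child)

    visit(root)
    return found


# Toy gene tree: ((A1:0.001,A2:0.002):0.3,(B1:0.2,(A3:0.15,C1:0.1):0.05):0.1)
tree = Node(children=[
    Node(length=0.3, children=[
        Node(length=0.001, name="A1", species="sp_A"),
        Node(length=0.002, name="A2", species="sp_A"),
    ]),
    Node(length=0.1, children=[
        Node(length=0.2, name="B1", species="sp_B"),
        Node(length=0.05, children=[
            Node(length=0.15, name="A3", species="sp_A"),
            Node(length=0.1, name="C1", species="sp_C"),
        ]),
    ]),
])

flagged = candidate_clades(tree, threshold=0.01)
print(flagged)
```

In this toy tree only the A1/A2 clade is flagged: both tips are from the same species and the branches separating them are far shorter than the rest of the tree. A3 belongs to the same species but sits in a clade with another species on longer branches, so it is left as a separate gene.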