Cargando…

Evidence That Inconsistent Gene Prediction Can Mislead Analysis of Dinoflagellate Genomes

Comparative algal genomics often relies on predicted genes from de novo assembled genomes. However, the artifacts introduced by different gene‐prediction approaches, and their impact on comparative genomic analysis remain poorly understood. Here, using available genome data from six dinoflagellate s...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Yibi, González‐Pech, Raúl A., Stephens, Timothy G., Bhattacharya, Debashish, Chan, Cheong Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7065002/
https://www.ncbi.nlm.nih.gov/pubmed/31713873
http://dx.doi.org/10.1111/jpy.12947
Descripción
Sumario:Comparative algal genomics often relies on predicted genes from de novo assembled genomes. However, the artifacts introduced by different gene‐prediction approaches, and their impact on comparative genomic analysis remain poorly understood. Here, using available genome data from six dinoflagellate species in the Symbiodiniaceae, we identified methodological biases in the published genes that were predicted using different approaches and putative contaminant sequences in the published genome assemblies. We developed and applied a comprehensive customized workflow to predict genes from these genomes. The observed variation among predicted genes resulting from our workflow agreed with current understanding of phylogenetic relationships among these taxa, whereas the variation among the previously published genes was largely biased by the distinct approaches used in each instance. Importantly, these biases affect the inference of homologous gene families and synteny among genomes, thus impacting biological interpretation of these data. Our results demonstrate that a consistent gene‐prediction approach is critical for comparative analysis of dinoflagellate genomes.