Cargando…

Comparative Annotation Toolkit (CAT)—simultaneous clade and personal genome annotation

The recent introductions of low-cost, long-read, and read-cloud sequencing technologies coupled with intense efforts to develop efficient algorithms have made affordable, high-quality de novo sequence assembly a realistic proposition. The result is an explosion of new, ultracontiguous genome assembl...

Descripción completa

Detalles Bibliográficos
Autores principales: Fiddes, Ian T., Armstrong, Joel, Diekhans, Mark, Nachtweide, Stefanie, Kronenberg, Zev N., Underwood, Jason G., Gordon, David, Earl, Dent, Keane, Thomas, Eichler, Evan E., Haussler, David, Stanke, Mario, Paten, Benedict
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6028123/
https://www.ncbi.nlm.nih.gov/pubmed/29884752
http://dx.doi.org/10.1101/gr.233460.117
Descripción
Sumario:The recent introductions of low-cost, long-read, and read-cloud sequencing technologies coupled with intense efforts to develop efficient algorithms have made affordable, high-quality de novo sequence assembly a realistic proposition. The result is an explosion of new, ultracontiguous genome assemblies. To compare these genomes, we need robust methods for genome annotation. We describe the fully open source Comparative Annotation Toolkit (CAT), which provides a flexible way to simultaneously annotate entire clades and identify orthology relationships. We show that CAT can be used to improve annotations on the rat genome, annotate the great apes, annotate a diverse set of mammals, and annotate personal, diploid human genomes. We demonstrate the resulting discovery of novel genes, isoforms, and structural variants—even in genomes as well studied as rat and the great apes—and how these annotations improve cross-species RNA expression experiments.