
Accurate microRNA annotation of animal genomes using trained covariance models of curated microRNA complements in MirMachine


Bibliographic Details
Main Authors: Umu, Sinan Uğur; Paynter, Vanessa M.; Trondsen, Håvard; Buschmann, Tilo; Rounge, Trine B.; Peterson, Kevin J.; Fromm, Bastian
Format: Online Article Text
Language: English
Published: Elsevier 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10435380/
https://www.ncbi.nlm.nih.gov/pubmed/37601971
http://dx.doi.org/10.1016/j.xgen.2023.100348
Description
Summary: The annotation of microRNAs depends on the availability of transcriptomics data and expert knowledge. This has led to a gap between the availability of novel genomes and high-quality microRNA complements. Using >16,000 microRNAs from the manually curated microRNA gene database MirGeneDB, we generated trained covariance models for all conserved microRNA families. These models are available in our tool MirMachine, which annotates conserved microRNAs within genomes. We successfully applied MirMachine to a range of animal species, including those with large genomes and genome duplications and extinct species, where small RNA sequencing is hard to achieve. We further describe a microRNA score of expected microRNAs that can be used to assess the completeness of genome assemblies. MirMachine closes a long-persisting gap in the microRNA field by facilitating automated genome annotation pipelines and deeper studies into the evolution of genome regulation, even in extinct organisms.