Cargando…

ACES: Analysis of Conservation with an Extensive list of Species

MOTIVATION: An abundance of new reference genomes is becoming available through large-scale sequencing efforts. While the reference FASTA for each genome is available, there is currently no automated mechanism to query a specific sequence across all new reference genomes. RESULTS: We developed ACES...

Descripción completa

Detalles Bibliográficos
Autores principales: Padhi, Evin M, Ng, Jeffrey K, Mehinovic, Elvisa, Sams, Eleanor I, Turner, Tychele N
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8570785/
https://www.ncbi.nlm.nih.gov/pubmed/34601580
http://dx.doi.org/10.1093/bioinformatics/btab684
Descripción
Sumario:MOTIVATION: An abundance of new reference genomes is becoming available through large-scale sequencing efforts. While the reference FASTA for each genome is available, there is currently no automated mechanism to query a specific sequence across all new reference genomes. RESULTS: We developed ACES (Analysis of Conservation with an Extensive list of Species) as a computational workflow to query specific sequences of interest (e.g. enhancers, promoters, exons) against reference genomes with an available reference FASTA. This automated workflow generates BLAST hits against each of the reference genomes, a multiple sequence alignment file, a graphical fragment assembly file and a phylogenetic tree file. These data files can then be used by the researcher in several ways to provide key insights into conservation of the query sequence. AVAILABILITY AND IMPLEMENTATION: ACES is available at https://github.com/TNTurnerLab/ACES SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.