GeoBoost2: a natural language processing pipeline for GenBank metadata enrichment for virus phylogeography

SUMMARY: We present GeoBoost2, a natural language processing pipeline that extracts the location of infected hosts to enrich metadata in nucleotide sequence repositories such as the National Center for Biotechnology Information’s GenBank, for downstream analysis including phylogeography and genomic epi...

Bibliographic Details
Main Authors: Magge, Arjun; Weissenbacher, Davy; O’Connor, Karen; Tahsin, Tasnia; Gonzalez-Hernandez, Graciela; Scotch, Matthew
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755405/
https://www.ncbi.nlm.nih.gov/pubmed/32683454
http://dx.doi.org/10.1093/bioinformatics/btaa647