GeoBoost2: a natural language processing pipeline for GenBank metadata enrichment for virus phylogeography
SUMMARY: We present GeoBoost2, a natural language processing pipeline for extracting the location of infected hosts for enriching metadata in nucleotide sequence repositories like the National Center for Biotechnology Information’s GenBank for downstream analysis including phylogeography and genomic epi...
Main authors: | Magge, Arjun; Weissenbacher, Davy; O’Connor, Karen; Tahsin, Tasnia; Gonzalez-Hernandez, Graciela; Scotch, Matthew |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press, 2020 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7755405/ https://www.ncbi.nlm.nih.gov/pubmed/32683454 http://dx.doi.org/10.1093/bioinformatics/btaa647 |
Similar items
- Named entity linking of geospatial and host metadata in GenBank for advancing biomedical research
  by: Tahsin, Tasnia, et al.
  Published: (2017)
- Incorporating sampling uncertainty in the geospatial assignment of taxa for virus phylogeography
  by: Scotch, Matthew, et al.
  Published: (2019)
- Extracting geographic locations from the literature for virus phylogeography using supervised and distant supervision methods
  by: Weissenbacher, Davy, et al.
  Published: (2017)
- GenBank
  by: Benson, Dennis A., et al.
  Published: (2008)
- GenBank
  by: Benson, Dennis A., et al.
  Published: (2006)