Cargando…

A semi-automated workflow for biodiversity data retrieval, cleaning, and quality control

Abstract. The compilation and cleaning of data needed for analyses and prediction of species distributions is a time consuming process requiring a solid understanding of data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Da...

Descripción completa

Detalles Bibliográficos
Autores principales: Mathew, Cherian, Güntsch, Anton, Obst, Matthias, Vicario, Saverio, Haines, Robert, Williams, Alan R., de Jong, Yde, Goble, Carole
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Pensoft Publishers 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4267104/
https://www.ncbi.nlm.nih.gov/pubmed/25535486
http://dx.doi.org/10.3897/BDJ.2.e4221
Descripción
Sumario:Abstract. The compilation and cleaning of data needed for analyses and prediction of species distributions is a time consuming process requiring a solid understanding of data formats and service APIs provided by biodiversity informatics infrastructures. We designed and implemented a Taverna-based Data Refinement Workflow which integrates taxonomic data retrieval, data cleaning, and data selection into a consistent, standards-based, and effective system hiding the complexity of underlying service infrastructures. The workflow can be freely used both locally and through a web-portal which does not require additional software installations by users.