Cargando…

From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format

Abstract. The paper describes a pilot project to convert a conventional floristic checklist, written in a standard word processing program, into structured data in the Darwin Core Archive format. After peer-review and editorial acceptance, the final revised version of the checklist was converted int...

Descripción completa

Detalles Bibliográficos
Autores principales: Remsen, David, Knapp, Sandra, Georgiev, Teodor, Stoev, Pavel, Penev, Lyubomir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Pensoft Publishers 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3281575/
https://www.ncbi.nlm.nih.gov/pubmed/22371687
http://dx.doi.org/10.3897/phytokeys.9.2770
_version_ 1782223983032139776
author Remsen, David
Knapp, Sandra
Georgiev, Teodor
Stoev, Pavel
Penev, Lyubomir
author_facet Remsen, David
Knapp, Sandra
Georgiev, Teodor
Stoev, Pavel
Penev, Lyubomir
author_sort Remsen, David
collection PubMed
description Abstract. The paper describes a pilot project to convert a conventional floristic checklist, written in a standard word processing program, into structured data in the Darwin Core Archive format. After peer-review and editorial acceptance, the final revised version of the checklist was converted into Darwin Core Archive by means of regular expressions and published thereafter in both human-readable form as traditional botanical publication and Darwin Core Archive data files. The data were published and indexed through the Global Biodiversity Information Facility (GBIF) Integrated Publishing Toolkit (IPT) and significant portions of the text of the paper were used to describe the metadata on IPT. After publication, the data will become available through the GBIF infrastructure and can be re-used on their own or collated with other data.
format Online
Article
Text
id pubmed-3281575
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Pensoft Publishers
record_format MEDLINE/PubMed
spelling pubmed-32815752012-02-27 From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format Remsen, David Knapp, Sandra Georgiev, Teodor Stoev, Pavel Penev, Lyubomir PhytoKeys Article Abstract. The paper describes a pilot project to convert a conventional floristic checklist, written in a standard word processing program, into structured data in the Darwin Core Archive format. After peer-review and editorial acceptance, the final revised version of the checklist was converted into Darwin Core Archive by means of regular expressions and published thereafter in both human-readable form as traditional botanical publication and Darwin Core Archive data files. The data were published and indexed through the Global Biodiversity Information Facility (GBIF) Integrated Publishing Toolkit (IPT) and significant portions of the text of the paper were used to describe the metadata on IPT. After publication, the data will become available through the GBIF infrastructure and can be re-used on their own or collated with other data. Pensoft Publishers 2012-01-30 /pmc/articles/PMC3281575/ /pubmed/22371687 http://dx.doi.org/10.3897/phytokeys.9.2770 Text en David Remsen, Sandra Knapp, Teodor GeorgievPavel Stoev4, Lyubomir Penev http://creativecommons.org/licenses/by/3.0 This is an open access article distributed under the terms of the Creative Commons Attribution License 3.0 (CC-BY), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Article
Remsen, David
Knapp, Sandra
Georgiev, Teodor
Stoev, Pavel
Penev, Lyubomir
From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format
title From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format
title_full From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format
title_fullStr From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format
title_full_unstemmed From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format
title_short From text to structured data: Converting a word-processed floristic checklist into Darwin Core Archive format
title_sort from text to structured data: converting a word-processed floristic checklist into darwin core archive format
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3281575/
https://www.ncbi.nlm.nih.gov/pubmed/22371687
http://dx.doi.org/10.3897/phytokeys.9.2770
work_keys_str_mv AT remsendavid fromtexttostructureddataconvertingawordprocessedfloristicchecklistintodarwincorearchiveformat
AT knappsandra fromtexttostructureddataconvertingawordprocessedfloristicchecklistintodarwincorearchiveformat
AT georgievteodor fromtexttostructureddataconvertingawordprocessedfloristicchecklistintodarwincorearchiveformat
AT stoevpavel fromtexttostructureddataconvertingawordprocessedfloristicchecklistintodarwincorearchiveformat
AT penevlyubomir fromtexttostructureddataconvertingawordprocessedfloristicchecklistintodarwincorearchiveformat