The taxonomic name resolution service: an online tool for automated standardization of plant names
BACKGROUND: The digitization of biodiversity data is leading to the widespread application of taxon names that are superfluous, ambiguous or incorrect, resulting in mismatched records and inflated species numbers. The ultimate consequences of misspelled names and bad taxonomy are erroneous scientifi...
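Main authors: | Boyle, Brad; Hopkins, Nicole; Lu, Zhenyuan; Raygoza Garay, Juan Antonio; Mozzherin, Dmitry; Rees, Tony; Matasci, Naim; Narro, Martha L; Piel, William H; Mckay, Sheldon J; Lowry, Sonya; Freeland, Chris; Peet, Robert K; Enquist, Brian J |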
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2013 |
Subjects: | Software |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3554605/ https://www.ncbi.nlm.nih.gov/pubmed/23324024 http://dx.doi.org/10.1186/1471-2105-14-16 |
_version_ | 1782256930121580544 |
---|---|
author | Boyle, Brad; Hopkins, Nicole; Lu, Zhenyuan; Raygoza Garay, Juan Antonio; Mozzherin, Dmitry; Rees, Tony; Matasci, Naim; Narro, Martha L; Piel, William H; Mckay, Sheldon J; Lowry, Sonya; Freeland, Chris; Peet, Robert K; Enquist, Brian J |
author_sort | Boyle, Brad |
collection | PubMed |
description | BACKGROUND: The digitization of biodiversity data is leading to the widespread application of taxon names that are superfluous, ambiguous or incorrect, resulting in mismatched records and inflated species numbers. The ultimate consequences of misspelled names and bad taxonomy are erroneous scientific conclusions and faulty policy decisions. The lack of tools for correcting this ‘names problem’ has become a fundamental obstacle to integrating disparate data sources and advancing the progress of biodiversity science. RESULTS: The TNRS, or Taxonomic Name Resolution Service, is an online application for automated and user-supervised standardization of plant scientific names. The TNRS builds upon and extends existing open-source applications for name parsing and fuzzy matching. Names are standardized against multiple reference taxonomies, including the Missouri Botanical Garden's Tropicos database. Capable of processing thousands of names in a single operation, the TNRS parses and corrects misspelled names and authorities, standardizes variant spellings, and converts nomenclatural synonyms to accepted names. Family names can be included to increase match accuracy and resolve many types of homonyms. Partial matching of higher taxa combined with extraction of annotations, accession numbers and morphospecies allows the TNRS to standardize taxonomy across a broad range of active and legacy datasets. CONCLUSIONS: We show how the TNRS can resolve many forms of taxonomic semantic heterogeneity, correct spelling errors and eliminate spurious names. As a result, the TNRS can aid the integration of disparate biological datasets. Although the TNRS was developed to aid in standardizing plant names, its underlying algorithms and design can be extended to all organisms and nomenclatural codes. The TNRS is accessible via a web interface at http://tnrs.iplantcollaborative.org/ and as a RESTful web service and application programming interface. Source code is available at https://github.com/iPlantCollaborativeOpenSource/TNRS/. |
format | Online Article Text |
id | pubmed-3554605 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3554605 2013-01-29 BMC Bioinformatics Software BioMed Central 2013-01-16 /pmc/articles/PMC3554605/ /pubmed/23324024 http://dx.doi.org/10.1186/1471-2105-14-16 Text en Copyright ©2013 Boyle et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
title | The taxonomic name resolution service: an online tool for automated standardization of plant names |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3554605/ https://www.ncbi.nlm.nih.gov/pubmed/23324024 http://dx.doi.org/10.1186/1471-2105-14-16 |
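
The description above says the TNRS corrects misspelled names by fuzzy matching against reference taxonomies and converts synonyms to accepted names. A minimal sketch of that general idea follows. It is not the TNRS's own code: it swaps in `SequenceMatcher` from Python's standard `difflib` module for the service's name parsing and fuzzy-matching components, and the three-entry `REFERENCE` table, the `normalize` and `resolve` helpers, and the 0.85 threshold are all assumptions made for illustration.

```python
from difflib import SequenceMatcher

# Toy reference taxonomy (illustrative only, not TNRS or Tropicos data):
# each known name maps to the name treated as accepted.
REFERENCE = {
    "Solanum lycopersicum": "Solanum lycopersicum",
    "Lycopersicon esculentum": "Solanum lycopersicum",  # synonym
    "Zea mays": "Zea mays",
}


def normalize(name):
    """Collapse whitespace, capitalize the genus, lowercase the rest."""
    parts = name.split()
    if not parts:
        return ""
    return " ".join([parts[0].capitalize()] + [p.lower() for p in parts[1:]])


def resolve(name, threshold=0.85):
    """Return (matched_name, accepted_name, score), or None if nothing is close enough.

    The threshold is an assumed cut-off, not a value taken from the TNRS.
    """
    query = normalize(name)
    best, best_score = None, 0.0
    for candidate in REFERENCE:
        # Similarity in [0, 1]; 1.0 means the strings are identical.
        score = SequenceMatcher(None, query, candidate).ratio()
        if score > best_score:
            best, best_score = candidate, score
    if best is None or best_score < threshold:
        return None
    return best, REFERENCE[best], round(best_score, 3)
```

Under these assumptions, `resolve("Lycopersicum esculentum")` matches the misspelled input to the synonym `Lycopersicon esculentum` and reports `Solanum lycopersicum` as the accepted name, while a name with no close entry in the toy table, such as `resolve("Quercus alba")`, returns `None`. A real service would also have to parse authorities and use family names to separate homonyms, which this sketch leaves out.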