NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveyin...
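Main authors: | Vos, Rutger A.; Balhoff, James P.; Caravas, Jason A.; Holder, Mark T.; Lapp, Hilmar; Maddison, Wayne P.; Midford, Peter E.; Priyam, Anurag; Sukumaran, Jeet; Xia, Xuhua; Stoltzfus, Arlin |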
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Oxford University Press 2012 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376374/ https://www.ncbi.nlm.nih.gov/pubmed/22357728 http://dx.doi.org/10.1093/sysbio/sys025 |
_version_ | 1782235822454472704 |
---|---|
author | Vos, Rutger A. Balhoff, James P. Caravas, Jason A. Holder, Mark T. Lapp, Hilmar Maddison, Wayne P. Midford, Peter E. Priyam, Anurag Sukumaran, Jeet Xia, Xuhua Stoltzfus, Arlin |
author_facet | Vos, Rutger A. Balhoff, James P. Caravas, Jason A. Holder, Mark T. Lapp, Hilmar Maddison, Wayne P. Midford, Peter E. Priyam, Anurag Sukumaran, Jeet Xia, Xuhua Stoltzfus, Arlin |
author_sort | Vos, Rutger A. |
collection | PubMed |
description | In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input–output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML. |
format | Online Article Text |
id | pubmed-3376374 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-3376374 2012-06-18 NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata Vos, Rutger A. Balhoff, James P. Caravas, Jason A. Holder, Mark T. Lapp, Hilmar Maddison, Wayne P. Midford, Peter E. Priyam, Anurag Sukumaran, Jeet Xia, Xuhua Stoltzfus, Arlin Syst Biol Regular Articles In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input–output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML. Oxford University Press 2012-07 2012-02-22 /pmc/articles/PMC3376374/ /pubmed/22357728 http://dx.doi.org/10.1093/sysbio/sys025 Text en © The Author(s) 2012. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Regular Articles Vos, Rutger A. Balhoff, James P. Caravas, Jason A. Holder, Mark T. Lapp, Hilmar Maddison, Wayne P. Midford, Peter E. Priyam, Anurag Sukumaran, Jeet Xia, Xuhua Stoltzfus, Arlin NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata |
title | NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata |
title_full | NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata |
title_fullStr | NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata |
title_full_unstemmed | NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata |
title_short | NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata |
title_sort | nexml: rich, extensible, and verifiable representation of comparative data and metadata |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376374/ https://www.ncbi.nlm.nih.gov/pubmed/22357728 http://dx.doi.org/10.1093/sysbio/sys025 |
work_keys_str_mv | AT vosrutgera nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT balhoffjamesp nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT caravasjasona nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT holdermarkt nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT lapphilmar nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT maddisonwaynep nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT midfordpetere nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT priyamanurag nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT sukumaranjeet nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT xiaxuhua nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata AT stoltzfusarlin nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata |