Cargando…

NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata

In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveyin...

Descripción completa

Detalles Bibliográficos
Autores principales: Vos, Rutger A., Balhoff, James P., Caravas, Jason A., Holder, Mark T., Lapp, Hilmar, Maddison, Wayne P., Midford, Peter E., Priyam, Anurag, Sukumaran, Jeet, Xia, Xuhua, Stoltzfus, Arlin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376374/
https://www.ncbi.nlm.nih.gov/pubmed/22357728
http://dx.doi.org/10.1093/sysbio/sys025
_version_ 1782235822454472704
author Vos, Rutger A.
Balhoff, James P.
Caravas, Jason A.
Holder, Mark T.
Lapp, Hilmar
Maddison, Wayne P.
Midford, Peter E.
Priyam, Anurag
Sukumaran, Jeet
Xia, Xuhua
Stoltzfus, Arlin
author_facet Vos, Rutger A.
Balhoff, James P.
Caravas, Jason A.
Holder, Mark T.
Lapp, Hilmar
Maddison, Wayne P.
Midford, Peter E.
Priyam, Anurag
Sukumaran, Jeet
Xia, Xuhua
Stoltzfus, Arlin
author_sort Vos, Rutger A.
collection PubMed
description In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input–output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML.
format Online
Article
Text
id pubmed-3376374
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-33763742012-06-18 NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata Vos, Rutger A. Balhoff, James P. Caravas, Jason A. Holder, Mark T. Lapp, Hilmar Maddison, Wayne P. Midford, Peter E. Priyam, Anurag Sukumaran, Jeet Xia, Xuhua Stoltzfus, Arlin Syst Biol Regular Articles In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input–output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML. Oxford University Press 2012-07 2012-02-22 /pmc/articles/PMC3376374/ /pubmed/22357728 http://dx.doi.org/10.1093/sysbio/sys025 Text en © The Author(s) 2012. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Vos, Rutger A.
Balhoff, James P.
Caravas, Jason A.
Holder, Mark T.
Lapp, Hilmar
Maddison, Wayne P.
Midford, Peter E.
Priyam, Anurag
Sukumaran, Jeet
Xia, Xuhua
Stoltzfus, Arlin
NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
title NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
title_full NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
title_fullStr NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
title_full_unstemmed NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
title_short NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
title_sort nexml: rich, extensible, and verifiable representation of comparative data and metadata
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376374/
https://www.ncbi.nlm.nih.gov/pubmed/22357728
http://dx.doi.org/10.1093/sysbio/sys025
work_keys_str_mv AT vosrutgera nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT balhoffjamesp nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT caravasjasona nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT holdermarkt nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT lapphilmar nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT maddisonwaynep nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT midfordpetere nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT priyamanurag nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT sukumaranjeet nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT xiaxuhua nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata
AT stoltzfusarlin nexmlrichextensibleandverifiablerepresentationofcomparativedataandmetadata