Cargando…

FuGEFlow: data model and markup language for flow cytometry

BACKGROUND: Flow cytometry technology is widely used in both health care and research. The rapid expansion of flow cytometry applications has outpaced the development of data storage and analysis tools. Collaborative efforts being taken to eliminate this gap include building common vocabularies and...

Descripción completa

Detalles Bibliográficos
Autores principales: Qian, Yu, Tchuvatkina, Olga, Spidlen, Josef, Wilkinson, Peter, Gasparetto, Maura, Jones, Andrew R, Manion, Frank J, Scheuermann, Richard H, Sekaly, Rafick-Pierre, Brinkman, Ryan R
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2711079/
https://www.ncbi.nlm.nih.gov/pubmed/19531228
http://dx.doi.org/10.1186/1471-2105-10-184
_version_ 1782169406657265664
author Qian, Yu
Tchuvatkina, Olga
Spidlen, Josef
Wilkinson, Peter
Gasparetto, Maura
Jones, Andrew R
Manion, Frank J
Scheuermann, Richard H
Sekaly, Rafick-Pierre
Brinkman, Ryan R
author_facet Qian, Yu
Tchuvatkina, Olga
Spidlen, Josef
Wilkinson, Peter
Gasparetto, Maura
Jones, Andrew R
Manion, Frank J
Scheuermann, Richard H
Sekaly, Rafick-Pierre
Brinkman, Ryan R
author_sort Qian, Yu
collection PubMed
description BACKGROUND: Flow cytometry technology is widely used in both health care and research. The rapid expansion of flow cytometry applications has outpaced the development of data storage and analysis tools. Collaborative efforts being taken to eliminate this gap include building common vocabularies and ontologies, designing generic data models, and defining data exchange formats. The Minimum Information about a Flow Cytometry Experiment (MIFlowCyt) standard was recently adopted by the International Society for Advancement of Cytometry. This standard guides researchers on the information that should be included in peer reviewed publications, but it is insufficient for data exchange and integration between computational systems. The Functional Genomics Experiment (FuGE) formalizes common aspects of comprehensive and high throughput experiments across different biological technologies. We have extended FuGE object model to accommodate flow cytometry data and metadata. METHODS: We used the MagicDraw modelling tool to design a UML model (Flow-OM) according to the FuGE extension guidelines and the AndroMDA toolkit to transform the model to a markup language (Flow-ML). We mapped each MIFlowCyt term to either an existing FuGE class or to a new FuGEFlow class. The development environment was validated by comparing the official FuGE XSD to the schema we generated from the FuGE object model using our configuration. After the Flow-OM model was completed, the final version of the Flow-ML was generated and validated against an example MIFlowCyt compliant experiment description. RESULTS: The extension of FuGE for flow cytometry has resulted in a generic FuGE-compliant data model (FuGEFlow), which accommodates and links together all information required by MIFlowCyt. The FuGEFlow model can be used to build software and databases using FuGE software toolkits to facilitate automated exchange and manipulation of potentially large flow cytometry experimental data sets. Additional project documentation, including reusable design patterns and a guide for setting up a development environment, was contributed back to the FuGE project. CONCLUSION: We have shown that an extension of FuGE can be used to transform minimum information requirements in natural language to markup language in XML. Extending FuGE required significant effort, but in our experiences the benefits outweighed the costs. The FuGEFlow is expected to play a central role in describing flow cytometry experiments and ultimately facilitating data exchange including public flow cytometry repositories currently under development.
format Text
id pubmed-2711079
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27110792009-07-16 FuGEFlow: data model and markup language for flow cytometry Qian, Yu Tchuvatkina, Olga Spidlen, Josef Wilkinson, Peter Gasparetto, Maura Jones, Andrew R Manion, Frank J Scheuermann, Richard H Sekaly, Rafick-Pierre Brinkman, Ryan R BMC Bioinformatics Correspondence BACKGROUND: Flow cytometry technology is widely used in both health care and research. The rapid expansion of flow cytometry applications has outpaced the development of data storage and analysis tools. Collaborative efforts being taken to eliminate this gap include building common vocabularies and ontologies, designing generic data models, and defining data exchange formats. The Minimum Information about a Flow Cytometry Experiment (MIFlowCyt) standard was recently adopted by the International Society for Advancement of Cytometry. This standard guides researchers on the information that should be included in peer reviewed publications, but it is insufficient for data exchange and integration between computational systems. The Functional Genomics Experiment (FuGE) formalizes common aspects of comprehensive and high throughput experiments across different biological technologies. We have extended FuGE object model to accommodate flow cytometry data and metadata. METHODS: We used the MagicDraw modelling tool to design a UML model (Flow-OM) according to the FuGE extension guidelines and the AndroMDA toolkit to transform the model to a markup language (Flow-ML). We mapped each MIFlowCyt term to either an existing FuGE class or to a new FuGEFlow class. The development environment was validated by comparing the official FuGE XSD to the schema we generated from the FuGE object model using our configuration. After the Flow-OM model was completed, the final version of the Flow-ML was generated and validated against an example MIFlowCyt compliant experiment description. RESULTS: The extension of FuGE for flow cytometry has resulted in a generic FuGE-compliant data model (FuGEFlow), which accommodates and links together all information required by MIFlowCyt. The FuGEFlow model can be used to build software and databases using FuGE software toolkits to facilitate automated exchange and manipulation of potentially large flow cytometry experimental data sets. Additional project documentation, including reusable design patterns and a guide for setting up a development environment, was contributed back to the FuGE project. CONCLUSION: We have shown that an extension of FuGE can be used to transform minimum information requirements in natural language to markup language in XML. Extending FuGE required significant effort, but in our experiences the benefits outweighed the costs. The FuGEFlow is expected to play a central role in describing flow cytometry experiments and ultimately facilitating data exchange including public flow cytometry repositories currently under development. BioMed Central 2009-06-16 /pmc/articles/PMC2711079/ /pubmed/19531228 http://dx.doi.org/10.1186/1471-2105-10-184 Text en Copyright © 2009 Qian et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Correspondence
Qian, Yu
Tchuvatkina, Olga
Spidlen, Josef
Wilkinson, Peter
Gasparetto, Maura
Jones, Andrew R
Manion, Frank J
Scheuermann, Richard H
Sekaly, Rafick-Pierre
Brinkman, Ryan R
FuGEFlow: data model and markup language for flow cytometry
title FuGEFlow: data model and markup language for flow cytometry
title_full FuGEFlow: data model and markup language for flow cytometry
title_fullStr FuGEFlow: data model and markup language for flow cytometry
title_full_unstemmed FuGEFlow: data model and markup language for flow cytometry
title_short FuGEFlow: data model and markup language for flow cytometry
title_sort fugeflow: data model and markup language for flow cytometry
topic Correspondence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2711079/
https://www.ncbi.nlm.nih.gov/pubmed/19531228
http://dx.doi.org/10.1186/1471-2105-10-184
work_keys_str_mv AT qianyu fugeflowdatamodelandmarkuplanguageforflowcytometry
AT tchuvatkinaolga fugeflowdatamodelandmarkuplanguageforflowcytometry
AT spidlenjosef fugeflowdatamodelandmarkuplanguageforflowcytometry
AT wilkinsonpeter fugeflowdatamodelandmarkuplanguageforflowcytometry
AT gasparettomaura fugeflowdatamodelandmarkuplanguageforflowcytometry
AT jonesandrewr fugeflowdatamodelandmarkuplanguageforflowcytometry
AT manionfrankj fugeflowdatamodelandmarkuplanguageforflowcytometry
AT scheuermannrichardh fugeflowdatamodelandmarkuplanguageforflowcytometry
AT sekalyrafickpierre fugeflowdatamodelandmarkuplanguageforflowcytometry
AT brinkmanryanr fugeflowdatamodelandmarkuplanguageforflowcytometry