Cargando…

Data capture in bioinformatics: requirements and experiences with Pedro

BACKGROUND: The systematic capture of appropriately annotated experimental data is a prerequisite for most bioinformatics analyses. Data capture is required not only for submission of data to public repositories, but also to underpin integrated analysis, archiving, and sharing – both within laborato...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jameson, Daniel, Garwood, Kevin, Garwood, Chris, Booth, Tim, Alper, Pinar, Oliver, Stephen G, Paton, Norman W
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Software
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2335277/ https://www.ncbi.nlm.nih.gov/pubmed/18402673 http://dx.doi.org/10.1186/1471-2105-9-183

_version_	1782152814692139008
author	Jameson, Daniel Garwood, Kevin Garwood, Chris Booth, Tim Alper, Pinar Oliver, Stephen G Paton, Norman W
author_facet	Jameson, Daniel Garwood, Kevin Garwood, Chris Booth, Tim Alper, Pinar Oliver, Stephen G Paton, Norman W
author_sort	Jameson, Daniel
collection	PubMed
description	BACKGROUND: The systematic capture of appropriately annotated experimental data is a prerequisite for most bioinformatics analyses. Data capture is required not only for submission of data to public repositories, but also to underpin integrated analysis, archiving, and sharing – both within laboratories and in collaborative projects. The widespread requirement to capture data means that data capture and annotation are taking place at many sites, but the small scale of the literature on tools, techniques and experiences suggests that there is work to be done to identify good practice and reduce duplication of effort. RESULTS: This paper reports on experience gained in the deployment of the Pedro data capture tool in a range of representative bioinformatics applications. The paper makes explicit the requirements that have recurred when capturing data in different contexts, indicates how these requirements are addressed in Pedro, and describes case studies that illustrate where the requirements have arisen in practice. CONCLUSION: Data capture is a fundamental activity for bioinformatics; all biological data resources build on some form of data capture activity, and many require a blend of import, analysis and annotation. Recurring requirements in data capture suggest that model-driven architectures can be used to construct data capture infrastructures that can be rapidly configured to meet the needs of individual use cases. We have described how one such model-driven infrastructure, namely Pedro, has been deployed in representative case studies, and discussed the extent to which the model-driven approach has been effective in practice.
format	Text
id	pubmed-2335277
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-23352772008-04-26 Data capture in bioinformatics: requirements and experiences with Pedro Jameson, Daniel Garwood, Kevin Garwood, Chris Booth, Tim Alper, Pinar Oliver, Stephen G Paton, Norman W BMC Bioinformatics Software BACKGROUND: The systematic capture of appropriately annotated experimental data is a prerequisite for most bioinformatics analyses. Data capture is required not only for submission of data to public repositories, but also to underpin integrated analysis, archiving, and sharing – both within laboratories and in collaborative projects. The widespread requirement to capture data means that data capture and annotation are taking place at many sites, but the small scale of the literature on tools, techniques and experiences suggests that there is work to be done to identify good practice and reduce duplication of effort. RESULTS: This paper reports on experience gained in the deployment of the Pedro data capture tool in a range of representative bioinformatics applications. The paper makes explicit the requirements that have recurred when capturing data in different contexts, indicates how these requirements are addressed in Pedro, and describes case studies that illustrate where the requirements have arisen in practice. CONCLUSION: Data capture is a fundamental activity for bioinformatics; all biological data resources build on some form of data capture activity, and many require a blend of import, analysis and annotation. Recurring requirements in data capture suggest that model-driven architectures can be used to construct data capture infrastructures that can be rapidly configured to meet the needs of individual use cases. We have described how one such model-driven infrastructure, namely Pedro, has been deployed in representative case studies, and discussed the extent to which the model-driven approach has been effective in practice. BioMed Central 2008-04-10 /pmc/articles/PMC2335277/ /pubmed/18402673 http://dx.doi.org/10.1186/1471-2105-9-183 Text en Copyright © 2008 Jameson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Software Jameson, Daniel Garwood, Kevin Garwood, Chris Booth, Tim Alper, Pinar Oliver, Stephen G Paton, Norman W Data capture in bioinformatics: requirements and experiences with Pedro
title	Data capture in bioinformatics: requirements and experiences with Pedro
title_full	Data capture in bioinformatics: requirements and experiences with Pedro
title_fullStr	Data capture in bioinformatics: requirements and experiences with Pedro
title_full_unstemmed	Data capture in bioinformatics: requirements and experiences with Pedro
title_short	Data capture in bioinformatics: requirements and experiences with Pedro
title_sort	data capture in bioinformatics: requirements and experiences with pedro
topic	Software
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2335277/ https://www.ncbi.nlm.nih.gov/pubmed/18402673 http://dx.doi.org/10.1186/1471-2105-9-183
work_keys_str_mv	AT jamesondaniel datacaptureinbioinformaticsrequirementsandexperienceswithpedro AT garwoodkevin datacaptureinbioinformaticsrequirementsandexperienceswithpedro AT garwoodchris datacaptureinbioinformaticsrequirementsandexperienceswithpedro AT boothtim datacaptureinbioinformaticsrequirementsandexperienceswithpedro AT alperpinar datacaptureinbioinformaticsrequirementsandexperienceswithpedro AT oliverstepheng datacaptureinbioinformaticsrequirementsandexperienceswithpedro AT patonnormanw datacaptureinbioinformaticsrequirementsandexperienceswithpedro

Data capture in bioinformatics: requirements and experiences with Pedro

Ejemplares similares