Cargando…

openBIS: a flexible framework for managing and analyzing complex data in biology research

BACKGROUND: Modern data generation techniques used in distributed systems biology research projects often create datasets of enormous size and diversity. We argue that in order to overcome the challenge of managing those large quantitative datasets and maximise the biological information extracted f...

Descripción completa

Detalles Bibliográficos
Autores principales: Bauch, Angela, Adamczyk, Izabela, Buczek, Piotr, Elmer, Franz-Josef, Enimanev, Kaloyan, Glyzewski, Pawel, Kohler, Manuel, Pylak, Tomasz, Quandt, Andreas, Ramakrishnan, Chandrasekhar, Beisel, Christian, Malmström, Lars, Aebersold, Ruedi, Rinn, Bernd
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3275639/
https://www.ncbi.nlm.nih.gov/pubmed/22151573
http://dx.doi.org/10.1186/1471-2105-12-468
Descripción
Sumario:BACKGROUND: Modern data generation techniques used in distributed systems biology research projects often create datasets of enormous size and diversity. We argue that in order to overcome the challenge of managing those large quantitative datasets and maximise the biological information extracted from them, a sound information system is required. Ease of integration with data analysis pipelines and other computational tools is a key requirement for it. RESULTS: We have developed openBIS, an open source software framework for constructing user-friendly, scalable and powerful information systems for data and metadata acquired in biological experiments. openBIS enables users to collect, integrate, share, publish data and to connect to data processing pipelines. This framework can be extended and has been customized for different data types acquired by a range of technologies. CONCLUSIONS: openBIS is currently being used by several SystemsX.ch and EU projects applying mass spectrometric measurements of metabolites and proteins, High Content Screening, or Next Generation Sequencing technologies. The attributes that make it interesting to a large research community involved in systems biology projects include versatility, simplicity in deployment, scalability to very large data, flexibility to handle any biological data type and extensibility to the needs of any research domain.