Cargando…

Big Data in Caenorhabditis elegans: quo vadis?

A clear definition of what constitutes “Big Data” is difficult to identify, but we find it most useful to define Big Data as a data collection that is complete. By this criterion, researchers on Caenorhabditis elegans have a long history of collecting Big Data, since the organism was selected with t...

Descripción completa

Detalles Bibliográficos
Autores principales: Hutter, Harald, Moerman, Donald
Formato: Online Artículo Texto
Lenguaje:English
Publicado: The American Society for Cell Biology 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4710222/
https://www.ncbi.nlm.nih.gov/pubmed/26543198
http://dx.doi.org/10.1091/mbc.E15-05-0312
_version_ 1782409786570047488
author Hutter, Harald
Moerman, Donald
author_facet Hutter, Harald
Moerman, Donald
author_sort Hutter, Harald
collection PubMed
description A clear definition of what constitutes “Big Data” is difficult to identify, but we find it most useful to define Big Data as a data collection that is complete. By this criterion, researchers on Caenorhabditis elegans have a long history of collecting Big Data, since the organism was selected with the idea of obtaining a complete biological description and understanding of development. The complete wiring diagram of the nervous system, the complete cell lineage, and the complete genome sequence provide a framework to phrase and test hypotheses. Given this history, it might be surprising that the number of “complete” data sets for this organism is actually rather small—not because of lack of effort, but because most types of biological experiments are not currently amenable to complete large-scale data collection. Many are also not inherently limited, so that it becomes difficult to even define completeness. At present, we only have partial data on mutated genes and their phenotypes, gene expression, and protein–protein interaction—important data for many biological questions. Big Data can point toward unexpected correlations, and these unexpected correlations can lead to novel investigations; however, Big Data cannot establish causation. As a result, there is much excitement about Big Data, but there is also a discussion on just what Big Data contributes to solving a biological problem. Because of its relative simplicity, C. elegans is an ideal test bed to explore this issue and at the same time determine what is necessary to build a multicellular organism from a single cell.
format Online
Article
Text
id pubmed-4710222
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher The American Society for Cell Biology
record_format MEDLINE/PubMed
spelling pubmed-47102222016-01-20 Big Data in Caenorhabditis elegans: quo vadis? Hutter, Harald Moerman, Donald Mol Biol Cell Big Data A clear definition of what constitutes “Big Data” is difficult to identify, but we find it most useful to define Big Data as a data collection that is complete. By this criterion, researchers on Caenorhabditis elegans have a long history of collecting Big Data, since the organism was selected with the idea of obtaining a complete biological description and understanding of development. The complete wiring diagram of the nervous system, the complete cell lineage, and the complete genome sequence provide a framework to phrase and test hypotheses. Given this history, it might be surprising that the number of “complete” data sets for this organism is actually rather small—not because of lack of effort, but because most types of biological experiments are not currently amenable to complete large-scale data collection. Many are also not inherently limited, so that it becomes difficult to even define completeness. At present, we only have partial data on mutated genes and their phenotypes, gene expression, and protein–protein interaction—important data for many biological questions. Big Data can point toward unexpected correlations, and these unexpected correlations can lead to novel investigations; however, Big Data cannot establish causation. As a result, there is much excitement about Big Data, but there is also a discussion on just what Big Data contributes to solving a biological problem. Because of its relative simplicity, C. elegans is an ideal test bed to explore this issue and at the same time determine what is necessary to build a multicellular organism from a single cell. The American Society for Cell Biology 2015-11-05 /pmc/articles/PMC4710222/ /pubmed/26543198 http://dx.doi.org/10.1091/mbc.E15-05-0312 Text en © 2015 Hutter and Moerman. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0). “ASCB®,” “The American Society for Cell Biology®,” and “Molecular Biology of the Cell®” are registered trademarks of The American Society for Cell Biology.
spellingShingle Big Data
Hutter, Harald
Moerman, Donald
Big Data in Caenorhabditis elegans: quo vadis?
title Big Data in Caenorhabditis elegans: quo vadis?
title_full Big Data in Caenorhabditis elegans: quo vadis?
title_fullStr Big Data in Caenorhabditis elegans: quo vadis?
title_full_unstemmed Big Data in Caenorhabditis elegans: quo vadis?
title_short Big Data in Caenorhabditis elegans: quo vadis?
title_sort big data in caenorhabditis elegans: quo vadis?
topic Big Data
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4710222/
https://www.ncbi.nlm.nih.gov/pubmed/26543198
http://dx.doi.org/10.1091/mbc.E15-05-0312
work_keys_str_mv AT hutterharald bigdataincaenorhabditiselegansquovadis
AT moermandonald bigdataincaenorhabditiselegansquovadis