Cargando…
Big Data in Caenorhabditis elegans: quo vadis?
A clear definition of what constitutes “Big Data” is difficult to identify, but we find it most useful to define Big Data as a data collection that is complete. By this criterion, researchers on Caenorhabditis elegans have a long history of collecting Big Data, since the organism was selected with t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
The American Society for Cell Biology
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4710222/ https://www.ncbi.nlm.nih.gov/pubmed/26543198 http://dx.doi.org/10.1091/mbc.E15-05-0312 |
_version_ | 1782409786570047488 |
---|---|
author | Hutter, Harald Moerman, Donald |
author_facet | Hutter, Harald Moerman, Donald |
author_sort | Hutter, Harald |
collection | PubMed |
description | A clear definition of what constitutes “Big Data” is difficult to identify, but we find it most useful to define Big Data as a data collection that is complete. By this criterion, researchers on Caenorhabditis elegans have a long history of collecting Big Data, since the organism was selected with the idea of obtaining a complete biological description and understanding of development. The complete wiring diagram of the nervous system, the complete cell lineage, and the complete genome sequence provide a framework to phrase and test hypotheses. Given this history, it might be surprising that the number of “complete” data sets for this organism is actually rather small—not because of lack of effort, but because most types of biological experiments are not currently amenable to complete large-scale data collection. Many are also not inherently limited, so that it becomes difficult to even define completeness. At present, we only have partial data on mutated genes and their phenotypes, gene expression, and protein–protein interaction—important data for many biological questions. Big Data can point toward unexpected correlations, and these unexpected correlations can lead to novel investigations; however, Big Data cannot establish causation. As a result, there is much excitement about Big Data, but there is also a discussion on just what Big Data contributes to solving a biological problem. Because of its relative simplicity, C. elegans is an ideal test bed to explore this issue and at the same time determine what is necessary to build a multicellular organism from a single cell. |
format | Online Article Text |
id | pubmed-4710222 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | The American Society for Cell Biology |
record_format | MEDLINE/PubMed |
spelling | pubmed-47102222016-01-20 Big Data in Caenorhabditis elegans: quo vadis? Hutter, Harald Moerman, Donald Mol Biol Cell Big Data A clear definition of what constitutes “Big Data” is difficult to identify, but we find it most useful to define Big Data as a data collection that is complete. By this criterion, researchers on Caenorhabditis elegans have a long history of collecting Big Data, since the organism was selected with the idea of obtaining a complete biological description and understanding of development. The complete wiring diagram of the nervous system, the complete cell lineage, and the complete genome sequence provide a framework to phrase and test hypotheses. Given this history, it might be surprising that the number of “complete” data sets for this organism is actually rather small—not because of lack of effort, but because most types of biological experiments are not currently amenable to complete large-scale data collection. Many are also not inherently limited, so that it becomes difficult to even define completeness. At present, we only have partial data on mutated genes and their phenotypes, gene expression, and protein–protein interaction—important data for many biological questions. Big Data can point toward unexpected correlations, and these unexpected correlations can lead to novel investigations; however, Big Data cannot establish causation. As a result, there is much excitement about Big Data, but there is also a discussion on just what Big Data contributes to solving a biological problem. Because of its relative simplicity, C. elegans is an ideal test bed to explore this issue and at the same time determine what is necessary to build a multicellular organism from a single cell. The American Society for Cell Biology 2015-11-05 /pmc/articles/PMC4710222/ /pubmed/26543198 http://dx.doi.org/10.1091/mbc.E15-05-0312 Text en © 2015 Hutter and Moerman. This article is distributed by The American Society for Cell Biology under license from the author(s). Two months after publication it is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0). “ASCB®,” “The American Society for Cell Biology®,” and “Molecular Biology of the Cell®” are registered trademarks of The American Society for Cell Biology. |
spellingShingle | Big Data Hutter, Harald Moerman, Donald Big Data in Caenorhabditis elegans: quo vadis? |
title | Big Data in Caenorhabditis elegans: quo vadis? |
title_full | Big Data in Caenorhabditis elegans: quo vadis? |
title_fullStr | Big Data in Caenorhabditis elegans: quo vadis? |
title_full_unstemmed | Big Data in Caenorhabditis elegans: quo vadis? |
title_short | Big Data in Caenorhabditis elegans: quo vadis? |
title_sort | big data in caenorhabditis elegans: quo vadis? |
topic | Big Data |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4710222/ https://www.ncbi.nlm.nih.gov/pubmed/26543198 http://dx.doi.org/10.1091/mbc.E15-05-0312 |
work_keys_str_mv | AT hutterharald bigdataincaenorhabditiselegansquovadis AT moermandonald bigdataincaenorhabditiselegansquovadis |