Cargando…

Models of the Gene Must Inform Data-Mining Strategies in Genomics

The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, that is in cla...

Descripción completa

Detalles Bibliográficos
Autor principal: Huminiecki, Łukasz
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597212/
https://www.ncbi.nlm.nih.gov/pubmed/33286713
http://dx.doi.org/10.3390/e22090942
_version_ 1783602292250902528
author Huminiecki, Łukasz
author_facet Huminiecki, Łukasz
author_sort Huminiecki, Łukasz
collection PubMed
description The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, that is in classical genetics, population genetics, molecular genetics, genomics, and, recently, also, in systems genetics. Here, I ask how the diversity of the concept impacts data-integration and data-mining strategies for bioinformatics, genomics, statistical genetics, and data science. I also consider theoretical background of the concept of the gene in the ideas of empiricism and experimentalism, as well as reductionist and anti-reductionist narratives on the concept. Finally, a few strategies of analysis from published examples of data-mining projects are discussed. Moreover, the examples are re-interpreted in the light of the theoretical material. I argue that the choice of an optimal level of abstraction for the gene is vital for a successful genome analysis.
format Online
Article
Text
id pubmed-7597212
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75972122020-11-09 Models of the Gene Must Inform Data-Mining Strategies in Genomics Huminiecki, Łukasz Entropy (Basel) Review The gene is a fundamental concept of genetics, which emerged with the Mendelian paradigm of heredity at the beginning of the 20th century. However, the concept has since diversified. Somewhat different narratives and models of the gene developed in several sub-disciplines of genetics, that is in classical genetics, population genetics, molecular genetics, genomics, and, recently, also, in systems genetics. Here, I ask how the diversity of the concept impacts data-integration and data-mining strategies for bioinformatics, genomics, statistical genetics, and data science. I also consider theoretical background of the concept of the gene in the ideas of empiricism and experimentalism, as well as reductionist and anti-reductionist narratives on the concept. Finally, a few strategies of analysis from published examples of data-mining projects are discussed. Moreover, the examples are re-interpreted in the light of the theoretical material. I argue that the choice of an optimal level of abstraction for the gene is vital for a successful genome analysis. MDPI 2020-08-27 /pmc/articles/PMC7597212/ /pubmed/33286713 http://dx.doi.org/10.3390/e22090942 Text en © 2020 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Huminiecki, Łukasz
Models of the Gene Must Inform Data-Mining Strategies in Genomics
title Models of the Gene Must Inform Data-Mining Strategies in Genomics
title_full Models of the Gene Must Inform Data-Mining Strategies in Genomics
title_fullStr Models of the Gene Must Inform Data-Mining Strategies in Genomics
title_full_unstemmed Models of the Gene Must Inform Data-Mining Strategies in Genomics
title_short Models of the Gene Must Inform Data-Mining Strategies in Genomics
title_sort models of the gene must inform data-mining strategies in genomics
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7597212/
https://www.ncbi.nlm.nih.gov/pubmed/33286713
http://dx.doi.org/10.3390/e22090942
work_keys_str_mv AT huminieckiłukasz modelsofthegenemustinformdataminingstrategiesingenomics