Cargando…

Incorporating biological structure into machine learning models in biomedicine

In biomedical applications of machine learning, relevant information often has a rich structure that is not easily encoded as real-valued predictors. Examples of such data include DNA or RNA sequences, gene sets or pathways, gene interaction or coexpression networks, ontologies, and phylogenetic tre...

Descripción completa

Detalles Bibliográficos
Autores principales: Crawford, Jake, Greene, Casey S
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308204/
https://www.ncbi.nlm.nih.gov/pubmed/31962244
http://dx.doi.org/10.1016/j.copbio.2019.12.021
_version_ 1783548943420882944
author Crawford, Jake
Greene, Casey S
author_facet Crawford, Jake
Greene, Casey S
author_sort Crawford, Jake
collection PubMed
description In biomedical applications of machine learning, relevant information often has a rich structure that is not easily encoded as real-valued predictors. Examples of such data include DNA or RNA sequences, gene sets or pathways, gene interaction or coexpression networks, ontologies, and phylogenetic trees. We highlight recent examples of machine learning models that use structure to constrain model architecture or incorporate structured data into model training. For machine learning in biomedicine, where sample size is limited and model interpretability is crucial, incorporating prior knowledge in the form of structured data can be particularly useful. The area of research would benefit from performant open source implementations and independent benchmarking efforts.
format Online
Article
Text
id pubmed-7308204
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73082042020-06-23 Incorporating biological structure into machine learning models in biomedicine Crawford, Jake Greene, Casey S Curr Opin Biotechnol Article In biomedical applications of machine learning, relevant information often has a rich structure that is not easily encoded as real-valued predictors. Examples of such data include DNA or RNA sequences, gene sets or pathways, gene interaction or coexpression networks, ontologies, and phylogenetic trees. We highlight recent examples of machine learning models that use structure to constrain model architecture or incorporate structured data into model training. For machine learning in biomedicine, where sample size is limited and model interpretability is crucial, incorporating prior knowledge in the form of structured data can be particularly useful. The area of research would benefit from performant open source implementations and independent benchmarking efforts. 2020-01-18 2020-06 /pmc/articles/PMC7308204/ /pubmed/31962244 http://dx.doi.org/10.1016/j.copbio.2019.12.021 Text en This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Crawford, Jake
Greene, Casey S
Incorporating biological structure into machine learning models in biomedicine
title Incorporating biological structure into machine learning models in biomedicine
title_full Incorporating biological structure into machine learning models in biomedicine
title_fullStr Incorporating biological structure into machine learning models in biomedicine
title_full_unstemmed Incorporating biological structure into machine learning models in biomedicine
title_short Incorporating biological structure into machine learning models in biomedicine
title_sort incorporating biological structure into machine learning models in biomedicine
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7308204/
https://www.ncbi.nlm.nih.gov/pubmed/31962244
http://dx.doi.org/10.1016/j.copbio.2019.12.021
work_keys_str_mv AT crawfordjake incorporatingbiologicalstructureintomachinelearningmodelsinbiomedicine
AT greenecaseys incorporatingbiologicalstructureintomachinelearningmodelsinbiomedicine