Cargando…

A deep learning approach for staging embryonic tissue isolates with small data

Machine learning approaches are becoming increasingly widespread and are now present in most areas of research. Their recent surge can be explained in part due to our ability to generate and store enormous amounts of data with which to train these models. The requirement for large training sets is a...

Descripción completa

Detalles Bibliográficos
Autores principales: Pond, Adam Joseph Ronald, Hwang, Seongwon, Verd, Berta, Steventon, Benjamin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7793293/
https://www.ncbi.nlm.nih.gov/pubmed/33417603
http://dx.doi.org/10.1371/journal.pone.0244151
_version_ 1783633956947623936
author Pond, Adam Joseph Ronald
Hwang, Seongwon
Verd, Berta
Steventon, Benjamin
author_facet Pond, Adam Joseph Ronald
Hwang, Seongwon
Verd, Berta
Steventon, Benjamin
author_sort Pond, Adam Joseph Ronald
collection PubMed
description Machine learning approaches are becoming increasingly widespread and are now present in most areas of research. Their recent surge can be explained in part due to our ability to generate and store enormous amounts of data with which to train these models. The requirement for large training sets is also responsible for limiting further potential applications of machine learning, particularly in fields where data tend to be scarce such as developmental biology. However, recent research seems to indicate that machine learning and Big Data can sometimes be decoupled to train models with modest amounts of data. In this work we set out to train a CNN-based classifier to stage zebrafish tail buds at four different stages of development using small information-rich data sets. Our results show that two and three dimensional convolutional neural networks can be trained to stage developing zebrafish tail buds based on both morphological and gene expression confocal microscopy images, achieving in each case up to 100% test accuracy scores. Importantly, we show that high accuracy can be achieved with data set sizes of under 100 images, much smaller than the typical training set size for a convolutional neural net. Furthermore, our classifier shows that it is possible to stage isolated embryonic structures without the need to refer to classic developmental landmarks in the whole embryo, which will be particularly useful to stage 3D culture in vitro systems such as organoids. We hope that this work will provide a proof of principle that will help dispel the myth that large data set sizes are always required to train CNNs, and encourage researchers in fields where data are scarce to also apply ML approaches.
format Online
Article
Text
id pubmed-7793293
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77932932021-01-27 A deep learning approach for staging embryonic tissue isolates with small data Pond, Adam Joseph Ronald Hwang, Seongwon Verd, Berta Steventon, Benjamin PLoS One Research Article Machine learning approaches are becoming increasingly widespread and are now present in most areas of research. Their recent surge can be explained in part due to our ability to generate and store enormous amounts of data with which to train these models. The requirement for large training sets is also responsible for limiting further potential applications of machine learning, particularly in fields where data tend to be scarce such as developmental biology. However, recent research seems to indicate that machine learning and Big Data can sometimes be decoupled to train models with modest amounts of data. In this work we set out to train a CNN-based classifier to stage zebrafish tail buds at four different stages of development using small information-rich data sets. Our results show that two and three dimensional convolutional neural networks can be trained to stage developing zebrafish tail buds based on both morphological and gene expression confocal microscopy images, achieving in each case up to 100% test accuracy scores. Importantly, we show that high accuracy can be achieved with data set sizes of under 100 images, much smaller than the typical training set size for a convolutional neural net. Furthermore, our classifier shows that it is possible to stage isolated embryonic structures without the need to refer to classic developmental landmarks in the whole embryo, which will be particularly useful to stage 3D culture in vitro systems such as organoids. We hope that this work will provide a proof of principle that will help dispel the myth that large data set sizes are always required to train CNNs, and encourage researchers in fields where data are scarce to also apply ML approaches. Public Library of Science 2021-01-08 /pmc/articles/PMC7793293/ /pubmed/33417603 http://dx.doi.org/10.1371/journal.pone.0244151 Text en © 2021 Pond et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pond, Adam Joseph Ronald
Hwang, Seongwon
Verd, Berta
Steventon, Benjamin
A deep learning approach for staging embryonic tissue isolates with small data
title A deep learning approach for staging embryonic tissue isolates with small data
title_full A deep learning approach for staging embryonic tissue isolates with small data
title_fullStr A deep learning approach for staging embryonic tissue isolates with small data
title_full_unstemmed A deep learning approach for staging embryonic tissue isolates with small data
title_short A deep learning approach for staging embryonic tissue isolates with small data
title_sort deep learning approach for staging embryonic tissue isolates with small data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7793293/
https://www.ncbi.nlm.nih.gov/pubmed/33417603
http://dx.doi.org/10.1371/journal.pone.0244151
work_keys_str_mv AT pondadamjosephronald adeeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT hwangseongwon adeeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT verdberta adeeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT steventonbenjamin adeeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT pondadamjosephronald deeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT hwangseongwon deeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT verdberta deeplearningapproachforstagingembryonictissueisolateswithsmalldata
AT steventonbenjamin deeplearningapproachforstagingembryonictissueisolateswithsmalldata