Cargando…
DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
Development of automated analysis tools for “single ion channel” recording is hampered by the lack of available training data. For machine learning based tools, very large training sets are necessary with sample-by-sample point labelled data (e.g., 1 sample point every 100microsecond). In an experim...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9089889/ https://www.ncbi.nlm.nih.gov/pubmed/35536793 http://dx.doi.org/10.1371/journal.pone.0267452 |
_version_ | 1784704611994566656 |
---|---|
author | Ball, Sam T. M. Celik, Numan Sayari, Elaheh Abdul Kadir, Lina O’Brien, Fiona Barrett-Jolley, Richard |
author_facet | Ball, Sam T. M. Celik, Numan Sayari, Elaheh Abdul Kadir, Lina O’Brien, Fiona Barrett-Jolley, Richard |
author_sort | Ball, Sam T. M. |
collection | PubMed |
description | Development of automated analysis tools for “single ion channel” recording is hampered by the lack of available training data. For machine learning based tools, very large training sets are necessary with sample-by-sample point labelled data (e.g., 1 sample point every 100microsecond). In an experimental context, such data are labelled with human supervision, and whilst this is feasible for simple experimental analysis, it is infeasible to generate the enormous datasets that would be necessary for a big data approach using hand crafting. In this work we aimed to develop methods to generate simulated ion channel data that is free from assumptions and prior knowledge of noise and underlying hidden Markov models. We successfully leverage generative adversarial networks (GANs) to build an end-to-end pipeline for generating an unlimited amount of labelled training data from a small, annotated ion channel “seed” record, and this needs no prior knowledge of theoretical dynamical ion channel properties. Our method utilises 2D CNNs to maintain the synchronised temporal relationship between the raw and idealised record. We demonstrate the applicability of the method with 5 different data sources and show authenticity with t-SNE and UMAP projection comparisons between real and synthetic data. The model would be easily extendable to other time series data requiring parallel labelling, such as labelled ECG signals or raw nanopore sequencing data. |
format | Online Article Text |
id | pubmed-9089889 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-90898892022-05-11 DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks Ball, Sam T. M. Celik, Numan Sayari, Elaheh Abdul Kadir, Lina O’Brien, Fiona Barrett-Jolley, Richard PLoS One Research Article Development of automated analysis tools for “single ion channel” recording is hampered by the lack of available training data. For machine learning based tools, very large training sets are necessary with sample-by-sample point labelled data (e.g., 1 sample point every 100microsecond). In an experimental context, such data are labelled with human supervision, and whilst this is feasible for simple experimental analysis, it is infeasible to generate the enormous datasets that would be necessary for a big data approach using hand crafting. In this work we aimed to develop methods to generate simulated ion channel data that is free from assumptions and prior knowledge of noise and underlying hidden Markov models. We successfully leverage generative adversarial networks (GANs) to build an end-to-end pipeline for generating an unlimited amount of labelled training data from a small, annotated ion channel “seed” record, and this needs no prior knowledge of theoretical dynamical ion channel properties. Our method utilises 2D CNNs to maintain the synchronised temporal relationship between the raw and idealised record. We demonstrate the applicability of the method with 5 different data sources and show authenticity with t-SNE and UMAP projection comparisons between real and synthetic data. The model would be easily extendable to other time series data requiring parallel labelling, such as labelled ECG signals or raw nanopore sequencing data. Public Library of Science 2022-05-10 /pmc/articles/PMC9089889/ /pubmed/35536793 http://dx.doi.org/10.1371/journal.pone.0267452 Text en © 2022 Ball et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ball, Sam T. M. Celik, Numan Sayari, Elaheh Abdul Kadir, Lina O’Brien, Fiona Barrett-Jolley, Richard DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
title | DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
title_full | DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
title_fullStr | DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
title_full_unstemmed | DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
title_short | DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
title_sort | deepgannel: synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9089889/ https://www.ncbi.nlm.nih.gov/pubmed/35536793 http://dx.doi.org/10.1371/journal.pone.0267452 |
work_keys_str_mv | AT ballsamtm deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks AT celiknuman deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks AT sayarielaheh deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks AT abdulkadirlina deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks AT obrienfiona deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks AT barrettjolleyrichard deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks |