Cargando…

DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks

Development of automated analysis tools for “single ion channel” recording is hampered by the lack of available training data. For machine learning based tools, very large training sets are necessary with sample-by-sample point labelled data (e.g., 1 sample point every 100microsecond). In an experim...

Descripción completa

Detalles Bibliográficos
Autores principales: Ball, Sam T. M., Celik, Numan, Sayari, Elaheh, Abdul Kadir, Lina, O’Brien, Fiona, Barrett-Jolley, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9089889/
https://www.ncbi.nlm.nih.gov/pubmed/35536793
http://dx.doi.org/10.1371/journal.pone.0267452
_version_ 1784704611994566656
author Ball, Sam T. M.
Celik, Numan
Sayari, Elaheh
Abdul Kadir, Lina
O’Brien, Fiona
Barrett-Jolley, Richard
author_facet Ball, Sam T. M.
Celik, Numan
Sayari, Elaheh
Abdul Kadir, Lina
O’Brien, Fiona
Barrett-Jolley, Richard
author_sort Ball, Sam T. M.
collection PubMed
description Development of automated analysis tools for “single ion channel” recording is hampered by the lack of available training data. For machine learning based tools, very large training sets are necessary with sample-by-sample point labelled data (e.g., 1 sample point every 100microsecond). In an experimental context, such data are labelled with human supervision, and whilst this is feasible for simple experimental analysis, it is infeasible to generate the enormous datasets that would be necessary for a big data approach using hand crafting. In this work we aimed to develop methods to generate simulated ion channel data that is free from assumptions and prior knowledge of noise and underlying hidden Markov models. We successfully leverage generative adversarial networks (GANs) to build an end-to-end pipeline for generating an unlimited amount of labelled training data from a small, annotated ion channel “seed” record, and this needs no prior knowledge of theoretical dynamical ion channel properties. Our method utilises 2D CNNs to maintain the synchronised temporal relationship between the raw and idealised record. We demonstrate the applicability of the method with 5 different data sources and show authenticity with t-SNE and UMAP projection comparisons between real and synthetic data. The model would be easily extendable to other time series data requiring parallel labelling, such as labelled ECG signals or raw nanopore sequencing data.
format Online
Article
Text
id pubmed-9089889
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-90898892022-05-11 DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks Ball, Sam T. M. Celik, Numan Sayari, Elaheh Abdul Kadir, Lina O’Brien, Fiona Barrett-Jolley, Richard PLoS One Research Article Development of automated analysis tools for “single ion channel” recording is hampered by the lack of available training data. For machine learning based tools, very large training sets are necessary with sample-by-sample point labelled data (e.g., 1 sample point every 100microsecond). In an experimental context, such data are labelled with human supervision, and whilst this is feasible for simple experimental analysis, it is infeasible to generate the enormous datasets that would be necessary for a big data approach using hand crafting. In this work we aimed to develop methods to generate simulated ion channel data that is free from assumptions and prior knowledge of noise and underlying hidden Markov models. We successfully leverage generative adversarial networks (GANs) to build an end-to-end pipeline for generating an unlimited amount of labelled training data from a small, annotated ion channel “seed” record, and this needs no prior knowledge of theoretical dynamical ion channel properties. Our method utilises 2D CNNs to maintain the synchronised temporal relationship between the raw and idealised record. We demonstrate the applicability of the method with 5 different data sources and show authenticity with t-SNE and UMAP projection comparisons between real and synthetic data. The model would be easily extendable to other time series data requiring parallel labelling, such as labelled ECG signals or raw nanopore sequencing data. Public Library of Science 2022-05-10 /pmc/articles/PMC9089889/ /pubmed/35536793 http://dx.doi.org/10.1371/journal.pone.0267452 Text en © 2022 Ball et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ball, Sam T. M.
Celik, Numan
Sayari, Elaheh
Abdul Kadir, Lina
O’Brien, Fiona
Barrett-Jolley, Richard
DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
title DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
title_full DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
title_fullStr DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
title_full_unstemmed DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
title_short DeepGANnel: Synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
title_sort deepgannel: synthesis of fully annotated single molecule patch-clamp data using generative adversarial networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9089889/
https://www.ncbi.nlm.nih.gov/pubmed/35536793
http://dx.doi.org/10.1371/journal.pone.0267452
work_keys_str_mv AT ballsamtm deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks
AT celiknuman deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks
AT sayarielaheh deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks
AT abdulkadirlina deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks
AT obrienfiona deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks
AT barrettjolleyrichard deepgannelsynthesisoffullyannotatedsinglemoleculepatchclampdatausinggenerativeadversarialnetworks