Cargando…
Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms
MOTIVATION: Since 2017, an increasing amount of attention has been paid to the supervised deep learning-based macromolecule in situ structural classification (i.e. subtomogram classification) in cellular electron cryo-tomography (CECT) due to the substantially higher scalability of deep learning. Ho...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612867/ https://www.ncbi.nlm.nih.gov/pubmed/31510673 http://dx.doi.org/10.1093/bioinformatics/btz364 |
_version_ | 1783432955018870784 |
---|---|
author | Lin, Ruogu Zeng, Xiangrui Kitani, Kris Xu, Min |
author_facet | Lin, Ruogu Zeng, Xiangrui Kitani, Kris Xu, Min |
author_sort | Lin, Ruogu |
collection | PubMed |
description | MOTIVATION: Since 2017, an increasing amount of attention has been paid to the supervised deep learning-based macromolecule in situ structural classification (i.e. subtomogram classification) in cellular electron cryo-tomography (CECT) due to the substantially higher scalability of deep learning. However, the success of such supervised approach relies heavily on the availability of large amounts of labeled training data. For CECT, creating valid training data from the same data source as prediction data is usually laborious and computationally intensive. It would be beneficial to have training data from a separate data source where the annotation is readily available or can be performed in a high-throughput fashion. However, the cross data source prediction is often biased due to the different image intensity distributions (a.k.a. domain shift). RESULTS: We adapt a deep learning-based adversarial domain adaptation (3D-ADA) method to timely address the domain shift problem in CECT data analysis. 3D-ADA first uses a source domain feature extractor to extract discriminative features from the training data as the input to a classifier. Then it adversarially trains a target domain feature extractor to reduce the distribution differences of the extracted features between training and prediction data. As a result, the same classifier can be directly applied to the prediction data. We tested 3D-ADA on both experimental and realistically simulated subtomogram datasets under different imaging conditions. 3D-ADA stably improved the cross data source prediction, as well as outperformed two popular domain adaptation methods. Furthermore, we demonstrate that 3D-ADA can improve cross data source recovery of novel macromolecular structures. AVAILABILITY AND IMPLEMENTATION: https://github.com/xulabs/projects SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-6612867 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-66128672019-07-12 Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms Lin, Ruogu Zeng, Xiangrui Kitani, Kris Xu, Min Bioinformatics Ismb/Eccb 2019 Conference Proceedings MOTIVATION: Since 2017, an increasing amount of attention has been paid to the supervised deep learning-based macromolecule in situ structural classification (i.e. subtomogram classification) in cellular electron cryo-tomography (CECT) due to the substantially higher scalability of deep learning. However, the success of such supervised approach relies heavily on the availability of large amounts of labeled training data. For CECT, creating valid training data from the same data source as prediction data is usually laborious and computationally intensive. It would be beneficial to have training data from a separate data source where the annotation is readily available or can be performed in a high-throughput fashion. However, the cross data source prediction is often biased due to the different image intensity distributions (a.k.a. domain shift). RESULTS: We adapt a deep learning-based adversarial domain adaptation (3D-ADA) method to timely address the domain shift problem in CECT data analysis. 3D-ADA first uses a source domain feature extractor to extract discriminative features from the training data as the input to a classifier. Then it adversarially trains a target domain feature extractor to reduce the distribution differences of the extracted features between training and prediction data. As a result, the same classifier can be directly applied to the prediction data. We tested 3D-ADA on both experimental and realistically simulated subtomogram datasets under different imaging conditions. 3D-ADA stably improved the cross data source prediction, as well as outperformed two popular domain adaptation methods. Furthermore, we demonstrate that 3D-ADA can improve cross data source recovery of novel macromolecular structures. AVAILABILITY AND IMPLEMENTATION: https://github.com/xulabs/projects SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2019-07 2019-07-05 /pmc/articles/PMC6612867/ /pubmed/31510673 http://dx.doi.org/10.1093/bioinformatics/btz364 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Ismb/Eccb 2019 Conference Proceedings Lin, Ruogu Zeng, Xiangrui Kitani, Kris Xu, Min Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
title | Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
title_full | Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
title_fullStr | Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
title_full_unstemmed | Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
title_short | Adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
title_sort | adversarial domain adaptation for cross data source macromolecule in situ structural classification in cellular electron cryo-tomograms |
topic | Ismb/Eccb 2019 Conference Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6612867/ https://www.ncbi.nlm.nih.gov/pubmed/31510673 http://dx.doi.org/10.1093/bioinformatics/btz364 |
work_keys_str_mv | AT linruogu adversarialdomainadaptationforcrossdatasourcemacromoleculeinsitustructuralclassificationincellularelectroncryotomograms AT zengxiangrui adversarialdomainadaptationforcrossdatasourcemacromoleculeinsitustructuralclassificationincellularelectroncryotomograms AT kitanikris adversarialdomainadaptationforcrossdatasourcemacromoleculeinsitustructuralclassificationincellularelectroncryotomograms AT xumin adversarialdomainadaptationforcrossdatasourcemacromoleculeinsitustructuralclassificationincellularelectroncryotomograms |