Cargando…

Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation

BACKGROUND: Resolution estimation is the main evaluation criteria for the reconstruction of macromolecular 3D structure in the field of cryoelectron microscopy (cryo-EM). At present, there are many methods to evaluate the 3D resolution for reconstructed macromolecular structures from Single Particle...

Descripción completa

Detalles Bibliográficos
Autores principales: Lü, Yongchun, Zeng, Xiangrui, Tian, Xinhui, Shi, Xiao, Wang, Hui, Zheng, Xiaohui, Liu, Xiaodong, Zhao, Xiaofang, Gao, Xin, Xu, Min
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7495889/
https://www.ncbi.nlm.nih.gov/pubmed/32938398
http://dx.doi.org/10.1186/s12859-020-03680-6
_version_ 1783582981023072256
author Lü, Yongchun
Zeng, Xiangrui
Tian, Xinhui
Shi, Xiao
Wang, Hui
Zheng, Xiaohui
Liu, Xiaodong
Zhao, Xiaofang
Gao, Xin
Xu, Min
author_facet Lü, Yongchun
Zeng, Xiangrui
Tian, Xinhui
Shi, Xiao
Wang, Hui
Zheng, Xiaohui
Liu, Xiaodong
Zhao, Xiaofang
Gao, Xin
Xu, Min
author_sort Lü, Yongchun
collection PubMed
description BACKGROUND: Resolution estimation is the main evaluation criteria for the reconstruction of macromolecular 3D structure in the field of cryoelectron microscopy (cryo-EM). At present, there are many methods to evaluate the 3D resolution for reconstructed macromolecular structures from Single Particle Analysis (SPA) in cryo-EM and subtomogram averaging (SA) in electron cryotomography (cryo-ET). As global methods, they measure the resolution of the structure as a whole, but they are inaccurate in detecting subtle local changes of reconstruction. In order to detect the subtle changes of reconstruction of SPA and SA, a few local resolution methods are proposed. The mainstream local resolution evaluation methods are based on local Fourier shell correlation (FSC), which is computationally intensive. However, the existing resolution evaluation methods are based on multi-threading implementation on a single computer with very poor scalability. RESULTS: This paper proposes a new fine-grained 3D array partition method by key-value format in Spark. Our method first converts 3D images to key-value data (K-V). Then the K-V data is used for 3D array partitioning and data exchange in parallel. So Spark-based distributed parallel computing framework can solve the above scalability problem. In this distributed computing framework, all 3D local FSC tasks are simultaneously calculated across multiple nodes in a computer cluster. Through the calculation of experimental data, 3D local resolution evaluation algorithm based on Spark fine-grained 3D array partition has a magnitude change in computing speed compared with the mainstream FSC algorithm under the condition that the accuracy remains unchanged, and has better fault tolerance and scalability. CONCLUSIONS: In this paper, we proposed a K-V format based fine-grained 3D array partition method in Spark to parallel calculating 3D FSC for getting a 3D local resolution density map. 3D local resolution density map evaluates the three-dimensional density maps reconstructed from single particle analysis and subtomogram averaging. Our proposed method can significantly increase the speed of the 3D local resolution evaluation, which is important for the efficient detection of subtle variations among reconstructed macromolecular structures.
format Online
Article
Text
id pubmed-7495889
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-74958892020-09-23 Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation Lü, Yongchun Zeng, Xiangrui Tian, Xinhui Shi, Xiao Wang, Hui Zheng, Xiaohui Liu, Xiaodong Zhao, Xiaofang Gao, Xin Xu, Min BMC Bioinformatics Methodology BACKGROUND: Resolution estimation is the main evaluation criteria for the reconstruction of macromolecular 3D structure in the field of cryoelectron microscopy (cryo-EM). At present, there are many methods to evaluate the 3D resolution for reconstructed macromolecular structures from Single Particle Analysis (SPA) in cryo-EM and subtomogram averaging (SA) in electron cryotomography (cryo-ET). As global methods, they measure the resolution of the structure as a whole, but they are inaccurate in detecting subtle local changes of reconstruction. In order to detect the subtle changes of reconstruction of SPA and SA, a few local resolution methods are proposed. The mainstream local resolution evaluation methods are based on local Fourier shell correlation (FSC), which is computationally intensive. However, the existing resolution evaluation methods are based on multi-threading implementation on a single computer with very poor scalability. RESULTS: This paper proposes a new fine-grained 3D array partition method by key-value format in Spark. Our method first converts 3D images to key-value data (K-V). Then the K-V data is used for 3D array partitioning and data exchange in parallel. So Spark-based distributed parallel computing framework can solve the above scalability problem. In this distributed computing framework, all 3D local FSC tasks are simultaneously calculated across multiple nodes in a computer cluster. Through the calculation of experimental data, 3D local resolution evaluation algorithm based on Spark fine-grained 3D array partition has a magnitude change in computing speed compared with the mainstream FSC algorithm under the condition that the accuracy remains unchanged, and has better fault tolerance and scalability. CONCLUSIONS: In this paper, we proposed a K-V format based fine-grained 3D array partition method in Spark to parallel calculating 3D FSC for getting a 3D local resolution density map. 3D local resolution density map evaluates the three-dimensional density maps reconstructed from single particle analysis and subtomogram averaging. Our proposed method can significantly increase the speed of the 3D local resolution evaluation, which is important for the efficient detection of subtle variations among reconstructed macromolecular structures. BioMed Central 2020-09-17 /pmc/articles/PMC7495889/ /pubmed/32938398 http://dx.doi.org/10.1186/s12859-020-03680-6 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Lü, Yongchun
Zeng, Xiangrui
Tian, Xinhui
Shi, Xiao
Wang, Hui
Zheng, Xiaohui
Liu, Xiaodong
Zhao, Xiaofang
Gao, Xin
Xu, Min
Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation
title Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation
title_full Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation
title_fullStr Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation
title_full_unstemmed Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation
title_short Spark-based parallel calculation of 3D fourier shell correlation for macromolecule structure local resolution estimation
title_sort spark-based parallel calculation of 3d fourier shell correlation for macromolecule structure local resolution estimation
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7495889/
https://www.ncbi.nlm.nih.gov/pubmed/32938398
http://dx.doi.org/10.1186/s12859-020-03680-6
work_keys_str_mv AT luyongchun sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT zengxiangrui sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT tianxinhui sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT shixiao sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT wanghui sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT zhengxiaohui sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT liuxiaodong sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT zhaoxiaofang sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT gaoxin sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation
AT xumin sparkbasedparallelcalculationof3dfouriershellcorrelationformacromoleculestructurelocalresolutionestimation