Cargando…

Massive Data Management and Sharing Module for Connectome Reconstruction

Recently, with the rapid development of electron microscopy (EM) technology and the increasing demand of neuron circuit reconstruction, the scale of reconstruction data grows significantly. This brings many challenges, one of which is how to effectively manage large-scale data so that researchers ca...

Descripción completa

Detalles Bibliográficos
Autores principales: Yuan, Jingbin, Zhang, Jing, Shen, Lijun, Zhang, Dandan, Yu, Wenhuan, Han, Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7288162/
https://www.ncbi.nlm.nih.gov/pubmed/32455914
http://dx.doi.org/10.3390/brainsci10050314
_version_ 1783545216947453952
author Yuan, Jingbin
Zhang, Jing
Shen, Lijun
Zhang, Dandan
Yu, Wenhuan
Han, Hua
author_facet Yuan, Jingbin
Zhang, Jing
Shen, Lijun
Zhang, Dandan
Yu, Wenhuan
Han, Hua
author_sort Yuan, Jingbin
collection PubMed
description Recently, with the rapid development of electron microscopy (EM) technology and the increasing demand of neuron circuit reconstruction, the scale of reconstruction data grows significantly. This brings many challenges, one of which is how to effectively manage large-scale data so that researchers can mine valuable information. For this purpose, we developed a data management module equipped with two parts, a storage and retrieval module on the server-side and an image cache module on the client-side. On the server-side, Hadoop and HBase are introduced to resolve massive data storage and retrieval. The pyramid model is adopted to store electron microscope images, which represent multiresolution data of the image. A block storage method is proposed to store volume segmentation results. We design a spatial location-based retrieval method for fast obtaining images and segments by layers rapidly, which achieves a constant time complexity. On the client-side, a three-level image cache module is designed to reduce latency when acquiring data. Through theoretical analysis and practical tests, our tool shows excellent real-time performance when handling large-scale data. Additionally, the server-side can be used as a backend of other similar software or a public database to manage shared datasets, showing strong scalability.
format Online
Article
Text
id pubmed-7288162
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-72881622020-06-17 Massive Data Management and Sharing Module for Connectome Reconstruction Yuan, Jingbin Zhang, Jing Shen, Lijun Zhang, Dandan Yu, Wenhuan Han, Hua Brain Sci Article Recently, with the rapid development of electron microscopy (EM) technology and the increasing demand of neuron circuit reconstruction, the scale of reconstruction data grows significantly. This brings many challenges, one of which is how to effectively manage large-scale data so that researchers can mine valuable information. For this purpose, we developed a data management module equipped with two parts, a storage and retrieval module on the server-side and an image cache module on the client-side. On the server-side, Hadoop and HBase are introduced to resolve massive data storage and retrieval. The pyramid model is adopted to store electron microscope images, which represent multiresolution data of the image. A block storage method is proposed to store volume segmentation results. We design a spatial location-based retrieval method for fast obtaining images and segments by layers rapidly, which achieves a constant time complexity. On the client-side, a three-level image cache module is designed to reduce latency when acquiring data. Through theoretical analysis and practical tests, our tool shows excellent real-time performance when handling large-scale data. Additionally, the server-side can be used as a backend of other similar software or a public database to manage shared datasets, showing strong scalability. MDPI 2020-05-22 /pmc/articles/PMC7288162/ /pubmed/32455914 http://dx.doi.org/10.3390/brainsci10050314 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yuan, Jingbin
Zhang, Jing
Shen, Lijun
Zhang, Dandan
Yu, Wenhuan
Han, Hua
Massive Data Management and Sharing Module for Connectome Reconstruction
title Massive Data Management and Sharing Module for Connectome Reconstruction
title_full Massive Data Management and Sharing Module for Connectome Reconstruction
title_fullStr Massive Data Management and Sharing Module for Connectome Reconstruction
title_full_unstemmed Massive Data Management and Sharing Module for Connectome Reconstruction
title_short Massive Data Management and Sharing Module for Connectome Reconstruction
title_sort massive data management and sharing module for connectome reconstruction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7288162/
https://www.ncbi.nlm.nih.gov/pubmed/32455914
http://dx.doi.org/10.3390/brainsci10050314
work_keys_str_mv AT yuanjingbin massivedatamanagementandsharingmoduleforconnectomereconstruction
AT zhangjing massivedatamanagementandsharingmoduleforconnectomereconstruction
AT shenlijun massivedatamanagementandsharingmoduleforconnectomereconstruction
AT zhangdandan massivedatamanagementandsharingmoduleforconnectomereconstruction
AT yuwenhuan massivedatamanagementandsharingmoduleforconnectomereconstruction
AT hanhua massivedatamanagementandsharingmoduleforconnectomereconstruction