
Towards accelerating model parallelism in distributed deep learning systems

Modern deep neural networks often cannot be trained on a single GPU due to large model and data sizes. Model parallelism splits a model across multiple GPUs, but making it scalable and seamless is challenging because the GPUs must exchange different pieces of information, which incurs communication overhead. Specifi...
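As a rough illustration of the idea described in the abstract (not the paper's specific method), the sketch below places the two halves of a small MLP on separate GPUs and copies activations between them in the forward pass; the device-to-device copy is the communication overhead that scalable model parallelism tries to hide or reduce. It assumes two CUDA devices are available.

```python
import torch
import torch.nn as nn

class TwoGPUMLP(nn.Module):
    """Naive model parallelism: stage 0 on cuda:0, stage 1 on cuda:1."""
    def __init__(self, in_dim=1024, hidden=4096, out_dim=10):
        super().__init__()
        self.stage0 = nn.Sequential(nn.Linear(in_dim, hidden), nn.ReLU()).to("cuda:0")
        self.stage1 = nn.Linear(hidden, out_dim).to("cuda:1")

    def forward(self, x):
        h = self.stage0(x.to("cuda:0"))
        # Device-to-device activation copy: this transfer is the
        # inter-GPU communication that limits scalability.
        return self.stage1(h.to("cuda:1"))

model = TwoGPUMLP()
out = model(torch.randn(32, 1024))  # output tensor lives on cuda:1
print(out.shape)
```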


Bibliographic Details
Main Authors: Choi, Hyeonseong, Lee, Byung Hyun, Chun, Se Young, Lee, Jaehwan
Format: Online Article Text
Language: English
Published: Public Library of Science 2023
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10621816/
https://www.ncbi.nlm.nih.gov/pubmed/37917655
http://dx.doi.org/10.1371/journal.pone.0293338

Similar Items