Cargando…

Optimal Topology Search for Fast Model Averaging in Decentralized Parallel SGD

Distributed training of deep learning models on high-latency systems necessitates decentralized parallel SGD solutions. However, existing solutions suffer from slow convergence because of hand-crafted topologies. The question arises, “for decentralized parallel SGD, is it possible to learn a topolog...

Descripción completa

Detalles Bibliográficos
Autores principales:	Jameel, Mohsan, Jawed, Shayan, Schmidt-Thieme, Lars
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206308/ http://dx.doi.org/10.1007/978-3-030-47436-2_67

Internet

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7206308/
http://dx.doi.org/10.1007/978-3-030-47436-2_67

Optimal Topology Search for Fast Model Averaging in Decentralized Parallel SGD

Internet

Ejemplares similares