Cargando…

Scalable and fault tolerant orthogonalization based on randomized distributed data aggregation

The construction of distributed algorithms for matrix computations built on top of distributed data aggregation algorithms with randomized communication schedules is investigated. For this purpose, a new aggregation algorithm for summing or averaging distributed values, the push-flow algorithm, is d...

Descripción completa

Detalles Bibliográficos
Autores principales: Gansterer, Wilfried N., Niederbrucker, Gerhard, Straková, Hana, Schulze Grotthoff, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3990450/
https://www.ncbi.nlm.nih.gov/pubmed/24748902
http://dx.doi.org/10.1016/j.jocs.2013.01.006
Descripción
Sumario:The construction of distributed algorithms for matrix computations built on top of distributed data aggregation algorithms with randomized communication schedules is investigated. For this purpose, a new aggregation algorithm for summing or averaging distributed values, the push-flow algorithm, is developed, which achieves superior resilience properties with respect to failures compared to existing aggregation methods. It is illustrated that on a hypercube topology it asymptotically requires the same number of iterations as the optimal all-to-all reduction operation and that it scales well with the number of nodes. Orthogonalization is studied as a prototypical matrix computation task. A new fault tolerant distributed orthogonalization method rdmGS, which can produce accurate results even in the presence of node failures, is built on top of distributed data aggregation algorithms.