Back to Search Start Over

Byzantine fault tolerance in distributed machine learning: a survey.

Authors :
Bouhata, Djamila
Moumen, Hamouma
Mazari, Jocelyn Ahmed
Bounceur, Ahcène
Source :
Journal of Experimental & Theoretical Artificial Intelligence. Sep2024, p1-59. 59p. 15 Illustrations.
Publication Year :
2024

Abstract

Byzantine Fault Tolerance (BFT) is crucial for ensuring the resilience of Distributed Machine Learning (DML) systems during training under adversarial conditions. Among the rising corpus of research on BFT in DML, there is no comprehensive classification of techniques or broad analysis of different approaches. This paper provides an in-depth survey of recent advancements in BFT for DML, with a focus on first-order optimisation methods, particularly, the popular one Stochastic Gradient Descent (SGD) during the training phase. We offer a novel classification of BFT approaches based on characteristics such as the communication process, optimisation method, and topology setting. This classification aims to enhance the understanding of various BFT methods and guide future research in addressing open challenges in the field. This work provides the foundations for developing robust BFT systems, using a variety of optimisation methods to strengthen resilience. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
0952813X
Database :
Academic Search Index
Journal :
Journal of Experimental & Theoretical Artificial Intelligence
Publication Type :
Academic Journal
Accession number :
179587607
Full Text :
https://doi.org/10.1080/0952813x.2024.2391778