SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits (Extended Abstract)

SAMBA: A Generic Framework for Secure Federated Multi-Armed Bandits (Extended Abstract)

Radu Ciucanu, Pascal Lafourcade, Gael Marcadet, Marta Soare

Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
Journal Track. Pages 6863-6867. https://doi.org/10.24963/ijcai.2023/772

We tackle the problem of secure cumulative reward maximization in multi-armed bandits in a cross-silo federated learning setting. Under the orchestration of a central server, each data owner participating at the cumulative reward computation has the guarantee that its raw data is not seen by some other participant. We rely on cryptographic schemes and propose SAMBA, a generic framework for Secure federAted Multi-armed BAndits. We show that SAMBA returns the same cumulative reward as the non-secure versions of bandit algorithms, while satisfying formally proven security properties. We also show that the overhead due to cryptographic primitives is linear in the size of the input, which is confirmed by our implementation.
Keywords:
Multidisciplinary Topics and Applications: MDA: Security and privacy
Machine Learning: ML: Federated learning
Uncertainty in AI: UAI: Sequential decision making