Personalized Federated X-Armed Bandit

Published in arXiv, 2024

Recommended citation: Wenjie Li, Jean Honorio, Qifan Song

[AISTATS] [arXiv]

In this work, we study the personalized federated X -armed bandit problem, where the heterogeneous local objectives of the clients are optimized simultaneously in the federated learning paradigm. We propose the PF-PNE algorithm with a unique double elimination strategy, which safely eliminates the non-optimal regions while encouraging federated collaboration through biased but effective evaluations of the local objectives. The proposed PF-PNE algorithm is able to optimize local objectives with arbitrary levels of heterogeneity, and its limited communications protects the confidentiality of the client-wise reward data. Our theoretical analysis shows the benefit of the proposed algorithm over single-client algorithms. Experimentally, PF-PNE outperforms multiple baselines on both synthetic and real-life datasets.