Personalized Federated X-Armed Bandit
Published in AISTATS 2024, 2024
Recommended citation: Wenjie Li, Jean Honorio, Qifan Song. Personalized Federated X-Armed Bandit. Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024
In this work, we study the personalized federated X -armed bandit problem, where the heterogeneous local objectives of the clients are optimized simultaneously in the federated learning paradigm. We propose the PF-PNE algorithm with a unique double elimination strategy, which safely eliminates the non-optimal regions while encouraging federated collaboration through biased but effective evaluations of the local objectives. The proposed PF-PNE algorithm is able to optimize local objectives with arbitrary levels of heterogeneity, and its limited communications protects the confidentiality of the client-wise reward data. Our theoretical analysis shows the benefit of the proposed algorithm over single-client algorithms. Experimentally, PF-PNE outperforms multiple baselines on both synthetic and real-life datasets.