Policy Gradient Algorithm for Multi-armed Bandits Robust to Adversarial Corruptions

Published in Published in the twenty-fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS) , 2025

Download paper here

Recommended citation: Jiayuan Liu, Siwei Wang, Zhixuan Fang, “Policy Gradient Algorithm for Multi-armed Bandits Robust to Adversarial Corruptions,” the twenty-fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2025.