Policy Gradient Algorithm for Multi-armed Bandits Robust to Adversarial Corruptions
Published in Published in the twenty-fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS) , 2025
Recommended citation: Jiayuan Liu, Siwei Wang, Zhixuan Fang, “Policy Gradient Algorithm for Multi-armed Bandits Robust to Adversarial Corruptions,” the twenty-fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2025.
