Policy Gradient Algorithm for Multi-armed Bandits Robust to Adversarial Corruptions

Published in Published in the twenty-fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS) , 2025

Download paper here

Recommended citation: Jiayuan Liu, Siwei Wang, Zhixuan Fang, “Policy Gradient Algorithm for Multi-armed Bandits Robust to Adversarial Corruptions,” the twenty-fourth International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2025.

Share on

Twitter Facebook LinkedIn

Jiayuan Liu 刘嘉源

Share on