Our new Agentic RL work CLEANER is accepted to ICLR 2026 SPOT Workshop!

Mar 2, 2026 1 min read

Our paper “CLEANER:Self-Purified Trajectories Boost Agentic Reinforcement Learning” is accepted to ICLR 2026 SPOT Workshop! [Paper]; [Code]. CLEANER resolves the credit assignment dilemma in agentic RL by training on self-purified trajectories, achieving SOTA performance with just one-third of the training cost.