SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 6 days ago • 58
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 61
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 61
DR Tulu Collection Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated Nov 25, 2025 • 33
Alignment through Meta-Weighted Online Sampling: Bridging the Gap between Data Generation and Preference Optimization Paper • 2509.23371 • Published Sep 27, 2025 • 6
WebExplorer: Explore and Evolve for Training Long-Horizon Web Agents Paper • 2509.06501 • Published Sep 8, 2025 • 80