Nanbeige4.1-3B: A Small General Model that Reasons, Aligns, and Acts Paper • 2602.13367 • Published 7 days ago • 20
SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training Paper • 2602.03411 • Published 17 days ago • 37
SWE-World: Building Software Engineering Agents in Docker-Free Environments Paper • 2602.03419 • Published 17 days ago • 39
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents Paper • 2602.02486 • Published 18 days ago • 18
SWE-Universe: Scale Real-World Verifiable Environments to Millions Paper • 2602.02361 • Published 18 days ago • 60
RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System Paper • 2602.02488 • Published 18 days ago • 32
TTCS: Test-Time Curriculum Synthesis for Self-Evolving Paper • 2601.22628 • Published 21 days ago • 35
DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents Paper • 2601.20975 • Published 23 days ago • 9