SlidesGen-Bench: Evaluating Slides Generation via Computational and Quantitative Metrics Paper • 2601.09487 • Published 24 days ago
FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation Paper • 2602.03798 • Published 4 days ago
MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning Paper • 2510.14958 • Published Oct 16, 2025
GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models Paper • 2508.06471 • Published Aug 8, 2025
CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images Paper • 2510.11718 • Published Oct 13, 2025