Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed • Paper • 2512.14067 • Published 1 day ago • 4
MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives • Paper • 2512.14699 • Published about 19 hours ago • 13
WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling • Paper • 2512.14614 • Published about 20 hours ago • 43
QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management • Paper • 2512.12967 • Published 2 days ago • 80
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? • Paper • 2512.13281 • Published 2 days ago • 38
Exploring MLLM-Diffusion Information Transfer with MetaCanvas • Paper • 2512.11464 • Published 5 days ago • 11
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder • Paper • 2512.11749 • Published 5 days ago • 34
PersonaLive! Expressive Portrait Image Animation for Live Streaming • Paper • 2512.11253 • Published 5 days ago • 24
The N-Body Problem: Parallel Execution from Single-Person Egocentric Video • Paper • 2512.11393 • Published 5 days ago • 2
V-RGBX: Video Editing with Accurate Controls over Intrinsic Properties • Paper • 2512.11799 • Published 5 days ago • 29
MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos • Paper • 2512.10881 • Published 6 days ago • 27
Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale • Paper • 2512.10398 • Published 6 days ago • 5
Evaluating Gemini Robotics Policies in a Veo World Simulator • Paper • 2512.10675 • Published 6 days ago • 15
The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality • Paper • 2512.10791 • Published 6 days ago • 5