Papers - a flaviusburca Collection

flaviusburca 's Collections

Papers

Papers

updated 4 days ago

PretrainZero: Reinforcement Active Pretraining

Paper • 2512.03442 • Published Dec 3, 2025 • 47
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs

Paper • 2512.03383 • Published Dec 3, 2025 • 4
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 111
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models

Paper • 2511.18890 • Published Nov 24, 2025 • 32
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models

Paper • 2511.23319 • Published Nov 28, 2025 • 22
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization

Paper • 2512.00956 • Published Nov 30, 2025 • 19
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning

Paper • 2512.02551 • Published Dec 2, 2025 • 12
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 95
LightRAG: Simple and Fast Retrieval-Augmented Generation

Paper • 2410.05779 • Published Oct 8, 2024 • 27
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 139
End-to-End Test-Time Training for Long Context

Paper • 2512.23675 • Published 6 days ago • 14