flaviusburca's Collections
Papers
updated
PretrainZero: Reinforcement Active Pretraining
Paper • 2512.03442 • Published • 47
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Paper • 2512.03383 • Published • 4
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
Paper • 2511.21689 • Published • 111
Nemotron-Flash: Towards Latency-Optimal Hybrid Small Language Models
Paper • 2511.18890 • Published • 32
Every Token Counts: Generalizing 16M Ultra-Long Context in Large Language Models
Paper • 2511.23319 • Published • 22
WUSH: Near-Optimal Adaptive Transforms for LLM Quantization
Paper • 2512.00956 • Published • 19
CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication through Reinforcement Learning
Paper • 2512.02551 • Published • 12
Stabilizing Reinforcement Learning with LLMs: Formulation and Practices
Paper • 2512.01374 • Published • 95
LightRAG: Simple and Fast Retrieval-Augmented Generation
Paper • 2410.05779 • Published • 27
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 139
End-to-End Test-Time Training for Long Context
Paper • 2512.23675 • Published • 14