Running 76 Unlocking On-Policy Distillation for Any Model Family 📝 76 Apply on-policy distillation to any model family
OpenGVLab/InternVL3_5-GPT-OSS-20B-A4B-Preview Image-Text-to-Text • 0.4B • Updated Aug 29, 2025 • 34k • 82
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • 71B • Updated Feb 24, 2025 • 157k • • 737